Internet Engineering Task Force A. Charny
Internet-Draft J. Zhang
Intended status: Informational Cisco Systems
Expires: January 8, 2010 G. Karagiannis
U. Twente
M. Menth
University of Wuerzburg
T. Taylor, Ed.
Huawei Technologies
July 7, 2009
PCN Boundary Node Behaviour for the Single Marking (SM) Mode of
Operation
draft-ietf-pcn-sm-edge-behaviour-00
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt.
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
This Internet-Draft will expire on January 8, 2010.
Copyright Notice
Copyright (c) 2009 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents in effect on the date of
publication of this document (http://trustee.ietf.org/license-info).
Please review these documents carefully, as they describe your rights
Charny, et al. Expires January 8, 2010 [Page 1]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
and restrictions with respect to this document.
Abstract
Precongestion notification (PCN) is a means for protecting quality of
service for inelastic traffic admitted to a Diffserv domain. The
overall PCN architecture is described in RFC 5559. This memo is one
of a series describing possible boundary node behaviours for a PCN
domain. The behaviour described here is that for two-state
measurement-based load control, known informally as Single Marking
(SM).
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1. Terminology . . . . . . . . . . . . . . . . . . . . . . . 3
2. Assumed Core Network Behaviour for SM . . . . . . . . . . . . 4
3. Node Behaviours . . . . . . . . . . . . . . . . . . . . . . . 5
3.1. Overview . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2. Behaviour of the PCN-Egress-Node . . . . . . . . . . . . . 5
3.2.1. PCN-Egress-Node Role In Flow Admission . . . . . . . . 6
3.2.2. PCN-Egress-Node Role In Flow Termination . . . . . . . 6
3.3. Behaviour of the PCN-Ingress-Node . . . . . . . . . . . . 7
3.3.1. PCN-Ingress-Node Role In Flow Admission . . . . . . . 7
3.3.2. PCN-Ingress-Node Role In Flow Termination . . . . . . 7
3.4. Possible Extension to the Basic Algorithm . . . . . . . . 7
4. Specification of Diffserv Per-Domain Behaviour . . . . . . . . 8
4.1. Applicability . . . . . . . . . . . . . . . . . . . . . . 8
4.2. Technical Specification . . . . . . . . . . . . . . . . . 9
4.3. Attributes . . . . . . . . . . . . . . . . . . . . . . . . 9
4.4. Parameters . . . . . . . . . . . . . . . . . . . . . . . . 9
4.5. Assumptions . . . . . . . . . . . . . . . . . . . . . . . 9
4.6. Example Uses . . . . . . . . . . . . . . . . . . . . . . . 9
4.7. Environmental Concerns . . . . . . . . . . . . . . . . . . 9
4.8. Security Considerations . . . . . . . . . . . . . . . . . 9
5. Security Considerations . . . . . . . . . . . . . . . . . . . 9
6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 10
7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 10
8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 10
8.1. Normative References . . . . . . . . . . . . . . . . . . . 10
8.2. Informative References . . . . . . . . . . . . . . . . . . 10
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 11
Charny, et al. Expires January 8, 2010 [Page 2]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
1. Introduction
The objective of Pre-Congestion Notification (PCN) is to protect the
quality of service (QoS) of inelastic flows within a Diffserv domain,
in a simple, scalable, and robust fashion. Two mechanisms are used:
admission control, to decide whether to admit or block a new flow
request, and (in abnormal circumstances) flow termination to decide
whether to terminate some of the existing flows. To achieve this,
the overall rate of PCN-traffic is metered on every link in the
domain, and PCN-packets are appropriately marked when certain
configured rates are exceeded. These configured rates are below the
rate of the link thus providing notification to boundary nodes about
overloads before any congestion occurs (hence "pre-congestion"
notification). The level of marking allows boundary nodes to make
decisions about whether to admit or terminate. For more details see
[RFC5559].
Boundary node behaviours specify a detailed set of algorithms and
edge node behaviours used to implement the PCN mechanisms. Since the
algorithms depend on specific metering and marking behaviour at the
interior nodes, it is also necessary to specify the assumptions made
about interior node behaviour. Finally, because PCN uses DSCP values
to carry its markings, a specification of boundary node behaviour
must include the per domain behaviour (PDB) template specified in
[RFC3086], filled out with the appropriate content. The present
document accomplishes these tasks for the Single Marking (SM) mode of
operation.
1.1. Terminology
In addition to the terms defined in [RFC5559], this document uses the
following terms:
Policy Decision Point (PDP)
The node that provides policy input regarding admission and
termination of flows.
PCN-admission-state
The state ("admit" or "block") derived by PCN-egress-node for a
given ingress-egress-aggregate based on PCN packet marking
statistics. The PCN-ingress-node admits or blocks new flows
offered to the aggregate based on the current value of the PCN-
admission-state. Individual decisions may be modified by policy
input from the PDP. For further details see Section 3.2.1 and
Section 3.3.1.
Charny, et al. Expires January 8, 2010 [Page 3]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
Congestion level estimate (CLE)
A value derived from the measurement of PCN packets received at a
PCN-egress-node for a given ingress-egress-aggregate, representing
the ratio of marked to total PCN traffic (measured in octets) over
a short period. In this specification the CLE is an exponentially
weighted moving average of the ratios observed in successive
fixed-length measurement intervals. For further details see
Section 3.2.1.
Admission decision threshold
A fractional value to which the CLE is compared to determine the
PCN-admission-state. If the CLE is below the admission decision
threshold the PCN-admission-state is set to "admit". If the CLE
is above the admission decision threshold the PCN-admission-state
is set to "block". For further details see Section 3.2.1.
Normal regime
The operating state of the PCN-egress-node with respect to a given
ingress-egress-aggregate during periods when no excess-traffic-
marked packets are received within that aggregate.
Excess traffic regime
The operating state of the PCN-egress-node with respect to a given
ingress-egress-aggregate during periods when excess-traffic-marked
packets are being received within that aggregate. The transition
from normal to excess traffic regime occurs when an excess-
traffic-marked packet is received within the given ingress-egress-
aggregate. The transition from excess traffic regime to normal
regime occurs when a complete measurement interval passes without
receipt of an excess-traffic-marked packet within the given
ingress-egress-aggregate. For further details see Section 3.2.2.
2. Assumed Core Network Behaviour for SM
This section describes the assumed behaviour for nodes of the PCN-
domain when acting in their role as PCN-interior-nodes. The SM mode
of operation assumes that:
o each link has been configured with a PCN-excess-rate having a
value equal to the PCN-admissible-rate for the link;
o PCN-interior-nodes perform excess-traffic-metering of packets
according to the rules specified in [ID.PCN-marking].
o excess-traffic-marking of packets uses the PCN-Marked (PM)
codepoint defined in [ID.PCN-baseline];
Charny, et al. Expires January 8, 2010 [Page 4]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
o no link PCN-threshold-rate is configured, and PCN-interior nodes
perform no threshold-metering.
3. Node Behaviours
3.1. Overview
The Single Marking (SM) mode of operation supports flow admission
based on the smoothed ratio of PCN-marked to total PCN-traffic
observed by the PCN-egress-node (the congestion level estimate, see
Section 1.1) for each ingress-egress-aggregate. When the PCN-
admission-state (see Section 1.1) for a given ingress-egress-
aggregate changes from "Admit" to "Block" or vice versa, the PCN-
egress-node reports this change. The PCN-ingress-node admits or
blocks new PCN flows offered to a given ingress-egress-aggregate
based on the PCN-admission-state, possibly modified by policy
direction from the Policy Decision Point (PDP).
The decision to terminate flows requires measurement data from both
the PCN-ingress-node and the PCN-egress-node. Hence while the the
PCN-admission-state is "block", the PCN-egress-node reports the
measured rate of flow of unmarked PCN-traffic it receives for each
ingress-egress-aggregate. If the admitted traffic rate measured at
the PCN-ingress-node exceeds the reported unmarked received PCN
traffic rate multiplied by a configured factor, flows are selected
for termination to reduce this difference to zero, with policy
guidance from the PDP. The PCN-ingress-node ceases to admit the
selected flows.
[Not sure what to do about identifying flows for ECMP]
3.2. Behaviour of the PCN-Egress-Node
For each ingress-egress-aggregate, the egress node continuously
measures the following quantities over successive intervals of equal
duration. That duration is suggested to be in the range of 100 to
500ms to provide a reasonable tradeoff between signalling demands on
the network and the time taken to react to impending congestion.
NM-count:
Number of octets of PCN-traffic contained in received packets
which are not PCN-marked.
PM-count:
Number of octets of PCN-traffic contained in received packets
which are PCN-marked.
Charny, et al. Expires January 8, 2010 [Page 5]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
3.2.1. PCN-Egress-Node Role In Flow Admission
At the end of each measurement interval, the egress node calculates a
ratio R. If both counts are zero for the interval, the ratio R is set
to zero. Otherwise, the egress node calculates the ratio as:
R = PM-count / (NM-count + PM-count).
The egress node then updates a congestion level estimate (CLE, see
Section 1.1) with this ratio using exponential smoothing:
new_CLE = k*R + (1-k)*old_CLE,
where k is a constant chosen to put most (say 80%) of the weight in
the accumulated average on the most recent 1 to 3 seconds of data.
The value of k thus depends on the length of the measurement
interval.
The next step is to examine the relationship of old-CLE and new_CLE
to a configured admission decision threshold (Section 1.1). If
old_CLE is above the threshold and new_CLE is below it, the egress
node reports that the PCN-admission-state is now "admit" for the
ingress-egress-aggregate. If old-CLE and new-CLE are both below the
threshold, no action is required. If new-CLE is above the threshold,
the PCN-admission-state is now "block" for the ingress-egress-
aggregate. The PCN-egress-node procedure in this case is described
in Section 3.2.2.
Note: In the case of SM, the CLE is an indication of where the
actual load is with respect to the PCN-admissible-rate. In fact,
a admission decision threshold of x implies that the expected
behavior of SM is to keep the mean load at the fraction x above
the PCN-admissible-rate. Hence with SM, the admission decision
threshold should be configured with a small value to avoid
unintended over-admission.
3.2.2. PCN-Egress-Node Role In Flow Termination
When the PCN-egress-node determines that the PCN-admission-state
computed on the basis of the updated CLE is "block", it generates a
report indicating the PCN-admission-state and providing the NM-count
normalized to a rate NM-rate in octets per second.
[Not sure what to do about identifying flows for ECMP.]
Charny, et al. Expires January 8, 2010 [Page 6]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
3.3. Behaviour of the PCN-Ingress-Node
The PCN-related functions of the PCN-ingress-node are described
briefly in section 4.2 of [RFC5559]. This section focusses on the
specific behaviour associated with admission and flow termination.
3.3.1. PCN-Ingress-Node Role In Flow Admission
When the PCN-ingress-node receives a report indicating that the PCN-
admission-state for a given ingress-egress-aggregate is "admit", it
admits new flows to that aggregate. When the PCN-ingress-node
receives a report indicating that the PCN-admission-state for a given
ingress-egress-aggregate is "block", it ceases to admit new flows to
that aggregate. These actions may be modified by policy input from
the Policy Decision Point (PDP).
3.3.2. PCN-Ingress-Node Role In Flow Termination
For each ingress-egress-aggregate, the ingress node continuously
measures the following quantity over successive intervals of equal
duration. That duration is suggested to be in the range of 100 to
500ms, and preferably the same as at the PCN-egress-node.
Sent-count:
Number of octets of PCN-traffic contained in PCN packets which are
admitted to the PCN domain.
When the PCN-ingress-node receives a report containing a value for
the unmarked PCN traffic rate NM-rate for a given ingress-egress-
aggregate, it takes the most recently observed value of Sent-count
and normalizes it to a rate Sent-rate in octets per second. It then
calculates the difference
Sent-rate - U * NM-rate,
where U is a configured network-wide constant. If this difference is
positive, it indicates a required reduction in the rate of admission
of PCN traffic to that ingress-egress-aggregate. Flows are selected
for termination with policy input from the PDP. The PCN-ingress-node
ceases to admit the selected flows.
If the computed difference is negative, the PCN-ingress-node takes no
further action.
3.4. Possible Extension to the Basic Algorithm
The termination mechanisms of SM and CL as described in [I-D.pcn-CL-
edge-behaviour] are both based on excess-rate metering and marking,
Charny, et al. Expires January 8, 2010 [Page 7]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
however, there is a subtle difference between the two mechanisms
stemming from the fact that in SM, the bottleneck condition with
respect to the PCN-supportable-rate is not directly conveyed through
the markings. SM meters against the PCN-admissible- rate and infers
the bottleneck condition based on excess-marked traffic. The
inference process is vulnerable to inaccuracies, such as non-
uniformity in the marking distribution, and may result in over-
termination, especially when ingress-egress aggregation is low (< 50
flows).
If SM is used in a low IE-aggregation enviroment, to mitigate this
problem, a possible extension to the basic algorithm is to implement
an additional control, based on smoothing, to counter the inaccuracy
in the interval measurements and to safeguard the triggering of
termination. One such control can be implemented with a CLE-like
value (referred to as CLE-t). Note, the CLE-t is computed in exactly
the same way as described in Section 3.2.1, only with a different
value of k (so that the termination is independent of the admission
decision). The CLE-t is then compared to the value (U-1)/U. If CLE-t
is smaller, no termination should be applied, even if the computed U
* NM-rate is smaller than the Sent-rate. Otherwise, the aggregate
compares the U * NM-rate to Sent-rate to see if (and how much) to
terminate as described in Section 3.2.2.
4. Specification of Diffserv Per-Domain Behaviour
This section provides the specification required by [RFC3086] for a
per-domain behaviour.
4.1. Applicability
This section draws heavily upon points made in the PCN architecture
document, [RFC5559].
The PCN SM boundary node behaviour specified in this document is
applicable to inelastic traffic (particularly video and voice) where
quality of service for admitted flows is protected primarily by
admission control at the ingress to the domain. In exceptional
circumstances (e.g. due to network failures) already-admitted flows
may be terminated to protect the quality of service of the remainder.
The SM boundary node behaviour is more likely to terminate too many
flows under such circumstances than some alternative PCN boundary
node behaviours.
Single-Marking requires no extension to the baseline PCN encoding
described in [ID.PCN-baseline], thus reducing the work expected to be
performed in the data path of the high-speed routing equipment, and
Charny, et al. Expires January 8, 2010 [Page 8]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
saving valuable real estate in the packet header.
4.2. Technical Specification
The technical specification of the PCN SM per domain behaviour is
provided by the contents of [RFC5559], [ID.PCN-baseline],
[ID.PCN-marking], and the present document.
4.3. Attributes
TBD -- basically low loss, low jitter. Low delay would be nice but
has to be quantified
4.4. Parameters
TBD. Don't think RFC 3068 is looking for the list of configurable
parameters given in the architecture document.
4.5. Assumptions
Assumed that a specific portion of link capacity has been reserved
for PCN traffic. Assumed that recovery from overloads by flow
termination should happen within 1-3 seconds.
4.6. Example Uses
The PCN SM behaviour may be used to carry real-time traffic,
particularly voice and video.
4.7. Environmental Concerns
In some markets, traffic preemption is considered to be
impermissible. In such environments, flow termination would not be
enabled.
4.8. Security Considerations
Please see the security considerations in Section 5 as well as those
in [RFC2474] and [RFC2475].
5. Security Considerations
[RFC5559] provides a general description of the security
considerations for PCN. This memo introduces no new considerations.
Charny, et al. Expires January 8, 2010 [Page 9]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
6. IANA Considerations
This memo includes no request to IANA.
7. Acknowledgements
Excluding the appendices, the content of this memo is drawn from
[ID.briscoe-CL]. The authors of that document were Bob Briscoe,
Philip Eardley, and Dave Songhurst of BT, Anna Charny and Francois Le
Faucheur of Cisco, Jozef Babiarz, Kwok Ho Chan, and Stephen Dudley of
Nortel, Giorgios Karagiannis of U. Twente and Ericsson, and Attila
Bader and Lars Westberg of Ericsson.
8. References
8.1. Normative References
[ID.PCN-baseline]
Moncaster, T., Briscoe, B., and M. Menth, "Baseline
Encoding and Transport of Pre-Congestion Information (Work
in progress)", May 2009.
[ID.PCN-marking]
Eardley, P., "Metering and marking behaviour of PCN-nodes
(Work in progress)", June 2009.
[RFC2474] Nichols, K., Blake, S., Baker, F., and D. Black,
"Definition of the Differentiated Services Field (DS
Field) in the IPv4 and IPv6 Headers", RFC 2474,
December 1998.
[RFC2475] Blake, S., Black, D., Carlson, M., Davies, E., Wang, Z.,
and W. Weiss, "An Architecture for Differentiated
Services", RFC 2475, December 1998.
[RFC5559] Eardley, P., "Pre-Congestion Notification (PCN)
Architecture", RFC 5559, June 2009.
8.2. Informative References
[ID.briscoe-CL]
Briscoe, B., "An edge-to-edge Deployment Model for Pre-
Congestion Notification: Admission Control over a
DiffServ Region (expired Internet Draft)", 2006.
[RFC3086] Nichols, K. and B. Carpenter, "Definition of
Charny, et al. Expires January 8, 2010 [Page 10]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
Differentiated Services Per Domain Behaviors and Rules for
their Specification", RFC 3086, April 2001.
Authors' Addresses
Anna Charny
Cisco Systems
300 Apollo Drive
Chelmsford, MA 01824
USA
Email: acharny@cisco.com
Xinyan (Joy) Zhang
Cisco Systems
300 Apollo Drive
Chelmsford, MA 01824
USA
Georgios Karagiannis
U. Twente
Phone:
Email: karagian@cs.utwente.nl
Michael Menth
University of Wuerzburg
Am Hubland
Wuerzburg D-97074
Germany
Phone: +49-931-888-6644
Email: menth@informatik.uni-wuerzburg.de
Charny, et al. Expires January 8, 2010 [Page 11]
Internet-Draft PCN SM Boundary Node Behaviour July 2009
Tom Taylor (editor)
Huawei Technologies
1852 Lorraine Ave
Ottawa, Ontario K1H 6Z8
Canada
Phone: +1 613 680 2675
Email: tom.taylor@rogers.com
Charny, et al. Expires January 8, 2010 [Page 12]