Inter-Domain Routing H. Gredler
Internet-Draft J. Medved
Intended status: Standards Track A. Farrel
Expires: March 24, 2012 Juniper Networks, Inc.
S. Previdi
Cisco Systems, Inc.
September 21, 2011
North-Bound Distribution of Link-State and TE Information using BGP
draft-gredler-idr-ls-distribution-00
Abstract
In a number of environments, a component external to a network is
called upon to perform computations based on the network topology and
current state of the connections within the network, including
traffic engineering information. This is information typically
distributed by IGP routing protocols within the network
This document describes a mechanism by which links state and traffic
engineering information can be collected from networks and shared
with external components using the BGP routing protocol. This is
achieved using a new BGP Network Layer Reachability Information
(NLRI) encoding format. The mechanism is applicable to physical and
virtual links. The mechanism described is subject to policy control.
Applications of this technique include Application Layer Traffic
Optimization (ALTO) servers, and Path Computation Elements (PCEs).
Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119]
Status of this Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
Gredler, et al. Expires March 24, 2012 [Page 1]
Internet-Draft Link-State Info Distribution using BGP September 2011
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on March 24, 2012.
Copyright Notice
Copyright (c) 2011 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Gredler, et al. Expires March 24, 2012 [Page 2]
Internet-Draft Link-State Info Distribution using BGP September 2011
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 5
2. Motivation and Applicability . . . . . . . . . . . . . . . . . 6
2.1. MPLS-TE with PCE . . . . . . . . . . . . . . . . . . . . . 6
2.2. ALTO Server Network API . . . . . . . . . . . . . . . . . 8
3. Transcoding Link State Information into a BGP NLRI . . . . . . 9
3.1. NLRI Format . . . . . . . . . . . . . . . . . . . . . . . 9
3.2. TLV Format . . . . . . . . . . . . . . . . . . . . . . . . 11
3.3. Node Descriptors . . . . . . . . . . . . . . . . . . . . . 12
3.3.1. Local Node Descriptors . . . . . . . . . . . . . . . . 12
3.3.2. Remote Node Descriptors . . . . . . . . . . . . . . . 13
3.3.3. Node Descriptor Sub-TLVs . . . . . . . . . . . . . . . 13
3.3.4. Router-ID Anchoring Example: ISO Pseudonode . . . . . 14
3.3.5. Router-ID Anchoring Example: OSPFv2 to IS-IS
Migration . . . . . . . . . . . . . . . . . . . . . . 14
3.4. Link Descriptors . . . . . . . . . . . . . . . . . . . . . 14
3.5. Multi Topology ID TLV . . . . . . . . . . . . . . . . . . 15
3.6. Link Attributes . . . . . . . . . . . . . . . . . . . . . 15
3.6.1. MPLS Protocol TLV . . . . . . . . . . . . . . . . . . 16
3.6.2. TE Default Metric TLV . . . . . . . . . . . . . . . . 17
3.6.3. IGP Link Metric TLV . . . . . . . . . . . . . . . . . 17
3.6.4. Shared Risk Link Group TLV . . . . . . . . . . . . . . 18
3.6.5. OSPF specific link attribute TLV . . . . . . . . . . . 18
3.6.6. IS-IS specific link attribute TLV . . . . . . . . . . 19
3.6.7. Link Area TLV . . . . . . . . . . . . . . . . . . . . 19
3.7. Node Attributes . . . . . . . . . . . . . . . . . . . . . 20
3.7.1. Multi Topology Node TLV . . . . . . . . . . . . . . . 20
3.7.2. Node Flag Bits TLV . . . . . . . . . . . . . . . . . . 21
3.7.3. OSPF Specific Node Properties TLV . . . . . . . . . . 21
3.7.4. IS-IS Specific Node Properties TLV . . . . . . . . . . 22
3.7.5. Area Node TLV . . . . . . . . . . . . . . . . . . . . 22
3.8. Inter-AS Links . . . . . . . . . . . . . . . . . . . . . . 23
4. Link to Path Aggregation . . . . . . . . . . . . . . . . . . . 23
4.1. Example: No Link Aggregation . . . . . . . . . . . . . . . 23
4.2. Example: ASBR to ASBR Path Aggregation . . . . . . . . . . 24
4.3. Example: Multi-AS Path Aggregation . . . . . . . . . . . . 24
5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 25
6. Manageability Considerations . . . . . . . . . . . . . . . . . 25
6.1. Operational Considerations . . . . . . . . . . . . . . . . 25
6.1.1. Operations . . . . . . . . . . . . . . . . . . . . . . 25
6.1.2. Installation and Initial Setup . . . . . . . . . . . . 25
6.1.3. Migration Path . . . . . . . . . . . . . . . . . . . . 26
6.1.4. Requirements on Other Protocols and Functional
Components . . . . . . . . . . . . . . . . . . . . . . 26
6.1.5. Impact on Network Operation . . . . . . . . . . . . . 26
6.1.6. Verifying Correct Operation . . . . . . . . . . . . . 26
6.2. Management Considerations . . . . . . . . . . . . . . . . 26
Gredler, et al. Expires March 24, 2012 [Page 3]
Internet-Draft Link-State Info Distribution using BGP September 2011
6.2.1. Management Information . . . . . . . . . . . . . . . . 26
6.2.2. Fault Management . . . . . . . . . . . . . . . . . . . 26
6.2.3. Configuration Management . . . . . . . . . . . . . . . 26
6.2.4. Accounting Management . . . . . . . . . . . . . . . . 27
6.2.5. Performance Management . . . . . . . . . . . . . . . . 27
6.2.6. Security Management . . . . . . . . . . . . . . . . . 27
7. Security Considerations . . . . . . . . . . . . . . . . . . . 27
8. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 28
9. References . . . . . . . . . . . . . . . . . . . . . . . . . . 28
9.1. Normative References . . . . . . . . . . . . . . . . . . . 28
9.2. Informative References . . . . . . . . . . . . . . . . . . 29
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 30
Gredler, et al. Expires March 24, 2012 [Page 4]
Internet-Draft Link-State Info Distribution using BGP September 2011
1. Introduction
The contents of a Link State Database (LSDB) or a Traffic Engineering
Database (TED) has the scope of an IGP area. Some applications, such
as end-to-end Traffic Engineering (TE), would benefit from visibility
outside one area or Autonomous System (AS) in order to make better
decisions.
The IETF has defined the Path Computation Element (PCE) [RFC4655] as
a mechanism for achieving the computation of end-to-end TE paths that
cross the visibility of more than one TED or which require CPU-
intensive or coordinated computations. The IETF has also defined the
ALTO Server [RFC5693] as an entity that generates an abstracted
network topology and provides it to network-aware applications.
Both a PCE and an ALTO Server need to gather information about the
topologies and capabilities of the network in order to be able to
fulfill their function
This document describes a mechanism by which Link State and TE
information can be collected from networks and shared with external
components using the BGP routing protocol [RFC4271]. This is
achieved using a new BGP Network Layer Reachability Information
(NLRI) encoding format. The mechanism is applicable to physical and
virtual links. The mechanism described is subject to policy control.
A router maintains one or more databases for storing link-state
information about nodes and links in any given area. Link attributes
stored in these databases include: local/remote IP addresses, local/
remote interface identifiers, link metric and TE metric, link
bandwidth, reservable bandwidth, per CoS class reservation state,
preemption and Shared Risk Link Groups (SRLG). The router's BGP
process can retrieve topology from these LSDBs and distribute it to a
consumer, either directly or via a peer BGP Speaker (typically a
dedicated Route Reflector), using the encoding specified in this
document.
The collection of Link State and TE link state information and its
distribution to consumers is shown in the following figure.
Gredler, et al. Expires March 24, 2012 [Page 5]
Internet-Draft Link-State Info Distribution using BGP September 2011
+-----------+
| Consumer |
+-----------+
^
|
+-----------+
| BGP | +-----------+
| Speaker | | Consumer |
+-----------+ +-----------+
^ ^ ^ ^
| | | |
+---------------+ | +-------------------+ |
| | | |
+-----------+ +-----------+ +-----------+
| BGP | | BGP | | BGP |
| Speaker | | Speaker | . . . | Speaker |
+-----------+ +-----------+ +-----------+
^ ^ ^
| | |
IGP IGP IGP
Figure 1: TE Link State info collection
A BGP Speaker may apply configurable policy to the information that
it distributes. Thus, it may distribute the real physical topology
from the LSDB or the TED. Alternatively, it may create an abstracted
topology, where virtual, aggregated nodes are connected by virtual
paths. Aggregated nodes can be created, for example, out of multiple
routers in a POP. Abstracted topology can also be a mix of physical
and virtual nodes and physical and virtual links. Furthermore, the
BGP Speaker can apply policy to determine when information is updated
to the consumer so that there is reduction of information flow form
the network to the consumers. Mechanisms through which topologies
can be aggregated or virtualized are outside the scope of this
document
2. Motivation and Applicability
This section describes uses cases from which the requirements can be
derived.
2.1. MPLS-TE with PCE
As described in [RFC4655] a PCE can be used to compute MPLS-TE paths
within a "domain" (such as an IGP area) or across multiple domains
(such as a multi-area AS, or multiple ASes).
Gredler, et al. Expires March 24, 2012 [Page 6]
Internet-Draft Link-State Info Distribution using BGP September 2011
o Within a single area, the PCE offers enhanced computational power
that may not be available on individual routers, sophisticated
policy control and algorithms, and coordination of computation
across the whole area.
o If a router wants to compute a MPLS-TE path across IGP areas its
own TED lacks visibility of the complete topology. That means
that the router cannot determine the end-to-end path, and cannot
even select the right exit router (Area Border Router - ABR) for
an optimal path. This is an issue for large-scale networks that
need to segment their core networks into distinct areas, but which
still want to take advantage of MPLS-TE.
Previous solutions used per-domain path computation [RFC5152]. The
source router could only compute the path for the first area because
the router only has full topological visibility for the first area
along the path, but not for subsequent areas. Per-domain path
computation uses a technique called "loose-hop-expansion" [RFC3209],
and selects the exit ABR and other ABRs or AS Border Routers (ASBRs)
using the IGP computed shortest path topology for the remainder of
the path. This may lead to sub-optimal paths, makes alternate/
back-up path computation hard, and might result in no TE path being
found when one really does exist.
The PCE presents a computation server that may have visibility into
more than one IGP area or AS, or may cooperate with other PCEs to
perform distributed path computation. The PCE obviously needs access
to the TED for the area(s) it serves, but [RFC4655] does not describe
how this is achieved. Many implementations make the PCE a passive
participant in the IGP so that it can learn the latest state of the
network, but this may be sub-optimal when the network is subject to a
high degree of churn, or when the PCE is responsible for multiple
areas.
The following figure shows how a PCE can get its TED information
using the mechanism described in this document.
Gredler, et al. Expires March 24, 2012 [Page 7]
Internet-Draft Link-State Info Distribution using BGP September 2011
+----------+ +---------+
| ----- | | BGP |
| | TED |<-+-------------------------->| Speaker |
| ----- | TED synchronization | |
| | | mechanism: +---------+
| | | BGP with Link-State NLRI
| v |
| ----- |
| | PCE | |
| ----- |
+----------+
^
| Request/
| Response
v
Service +----------+ Signaling +----------+
Request | Head-End | Protocol | Adjacent |
-------->| Node |<------------>| Node |
+----------+ +----------+
Figure 2: External PCE node using a TED synchronization mechanism
The mechanism in this document allows the necessary TED information
to be collected from the IGP within the network, filtered according
to configurable policy, and distributed to the PCE as necessary.
2.2. ALTO Server Network API
An ALTO Server [RFC5693] is an entity that generates an abstracted
network topology and provides it to network-aware applications over a
web service based API. Example applications are p2p clients or
trackers, or CDNs. The abstracted network topology comes in the form
of two maps: a Network Map that specifies allocation of prefixes to
PIDs, and a Cost Map that specifies the cost between PIDs listed in
the Network Map. For more details, see [I-D.ietf-alto-protocol].
ALTO abstract network topologies can be auto-generated from the
physical topology of the underlying network. The generation would
typically be based on policies and rules set by the operator. Both
prefix and TE data are required: prefix data is required to generate
ALTO Network Maps, TE (topology) data is required to generate ALTO
Cost Maps. Prefix data is carried and originated in BGP, TE data is
originated and carried in an IGP. The mechanism defined in this
document provides a single interface through which an ALTO Server can
retrieve all the necessary prefix and network topology data from the
underlying network. Note an ALTO Server can use other mechanisms to
get network data, for example, peering with multiple IGP and BGP
Speakers.
Gredler, et al. Expires March 24, 2012 [Page 8]
Internet-Draft Link-State Info Distribution using BGP September 2011
The following figure shows how an ALTO Server can get network
topology information from the underlying network using the mechanism
described in this document.
+--------+
| Client |<--+
+--------+ |
| ALTO +--------+ BGP with +---------+
+--------+ | Protocol | ALTO | Link-State NLRI | BGP |
| Client |<--+------------| Server |<----------------| Speaker |
+--------+ | | | | |
| +--------+ +---------+
+--------+ |
| Client |<--+
+--------+
Figure 3: ALTO Server using network topology information
3. Transcoding Link State Information into a BGP NLRI
The MP_REACH and MP_UNREACH attributes are BGP's containers for
carrying opaque information. Each Link State NLRI describes either a
single node or link.
All link and node information SHALL be encoded using a TBD AFI / SAFI
1 or SAFI 128 header into those attributes. SAFI 1 SHALL be used for
Internet routing (Public) and SAFI 128 SHALL be used for VPN routing
(Private) applications.
In order for two BGP speakers to exchange Link-State NLRI, they MUST
use BGP Capabilities Advertisement to ensure that they both are
capable of properly processing such NLRI. This is done as specified
in [RFC4760], by using capability code 1 (multi-protocol BGP), with
an AFI of TBD and an SAFI of 1 or 128.
3.1. NLRI Format
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| NLRI Type | Total NLRI Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Link-State NLRI (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Gredler, et al. Expires March 24, 2012 [Page 9]
Internet-Draft Link-State Info Distribution using BGP September 2011
Figure 4: Link State SAFI 1 NLRI Format
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| NLRI Type | Total NLRI Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
+ Route Distinguisher +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Link-State NLRI (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 5: Link State SAFI 128 NLRI Format
The 'Total NLRI Length' field contains the cumulative length of all
the TLVs in the NLRI. For VPN applications it also includes the
length of the Route Distinguisher.
The 'NLRI Type' field can contain one of the following values:
Type = 1: Link NLRI, contains link descriptors and link attributes
Type = 2: Node NLRI, contains node attributes
The Link NLRI (NLRI Type = 1) is shown in the following figure.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Protocol-ID | Reserved | Instance Identifier |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Local Node Descriptors (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Remote Node Descriptors (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Link Descriptors (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Link Attributes (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 6: The Link NLRI format
The Node NLRI (NLRI Type = 2) is shown in the following figure.
Gredler, et al. Expires March 24, 2012 [Page 10]
Internet-Draft Link-State Info Distribution using BGP September 2011
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Protocol-ID | Reserved | Instance Identifier |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Local Node Descriptors (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Node Attributes (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 7: The Node NLRI format
The 'Protocol-ID' field can contain one of the following values:
Type = 0: Unknown, The source of NLRI information could not be
determined
Type = 1: IS-IS Level 1, The NLRI information has been sourced by
IS-IS Level 1
Type = 2: IS-IS Level 2, The NLRI information has been sourced by
IS-IS Level 2
Type = 3: OSPF, The NLRI information has been sourced by OSPF
Both OSPF and IS-IS may run multiple routing protocol instances over
the same link. See [I-D.ietf-isis-mi] and
[I-D.ietf-ospf-multi-instance]. The 'Instance Identifier' field
identifies the protocol instance.
3.2. TLV Format
The Node Descriptors, Link Descriptors, Link Attribute, and Node
Attribute fields are described using a set of Type/Length/Value
triplets. The format of each TLV is shown in Figure 8.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Value (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 8: TLV format
Gredler, et al. Expires March 24, 2012 [Page 11]
Internet-Draft Link-State Info Distribution using BGP September 2011
The Length field defines the length of the value portion in octets
(thus a TLV with no value portion would have a length of zero). The
TLV is not padded to four-octet alignment; Unrecognized types are
ignored.
3.3. Node Descriptors
Each link gets anchored by at least a pair of router-IDs. Since
there are many Router-IDs formats (32 Bit IPv4 router-ID, 56 Bit ISO
Node-ID and 128 Bit IPv6 router-ID) a link may be anchored by more
than one Router-ID pair. The set of Local and Remote Node
Descriptors describe which Protocols Router-IDs will be following to
"anchor" the link described by the "Link attribute TLVs". There must
be at least one "like" router-ID pair of a Local Node Descriptors and
a Remote Node Descriptors per-protocol. If a peer sends an illegal
combination in this respect, then this is handled as an NLRI error,
described in [RFC4760].
It is desirable that the Router-ID assignments inside the Node anchor
are globally unique. However there may be router-ID spaces (e.g.
ISO) where not even a global registry exists, or worse, Router-IDs
have been allocated following private-IP RFC 1918 [RFC1918]
allocation. In order to disambiguate the Router-IDs the local and
remote Autonomous System number TLVs of the anchor nodes may be
included in the NLRI. The Local and Remote Autonomous System TLVs
are 4 octets wide as described in [RFC4893]. 2-octet AS Numbers shall
be expanded to 4-octet AS Numbers by zeroing the two MSB octets.
3.3.1. Local Node Descriptors
The Local Node Descriptors TLV (Type 256) contains Node Descriptors
for the node anchoring the local end of the link. The length of this
TLV is variable. The value contains one or more Node Descriptor Sub-
TLVs defined in Section 3.3.3.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Node Descriptor Sub-TLVs (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 9: Local Node Descriptors TLV format
Gredler, et al. Expires March 24, 2012 [Page 12]
Internet-Draft Link-State Info Distribution using BGP September 2011
3.3.2. Remote Node Descriptors
The Remote Node Descriptors TLV (Type 257) contains Node Descriptors
for the node anchoring the remote end of the link. The length of
this TLV is variable. The value contains one or more Node Descriptor
Sub-TLVs defined in Section 3.3.3.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Node Descriptor Sub-TLVs (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 10: Remote Node Descriptors TLV format
3.3.3. Node Descriptor Sub-TLVs
The Node Descriptor Sub-TLV type codepoints and lengths are listed in
the following table:
+------+-------------------+--------+
| Type | Description | Length |
+------+-------------------+--------+
| 258 | Autonomous System | 4 |
| 259 | IPv4 Router-ID | 4 |
| 260 | IPv6 Router-ID | 16 |
| 261 | ISO Node-ID | 7 |
+------+-------------------+--------+
Table 1: Node Descriptor Sub-TLVs
The TLV values in Node Descriptor Sub-TLVs are as follows:
Autonomous System: opaque value (32 Bit AS ID)
IPv4 Router ID: opaque value (can be an IPv4 address or an 32 Bit
router ID)
IPv6 Router ID: opaque value (can be an IPv6 address or 128 Bit
router ID)
Gredler, et al. Expires March 24, 2012 [Page 13]
Internet-Draft Link-State Info Distribution using BGP September 2011
ISO Node ID: ISO node-ID (6 octets ISO system-ID plus PSN octet)
3.3.4. Router-ID Anchoring Example: ISO Pseudonode
IS-IS Pseudonodes are a good example for the variable Router-ID
anchoring. Consider Figure 11. This represents a Broadcast LAN
between a pair of routers. The "real" (=non pseudonode) routers have
both an IPv4 Router-ID and IS-IS Node-ID. The pseudonode does not
have an IPv4 Router-ID. Two unidirectional links (Node1, Pseudonode
1) and (Pseudonode 1, Node 2) are being generated.
The NRLI for (Node1, Pseudonode1) encodes local IPv4 router-ID, local
ISO node-ID and remote ISO node-id)
The NLRI for (Pseudonode1, Node2) encodes a local ISO node-ID, remote
IPv4 router-ID and remote ISO node-id.
+-----------------+ +-----------------+ +-----------------+
| Node1 | | Pseudonode 1 | | Node2 |
|1921.6800.1001.00|--->|1921.6800.1001.02|--->|1921.6800.1002.00|
| 192.168.1.1 | | | | 192.168.1.2 |
+-----------------+ +-----------------+ +-----------------+
Figure 11: IS-IS Pseudonodes
3.3.5. Router-ID Anchoring Example: OSPFv2 to IS-IS Migration
Migrating gracefully from one IGP to another requires congruent
operation of both routing protocols during the migration period. The
target protocol (IS-IS) supports more router-ID spaces than the
source (OSPFv2) protocol. When advertising a point-to-point link
between an OSPFv2-only router and an OSPFv2 and IS-IS enabled router
the following link information may be generated. Note that the IS-IS
router also supports the IPv6 traffic engineering extensions RFC 6119
[RFC6119] for IS-IS.
The NRLI encodes local IPv4 router-id, remote IPv4 router-id, remote
ISO node-id and remote IPv6 node-id.
3.4. Link Descriptors
The 'Link Descriptor' field is a set of Type/Length/Value (TLV)
triplets. The format of each TLV is shown in Figure 8. The 'Link
descriptor' TLVs uniquely identify a link between a pair of anchor
Routers.
The encoding of 'Link Descriptor' TLVs, i.e. the Codepoints in
'Type', and the 'Length' and 'Value' fields are the same as defined
Gredler, et al. Expires March 24, 2012 [Page 14]
Internet-Draft Link-State Info Distribution using BGP September 2011
in [RFC5305], [RFC5307], and [RFC6119] for sub-TLVs in the Extended
IS reachability TLV. The Codepoints are in the IANA Protocol
Registry for IS-IS, sub-TLV Codepoints for TLV 22, [IANA-ISIS].
Although the encodings for 'Link Descriptor' TLVs were originally
defined for IS-IS, the TLVs can carry data sourced either by IS-IS or
OSPF.
The following link descriptor TLVs are valid in the Link NLRI:
+------+-------------------------------+------------------------+
| Type | Description | Defined in: |
+------+-------------------------------+------------------------+
| 4 | Link Local/Remote Identifiers | [RFC5307], Section 1.1 |
| 6 | IPv4 interface address | [RFC5305], Section 3.2 |
| 8 | IPv4 neighbor address | [RFC5305], Section 3.3 |
| 12 | IPv6 interface address | [RFC6119], Section 4.2 |
| 13 | IPv6 neighbor address | [RFC6119], Section 4.3 |
| 222 | Multi Topology ID | Section 3.5 |
+------+-------------------------------+------------------------+
Table 2: Link Descriptor TLVs
3.5. Multi Topology ID TLV
The Multi Topology ID TLV (Type 222) carries the Multi Topology ID
for this link. The semantics of the Multi Topology ID are defined in
RFC5120, Section 7.2 [RFC5120], and the OSPF Multi Topology ID),
defined in RFC4915, Section 3.7 [RFC4915]. If the value in the Multi
Topology ID TLV is derived from OSPF, then the upper 9 bits of the
Multi Topology ID are set to 0.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|R R R R| Multi Topology ID |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 12: Multi Topology ID TLV format
3.6. Link Attributes
The 'Link Attributes' field is a set of Type/Length/Value (TLV)
triplets. The format of each TLV is shown in Figure 8.
For Codepoints < 255, the encoding of 'Link Attributes' TLVs, i.e.
the Codepoints in 'Type', and the 'Length' and 'Value' fields are the
Gredler, et al. Expires March 24, 2012 [Page 15]
Internet-Draft Link-State Info Distribution using BGP September 2011
same as defined in [RFC5305], [RFC5307], and [RFC6119] for sub-TLVs
in the Extended IS reachability TLV. The Codepoints are in the IANA
Protocol Registry for IS-IS, sub-TLV Codepoints for TLV 22,
[IANA-ISIS]. Although the encodings for 'Link Attributes' TLVs were
originally defined for IS-IS, the TLVs can carry data sourced either
by IS-IS or OSPF.
For Codepoints > 255, the encoding of 'Link Attributes' TLVs is
described in subsequent sections.
The following link attribute TLVs are valid in the Link NLRI:
+-------+--------------------------------+------------------------+
| Type | Description | Defined in: |
+-------+--------------------------------+------------------------+
| 3 | Administrative group (color) | [RFC5305], Section 3.1 |
| 9 | Maximum link bandwidth | [RFC5305], Section 3.3 |
| 10 | Max. reservable link bandwidth | [RFC5305], Section 3.5 |
| 11 | Unreserved bandwidth | [RFC5305], Section 3.6 |
| 20 | Link Protection Type | [RFC5307], Section 1.2 |
| 64509 | MPLS Protocol | Section 3.6.1 |
| 64510 | TE Default Metric | Section 3.6.2 |
| 64511 | IGP Link Metric | Section 3.6.3 |
| 64512 | Shared Risk Link Group | Section 3.6.4 |
| 64513 | OSPF specific link attribute | Section 3.6.5 |
| 64514 | IS-IS specific link attribute | Section 3.6.6 |
| 64515 | Area ID | Section 3.6.7 |
+-------+--------------------------------+------------------------+
Table 3: Link Attribute TLVs
3.6.1. MPLS Protocol TLV
The MPLS Protocol TLV (Type 64511) carries a bit mask describing
which MPLS signaling protocols are enabled. The length of this TLV
is 1. The value is a bit array of 8 flags, where each bit represents
an MPLS Protocol capability.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|L R |
+-+-+-+-+-+-+-+-+
Figure 13: MPLS Protocol TLV
Gredler, et al. Expires March 24, 2012 [Page 16]
Internet-Draft Link-State Info Distribution using BGP September 2011
The following bits are defined:
+-----+---------------------------------------------+-----------+
| Bit | Description | Reference |
+-----+---------------------------------------------+-----------+
| 0 | Label Distribution Protocol (LDP) | [RFC5036] |
| 1 | Extension to RSVP for LSP Tunnels (RSVP-TE) | [RFC3209] |
| 2-7 | Reserved for future use | |
+-----+---------------------------------------------+-----------+
Table 4: MPLS Protocol TLV Codes
3.6.2. TE Default Metric TLV
The TE Default Metric TLV (Type 64512) carries the TE Default metric
for this link. This TLV corresponds to the IS-IS TE Default metric
sub-TLV (Type 18), defined in RFC5305, Section 3.7 [RFC5305], and the
OSPF TE Metric sub-TLV (Type 5), defined in RFC3630, Section 2.5.5
[RFC3630]. If the value in the TE Default metric TLV is derived from
IS-IS TE Default Metric, then the upper 8 bits of this TLV are set to
0.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TE Default Metric |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 14: TE Default metric TLV format
3.6.3. IGP Link Metric TLV
The IGP Metric TLV (Type 64513) carries the IGP metric for this link.
This attribute is only present if the IGP link metric is different
from the TE Default Metric (Type 18). The length of this TLV is 3.
If the length of the IGP link metric from which the IGP Metric value
is derived is less than 3 (e.g. for OSPF link metrics or non-wide
IS-IS metric), then the upper bits of the TLV are set to 0.
Gredler, et al. Expires March 24, 2012 [Page 17]
Internet-Draft Link-State Info Distribution using BGP September 2011
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| IGP Link Metric |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 15: IGP Link Metric TLV format
3.6.4. Shared Risk Link Group TLV
The Shared Risk Link Group (SRLG) TLV (Type 64514) carries the Shared
Risk Link Group information (see Section 2.3, "Shared Risk Link Group
Information", of [RFC4202]). It contains a data structure consisting
of a (variable) list of SRLG values, where each element in the list
has 4 octets, as shown in Figure 16. The length of this TLV is 4 *
(number of SRLG values).
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Shared Risk Link Group Value |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| ............ |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Shared Risk Link Group Value |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 16: Shared Risk Link Group TLV format
Note that there is no SRLG TLV in OSPF-TE. In IS-IS the SRLG
information is carried in two different TLVs: the IPv4 (SRLG) TLV
(Type 138) defined in [RFC5307], and the IPv6 SRLG TLV (Type 139)
defined in [RFC6119]. Since the Link State NLRI uses variable
Router-ID anchoring, both IPv4 and IPv6 SRLG information can be
carried in a single TLV.
3.6.5. OSPF specific link attribute TLV
The OSPF specific link attribute TLV is an envelope that
transparently carries optional link properties TLVs advertised by an
OSPF router. The value field contains one or more optional OSPF link
attribute TLVs. An originating router shall use this TLV for
encoding information specific to the OSPF protocol or new OSPF
extensions for which there is no protocol neutral representation in
Gredler, et al. Expires March 24, 2012 [Page 18]
Internet-Draft Link-State Info Distribution using BGP September 2011
the BGP link-state NLRI.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| OSPF specific link attributes (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 17: OSPF specific link attribute format
3.6.6. IS-IS specific link attribute TLV
The IS-IS specific link attribute TLV is an envelope that
transparently carries optional link properties TLVs advertised by an
IS-IS router. The value field contains one or more optional IS-IS
link attribute TLVs. An originating router shall use this TLV for
encoding information specific to the IS-IS protocol or new IS-IS
extensions for which there is no protocol neutral representation in
the BGP link-state NLRI.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| IS-IS specific link attributes (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 18: IS-IS specific link attribute format
3.6.7. Link Area TLV
The Area TLV (Type 64515) carries the Area ID which is assigned on
this link. If a link is present in more than one Area then several
occurrences of this TLV may be generated. Since only the OSPF
protocol carries the notion of link specific areas, the Area ID has a
fixed length of 4 octets.
Gredler, et al. Expires March 24, 2012 [Page 19]
Internet-Draft Link-State Info Distribution using BGP September 2011
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Area ID |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 19: Link Area TLV format
3.7. Node Attributes
The following node attribute TLVs are valid in the Node NLRI:
+-------+--------------------------------+----------+
| Type | Description | Length |
+-------+--------------------------------+----------+
| 229 | Multi Topology | 2 |
| 65515 | Node Flag Bits | 1 |
| 65516 | OSPF Specific Node Properties | variable |
| 65517 | IS-IS Specific Node Properties | variable |
| 65518 | Node Area ID | variable |
+-------+--------------------------------+----------+
Table 5: Node Attribute TLVs
3.7.1. Multi Topology Node TLV
The Multi Topology TLV (Type 229) carries the Multi Topology ID and
topology specific flags for this node. The format of the Multi
Topology TLV is defined in RFC5120, Section 7.1 [RFC5120]. If the
value in the Multi Topology TLV is derived from OSPF, then the upper
9 bits of the Multi Topology ID and the 'O' and 'A' bits are set to
0.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|O A R R| Multi Topology ID |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 20: Multi Topology Node TLV format
Gredler, et al. Expires March 24, 2012 [Page 20]
Internet-Draft Link-State Info Distribution using BGP September 2011
3.7.2. Node Flag Bits TLV
The Node Flag Bits TLV (Type 1) carries a bit mask describing node
attributes. The value is a bit array of 8 flags, where each bit
represents an MPLS Protocol capability.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Flags |
+-+-+-+-+-+-+-+-+
Figure 21: Node Flag Bits TLV format
The bits are defined as follows:
+-----+--------------+-----------+
| Bit | Description | Reference |
+-----+--------------+-----------+
| 0 | Overload Bit | [RFC1195] |
| 1 | Attached Bit | [RFC1195] |
| 2 | External Bit | [RFC2328] |
| 3 | ABR Bit | [RFC2328] |
+-----+--------------+-----------+
Table 6: Node Flag Bits Definitions
3.7.3. OSPF Specific Node Properties TLV
The OSPF Specific Node Properties TLV is an envelope that
transparently carries optional node properties TLVs advertised by an
OSPF router. The value field contains one or more optional OSPF node
property TLVs, such as the OSPF Router Informational Capabilities TLV
defined in [RFC4970], or the OSPF TE Node Capability Descriptor TLV
described in [RFC5073]. An originating router shall use this TLV for
encoding information specific to the OSPF protocol or new OSPF
extensions for which there is no protocol neutral representation in
the BGP link-state NLRI.
Gredler, et al. Expires March 24, 2012 [Page 21]
Internet-Draft Link-State Info Distribution using BGP September 2011
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| OSPF specific node properties (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 22: OSPF specific Node property format
3.7.4. IS-IS Specific Node Properties TLV
The IS-IS Router Specific Node Properties TLV is an envelope that
transparently carries optional node specific TLVs advertised by an
IS-IS router. The value field contains one or more optional IS-IS
node property TLVs, such as the IS-IS TE Node Capability Descriptor
TLV described in [RFC5073]. An originating router shall use this TLV
for encoding information specific to the IS-IS protocol or new IS-IS
extensions for which there is no protocol neutral representation in
the BGP link-state NLRI.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| IS-IS specific node properties (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 23: IS-IS specific Node property format
3.7.5. Area Node TLV
The Area TLV (Type 65518) carries the Area ID which is assigned to
this node. If a node is present in more than one Area then several
occurrences of this TLV may be generated. Since only the IS-IS
protocol carries the notion of per-node areas, the Area ID has a
variable length of 1 to 20 octets.
Gredler, et al. Expires March 24, 2012 [Page 22]
Internet-Draft Link-State Info Distribution using BGP September 2011
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Area ID (variable) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 24: Area Node TLV format
3.8. Inter-AS Links
The main source of TE information is the IGP, which is not active on
inter-AS links. In order to inject a non-IGP enabled link into the
BGP link-state RIB an implementation must support configuration of
static links.
4. Link to Path Aggregation
Distribution of all links available in the global Internet is
certainly possible, however not desirable from a scaling and privacy
point of view. Therefore an implementation may support link to path
aggregation. Rather than advertising all specific links of a domain,
an ASBR may advertise an "aggregate link" between a non-adjacent pair
of nodes. The "aggregate link" represents the aggregated set of link
properties between a pair of non-adjacent nodes. The actual methods
to compute the path properties (of bandwidth, metric) are outside the
scope of this document. The decision whether to advertise all
specific links or aggregated links is an operator's policy choice.
To highlight the varying levels of exposure, the following deployment
examples shall be discussed.
4.1. Example: No Link Aggregation
Consider Figure 25. Both AS1 and AS2 operators want to protect their
inter-AS {R1,R3}, {R2, R4} links using RSVP-FRR LSPs. If R1 wants to
compute its link-protection LSP to R3 it needs to "see" an alternate
path to R3. Therefore the AS2 operator exposes its topology. All
BGP TE enabled routers in AS1 "see" the full topology of AS and
therefore can compute a backup path. Note that the decision if the
direct link between {R3, R4} or the {R4, R5, R3) path is used is made
by the computing router.
Gredler, et al. Expires March 24, 2012 [Page 23]
Internet-Draft Link-State Info Distribution using BGP September 2011
AS1 : AS2
:
R1-------R3
| : | \
| : | R5
| : | /
R2-------R4
:
:
Figure 25: no-link-aggregation
4.2. Example: ASBR to ASBR Path Aggregation
The brief difference between the "no-link aggregation" example and
this example is that no specific link gets exposed. Consider
Figure 26. The only link which gets advertised by AS2 is an
"aggregate" link between R3 and R4. This is enough to tell AS1 that
there is a backup path. However the actual links being used are
hidden from the topology.
AS1 : AS2
:
R1-------R3
| : |
| : |
| : |
R2-------R4
:
:
Figure 26: asbr-link-aggregation
4.3. Example: Multi-AS Path Aggregation
Service providers in control of multiple ASes may even decide to not
expose their internal inter-AS links. Consider Figure 27. Rather
than exposing all specific R3 to R6 links, AS3 is modeled as a single
node which connects to the border routers of the aggregated domain.
Gredler, et al. Expires March 24, 2012 [Page 24]
Internet-Draft Link-State Info Distribution using BGP September 2011
AS1 : AS2 : AS3
: :
R1-------R3-----
| : : \
| : : vR0
| : : /
R2-------R4-----
: :
: :
Figure 27: multi-as-aggregation
5. IANA Considerations
This document requests a code point from the registry of Address
Family Numbers
This document requests creation of a new registry for node anchor,
link descriptor and link attribute TLVs. The range of Codepoints in
the registry is 0-65535. Values 0-255 will shadow Codepoints of the
IANA Protocol Registry for IS-IS, sub-TLV Codepoints for TLV 22.
Values 256-65535 will be used for Codepoints that are specific to the
BGP TE NLRI. The registry will be initialized as shown in Table 2
and Table 3. Allocations within the registry will require
documentation of the proposed use of the allocated value and approval
by the Designated Expert assigned by the IESG (see [RFC5226]).
Note to RFC Editor: this section may be removed on publication as an
RFC.
6. Manageability Considerations
This section is structured as recommended in [RFC5706].
6.1. Operational Considerations
6.1.1. Operations
Existing BGP operation procedures apply. No new operation procedures
are defined in this document.
6.1.2. Installation and Initial Setup
Configuration parameters defined in Section 6.2.3 SHOULD be
initialized to the following default values:
Gredler, et al. Expires March 24, 2012 [Page 25]
Internet-Draft Link-State Info Distribution using BGP September 2011
o The Link-State NLRI capability is turned off for all neighbors.
o The maximum rate at which Link State NLRIs will be advertised/
withdrawn from neighbors is set to ???.
6.1.3. Migration Path
The proposed extension is only activated between BP peers after
capability negotiation. Moreover, the extensions can be turned on/
off an individual peer basis (see Section 6.2.3), so the extension
can be gradually rolled out in the network.
6.1.4. Requirements on Other Protocols and Functional Components
The protocol extension defined in this document does not put new
requirements on other protocols or functional components.
6.1.5. Impact on Network Operation
Frequency of Link-State NLRI updates could interfere with regular BGP
prefix distribution. A network operator MAY use a dedicated Route-
Reflector infrastructure to distribute Link-State NLRIs.
Distribution of Link-State NLRIs SHOULD be limited to a single admin
domain, which can consist of multiple areas within an AS or multiple
ASes.
6.1.6. Verifying Correct Operation
Existing BGP procedures apply. In addition, an implementation SHOULD
allow an operator to:
o List neighbors with whom the Speaker is exchanging Link-State
NLRIs
6.2. Management Considerations
6.2.1. Management Information
6.2.2. Fault Management
TBD.
6.2.3. Configuration Management
An implementation SHOULD allow the operator to specify neighbors to
which Link-State NLRIs will be advertised and from which Link-State
NLRIs will be accepted.
Gredler, et al. Expires March 24, 2012 [Page 26]
Internet-Draft Link-State Info Distribution using BGP September 2011
An implementation SHOULD allow the operator to specify the maximum
rate at which Link State NLRIs will be advertised/withdrawn from
neighbors
An implementation SHOULD allow the operator to specify the maximum
rate at which Link State NLRIs will be accepted from neighbors
An implementation SHOULD allow the operator to specify the maximum
number of Link State NLRIs stored in router's RIB.
An implementation SHOULD allow the operator to create abstracted
topologies that are advertised to neighbors; Create different
abstractions for different neighbors.
6.2.4. Accounting Management
Not Applicable.
6.2.5. Performance Management
An implementation SHOULD provide the following statistics:
o Total number of Link-State NLRI updates sent/received
o Number of Link-State NLRI updates sent/received, per neighbor
o Number of errored received Link-State NLRI updates, per neighbor
o Total number of locally originated Link-State NLRIs
6.2.6. Security Management
An operator SHOULD define ACLs to limit inbound updates as follows:
o Drop all updates from Consumer peers
7. Security Considerations
Procedures and protocol extensions defined in this document do not
affect the BGP security model.
A BGP Speaker SHOULD NOT accept updates from a Consumer peer.
An operator SHOULD employ a mechanism to protect a BGP Speaker
against DDOS attacks from Consumers.
Gredler, et al. Expires March 24, 2012 [Page 27]
Internet-Draft Link-State Info Distribution using BGP September 2011
8. Acknowledgements
We would like to thank Nischal Sheth for contributions to this
document.
We would like to thank Alia Atlas, David Ward, John Scudder, Kaliraj
Vairavakkalai, Yakov Rekhter, Les Ginsberg, Mike Shand, and Richard
Woundy for their comments.
9. References
9.1. Normative References
[IANA-ISIS]
"IS-IS TLV Codepoint, Sub-TLVs for TLV 22", <http://
www.iana.org/assignments/isis-tlv-codepoints/
isis-tlv-codepoints.xml#isis-tlv-codepoints-3>.
[RFC1195] Callon, R., "Use of OSI IS-IS for routing in TCP/IP and
dual environments", RFC 1195, December 1990.
[RFC1918] Rekhter, Y., Moskowitz, R., Karrenberg, D., Groot, G., and
E. Lear, "Address Allocation for Private Internets",
BCP 5, RFC 1918, February 1996.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC2328] Moy, J., "OSPF Version 2", STD 54, RFC 2328, April 1998.
[RFC3209] Awduche, D., Berger, L., Gan, D., Li, T., Srinivasan, V.,
and G. Swallow, "RSVP-TE: Extensions to RSVP for LSP
Tunnels", RFC 3209, December 2001.
[RFC3630] Katz, D., Kompella, K., and D. Yeung, "Traffic Engineering
(TE) Extensions to OSPF Version 2", RFC 3630,
September 2003.
[RFC4202] Kompella, K. and Y. Rekhter, "Routing Extensions in
Support of Generalized Multi-Protocol Label Switching
(GMPLS)", RFC 4202, October 2005.
[RFC4271] Rekhter, Y., Li, T., and S. Hares, "A Border Gateway
Protocol 4 (BGP-4)", RFC 4271, January 2006.
[RFC4760] Bates, T., Chandra, R., Katz, D., and Y. Rekhter,
"Multiprotocol Extensions for BGP-4", RFC 4760,
Gredler, et al. Expires March 24, 2012 [Page 28]
Internet-Draft Link-State Info Distribution using BGP September 2011
January 2007.
[RFC4893] Vohra, Q. and E. Chen, "BGP Support for Four-octet AS
Number Space", RFC 4893, May 2007.
[RFC4915] Psenak, P., Mirtorabi, S., Roy, A., Nguyen, L., and P.
Pillay-Esnault, "Multi-Topology (MT) Routing in OSPF",
RFC 4915, June 2007.
[RFC5036] Andersson, L., Minei, I., and B. Thomas, "LDP
Specification", RFC 5036, October 2007.
[RFC5120] Przygienda, T., Shen, N., and N. Sheth, "M-ISIS: Multi
Topology (MT) Routing in Intermediate System to
Intermediate Systems (IS-ISs)", RFC 5120, February 2008.
[RFC5226] Narten, T. and H. Alvestrand, "Guidelines for Writing an
IANA Considerations Section in RFCs", BCP 26, RFC 5226,
May 2008.
[RFC5305] Li, T. and H. Smit, "IS-IS Extensions for Traffic
Engineering", RFC 5305, October 2008.
[RFC5307] Kompella, K. and Y. Rekhter, "IS-IS Extensions in Support
of Generalized Multi-Protocol Label Switching (GMPLS)",
RFC 5307, October 2008.
[RFC6119] Harrison, J., Berger, J., and M. Bartlett, "IPv6 Traffic
Engineering in IS-IS", RFC 6119, February 2011.
9.2. Informative References
[I-D.ietf-alto-protocol]
Alimi, R., Penno, R., and Y. Yang, "ALTO Protocol",
draft-ietf-alto-protocol-08 (work in progress), May 2011.
[I-D.ietf-isis-mi]
Previdi, S., Ginsberg, L., Shand, M., Roy, A., and D.
Ward, "IS-IS Multi-Instance", draft-ietf-isis-mi-04 (work
in progress), March 2011.
[I-D.ietf-ospf-multi-instance]
Lindem, A., Roy, A., and S. Mirtorabi, "OSPF Multi-
Instance Extensions", draft-ietf-ospf-multi-instance-04
(work in progress), April 2011.
[RFC4655] Farrel, A., Vasseur, J., and J. Ash, "A Path Computation
Element (PCE)-Based Architecture", RFC 4655, August 2006.
Gredler, et al. Expires March 24, 2012 [Page 29]
Internet-Draft Link-State Info Distribution using BGP September 2011
[RFC4970] Lindem, A., Shen, N., Vasseur, JP., Aggarwal, R., and S.
Shaffer, "Extensions to OSPF for Advertising Optional
Router Capabilities", RFC 4970, July 2007.
[RFC5073] Vasseur, J. and J. Le Roux, "IGP Routing Protocol
Extensions for Discovery of Traffic Engineering Node
Capabilities", RFC 5073, December 2007.
[RFC5152] Vasseur, JP., Ayyangar, A., and R. Zhang, "A Per-Domain
Path Computation Method for Establishing Inter-Domain
Traffic Engineering (TE) Label Switched Paths (LSPs)",
RFC 5152, February 2008.
[RFC5693] Seedorf, J. and E. Burger, "Application-Layer Traffic
Optimization (ALTO) Problem Statement", RFC 5693,
October 2009.
[RFC5706] Harrington, D., "Guidelines for Considering Operations and
Management of New Protocols and Protocol Extensions",
RFC 5706, November 2009.
Authors' Addresses
Hannes Gredler
Juniper Networks, Inc.
1194 N. Mathilda Ave.
Sunnyvale, CA 94089
US
Email: hannes@juniper.net
Jan Medved
Juniper Networks, Inc.
1194 N. Mathilda Ave.
Sunnyvale, CA 94089
US
Email: jmedved@juniper.net
Gredler, et al. Expires March 24, 2012 [Page 30]
Internet-Draft Link-State Info Distribution using BGP September 2011
Adrian Farrel
Juniper Networks, Inc.
1194 N. Mathilda Ave.
Sunnyvale, CA 94089
US
Email: afarrel@juniper.net
Stefano Previdi
Cisco Systems, Inc.
Via Del Serafico, 200
Roma 00142
Italy
Email: sprevidi@cisco.com
Gredler, et al. Expires March 24, 2012 [Page 31]