Skip to main content

BGP Link Bandwidth Extended Community
draft-ietf-idr-link-bandwidth-19

The information below is for an old version of the document.
Document Type
This is an older version of an Internet-Draft whose latest revision state is "Active".
Authors Pradosh Mohapatra , Reshma Das , SATYA R MOHANTY , Serge Krier , Rafal Jan Szarecki , Akshay Gattani
Last updated 2025-10-28 (Latest revision 2025-10-06)
Replaces draft-rfernando-idr-link-bandwidth
RFC stream Internet Engineering Task Force (IETF)
Formats
Reviews
Additional resources Mailing list discussion
Stream WG state Submitted to IESG for Publication
Document shepherd Jeffrey Haas
Shepherd write-up Show Last changed 2025-08-06
IESG IESG state IESG Evaluation::Revised I-D Needed
Consensus boilerplate Yes
Telechat date (None)
Has enough positions to pass.
Responsible AD Ketan Talaulikar
Send notices to jhaas@pfrc.org
IANA IANA review state IANA OK - Actions Needed
draft-ietf-idr-link-bandwidth-19
Network Working Group                                       P. Mohapatra
Internet-Draft                                                Google LLC
Intended status: Standards Track                             R. Das, Ed.
Expires: 9 April 2026                             Juniper Networks, Inc.
                                                         S. Mohanty, Ed.
                                                                 Zscaler
                                                                S. Krier
                                                           Cisco Systems
                                                           R.J. Szarecki
                                                              Google LLC
                                                              A. Gattani
                                                         Arista Networks
                                                          6 October 2025

                 BGP Link Bandwidth Extended Community
                    draft-ietf-idr-link-bandwidth-19

Abstract

   This document specifies a type of BGP Extended Community that enables
   routers to perform weighted load-balancing in multipath scenarios.

Requirements Language

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in [RFC2119].

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on 9 April 2026.

Mohapatra, et al.         Expires 9 April 2026                  [Page 1]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

Copyright Notice

   Copyright (c) 2025 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents (https://trustee.ietf.org/
   license-info) in effect on the date of publication of this document.
   Please review these documents carefully, as they describe your rights
   and restrictions with respect to this document.  Code Components
   extracted from this document must include Revised BSD License text as
   described in Section 4.e of the Trust Legal Provisions and are
   provided without warranty as described in the Revised BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Link Bandwidth Extended Community . . . . . . . . . . . . . .   3
   3.  Protocol Procedures . . . . . . . . . . . . . . . . . . . . .   4
     3.1.  Sender (Originating Link Bandwidth Extended Community)  .   4
     3.2.  Receiver (Receiving Link Bandwidth Extended Community)  .   4
     3.3.  Re-advertisement Procedures . . . . . . . . . . . . . . .   5
       3.3.1.  Re-advertisement with Next hop Change . . . . . . . .   5
       3.3.2.  Re-advertisement with Next Hop Unchanged  . . . . . .   5
     3.4.  Link Bandwidth Extended Community Arithmetic and BGP
           Multipath . . . . . . . . . . . . . . . . . . . . . . . .   6
   4.  Error Handling  . . . . . . . . . . . . . . . . . . . . . . .   6
   5.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .   6
   6.  Security Considerations . . . . . . . . . . . . . . . . . . .   7
   7.  Operational Considerations  . . . . . . . . . . . . . . . . .   7
     7.1.  Inconsistent Deployment . . . . . . . . . . . . . . . . .   7
   8.  Contributors  . . . . . . . . . . . . . . . . . . . . . . . .   8
   9.  Acknowledgments . . . . . . . . . . . . . . . . . . . . . . .   8
   10. References  . . . . . . . . . . . . . . . . . . . . . . . . .   9
     10.1.  Normative References . . . . . . . . . . . . . . . . . .   9
     10.2.  Informative References . . . . . . . . . . . . . . . . .   9
   Appendix A.  Document History . . . . . . . . . . . . . . . . . .   9
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  10

1.  Introduction

   Load balancing is a critical aspect of network design, enabling
   efficient utilization of available bandwidth and improving overall
   network performance.  Traditional equal-cost multi-path (ECMP)
   routing does not account for the varying capacities of different
   paths.  This document suggests that the bandwidth be carried in the
   network using one of two new extended communities [RFC4360] - the
   transitive and non-transitive Link Bandwidth Extended Community.  The

Mohapatra, et al.         Expires 9 April 2026                  [Page 2]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   Link Bandwidth Extended Community provides a mechanism for routers to
   advertise the bandwidth of their downstream path that may either be a
   directly connected link or multi-hop/multipath nexthop.  This
   mechanism facilitates maximizing utilization of network resources.

2.  Link Bandwidth Extended Community

   The Link Bandwidth Extended Community is defined as a BGP extended
   community that carries the bandwidth information of a router,
   represented by BGP Next Hop, connecting to a remote network.  This
   community can be used to inform other routers about the available
   bandwidth through a given route.

   The Link Bandwidth Extended Community can be either transitive or
   non-transitive.  Therefore the value of the high-order octet of the
   extended Type Field can be 0x00 or 0x40, respectively.  The value of
   the low-order octet of the extended type field for this communities
   is 0x04.  The value of the Global Administrator subfield in the Value
   Field SHOULD represent the Autonomous System of the router that
   attaches the Link Bandwidth Extended Community, but it can be set to
   any 2-byte value.  If the Autonomous System number cannot be
   represented in two octets, AS_TRANS [RFC6793], SHOULD be used in the
   Global Administrator subfield.  The encoding of 4-octet ASN is out of
   scope of this document.  The bandwidth value is expressed as 4 octets
   in [IEEE.754-2019] floating point format, units being bytes (not
   bits!) per second.  It is carried in the Local Administrator subfield
   of the Value Field.

    0                   1                   2                   3
    0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Type=0x00/0x40 | SubType= 0x04 |       AS Number               |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                     Bandwidth Value                           |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

    Type:   1-octet field MUST be set to 0x00 or 0x40
            to indicate transitive/non-transitive.

    SubType: 1-octet field MUST be set to 0x04
             to indicate 'Link-Bandwidth'.

    Global Administrator sub-field:
             2-octet represent the Autonomous System.

    Local Administrator sub-field:
             Bandwidth value (bytes per sec) encoded as 4 octets
             in IEEE floating point format.

Mohapatra, et al.         Expires 9 April 2026                  [Page 3]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

                Figure 1: Link Bandwidth Extended Community

3.  Protocol Procedures

   The procedures cover both the transitive and non-transitive variants
   of the Link Bandwidth Extended Community so that implementations can
   handle both variants in a way that supports existing deployments.
   Please refer to Section 5 and Appendix A for more details.

3.1.  Sender (Originating Link Bandwidth Extended Community)

   A BGP speaker that attaches a Link Bandwidth Extended Community
   SHOULD be able to advertise either a transitive or a non-transitive
   Link Bandwidth Extended Community.  Implementations SHOULD provide
   configuration to set the transitivity type of the Link Bandwidth
   Extended Community, as well as the Global Administrator and bandwidth
   values in the Local Administrator field, using local policy.
   Different implementations MAY use different default values for the
   transitivity type of the Link Bandwidth Extended Community.  The
   provided configuration SHOULD allow operators to override the default
   transitivity value as needed.  An implementation MAY advertise
   bandwidth value as zero.

   Generally, a single Link Bandwidth Extended Community of the
   transitivity type that is desired in a deployment is attached to a
   route.  However during transition (refer Section 7 for details), a
   BGP speaker MAY attach one Link Bandwidth Extended Community per
   transitivity (transitive/non-transitive) both having the same
   'Bandwidth Value' field.

   A Link Bandwidth Extended Community MAY be attached or updated for a
   BGP route upon receipt during Adj-RIB-In processing.  The Link
   Bandwidth Extended Community MAY be attached or updated for a BGP
   route's Adj-RIB-Out entry while being advertised to a neighboring BGP
   speaker.

   Implementations MAY provide a configuration option to send non-
   transitive Link Bandwidth Extended Communities on external BGP
   sessions.

3.2.  Receiver (Receiving Link Bandwidth Extended Community)

   A BGP receiver MUST be able to process Link Bandwidth Extended
   Community of both transitive and non-transitive types.  The receiver
   MUST NOT flap or treat the route as malformed based on the
   transitivity of the Link Bandwidth Extended Community and/or BGP
   session type (internal vs. external).

Mohapatra, et al.         Expires 9 April 2026                  [Page 4]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   Implementations MAY provide configuration to accept non-transitive
   Link Bandwidth Extended Communities from external BGP sessions.

   A BGP update with an attached Link Bandwidth Extended Community with
   a bandwidth value of zero is valid.  When all contributing paths have
   a non-zero value in the Link Bandwidth Extended Community, the
   bandwidth values of those paths (or their ratio) can be utilized as
   weights to enable weighted load-balancing.  Details of weighted load-
   balancing are outside the scope of this document.  However, in the
   case where the paths have a mix of zero and non-zero values, or all
   zero values, the behavior is determined by local policy.  For
   example, implementations MAY exclude the paths with zero value from
   weighted load balancing formation as long as at least one path with
   non-zero value exists or they MAY fallback to ECMP.

3.3.  Re-advertisement Procedures

   This section describes the procedures to be followed when a BGP
   speaker receives a route with an attached Link Bandwidth Extended
   Community and subsequently re-advertises that route.

3.3.1.  Re-advertisement with Next hop Change

   When a BGP speaker re-advertises a route received with Link Bandwidth
   Extended Community and sets the next hop to itself or to another
   address, it MAY do any one of the following as its default behavior
   -remove the Link Bandwidth Extended Community, re-advertise it
   unchanged, or regenerate it with an appropriate value.
   Implementations SHOULD provide a local configuration method to alter
   their default behavior to the other options with per-session
   granularity.

   When regenerating Link Bandwidth Extended Community, the same
   procedures as outlined in Section 3.1 apply.  Please also refer to
   Section 3.4 for use in a BGP multipath environment.

3.3.2.  Re-advertisement with Next Hop Unchanged

   A BGP speaker that receives a route with a Link Bandwidth Extended
   Community and re-advertises or reflects the same without changing its
   next hop, SHOULD NOT change the Link Bandwidth Extended Community in
   any way.

Mohapatra, et al.         Expires 9 April 2026                  [Page 5]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

3.4.  Link Bandwidth Extended Community Arithmetic and BGP Multipath

   In a BGP multipath environment, the bandwidth value that is sent or
   re-advertised MAY be calculated based on the Link Bandwidth Extended
   Community associated with each constituent path contributing to
   multipath in the Local Routing Information Base (Local-RIB).  This
   topic is beyond the scope of this document.  Refer to
   [draft-ietf-bess-ebgp-dmz] which describes how this could be done in
   specific scenarios.

4.  Error Handling

   If a BGP speaker receives a route with more than one Link Bandwidth
   Extended Communities and uses the route to compute weighted load
   balancing, it SHOULD use the extended community with the lowest
   "Bandwidth Value", ignoring the transitivity.  Implementations MAY
   provide configuration to change the above preference.

   Between transitive and non-transitive types of Link Bandwidth
   Extended Communities that have the same 'Bandwidth Value', the
   transitivity doesn't matter for purpose of computing weighted load
   balancing or programming to FIB (Forwarding Information Base).

   Note that these procedures mean that a BGP speaker reflecting a route
   with next hop unchanged (e.g. RR) will re-advertise the Link
   Bandwidth Extended Communities received on the route as-is without
   any modification, while following the extended community transitivity
   rules.

   Link Bandwidth Extended Communities with a negative value SHALL be
   ignored and MUST NOT be advertised.

   Link Bandwidth Extended Communities with a zero value MUST NOT be
   considered malformed.

   If any of the paths lack a valid Link Bandwidth Extended Community,
   ECMP (Equal-Cost Multi-Path) MUST be used instead.

5.  IANA Considerations

   IANA is requested to update the Transitive Two-Octet AS-Specific
   Extended Community Sub-Types registry (Type 0x00) and Sub-Type 0x04
   to:

       Name
       ----
       transitive Link Bandwidth Extended Community

Mohapatra, et al.         Expires 9 April 2026                  [Page 6]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   IANA is requested to update the Non-Transitive Two-Octet AS-Specific
   Extended Community Sub-Types registry (Type 0x40) and Sub-Type 0x04
   to:

       Name
       ----
       non-transitive Link Bandwidth Extended Community

   Both updates are to reference this document.

6.  Security Considerations

   This extension to BGP has similar security implications as BGP
   Extended Communities [RFC4360]

   The Link Bandwidth Extended Community conveys bandwidth and capacity
   information that may be sensitive.  Exporting this community outside
   of an administrative domain can expose private network resource
   details.  When propagating the routes with Link Bandwidth Extended
   Community towards an untrusted network or outside of an
   administrative domain, it is recommended operators use policy to
   filter out this community.

7.  Operational Considerations

7.1.  Inconsistent Deployment

   Prior deployments of the feature specified in this document have
   involved implementations that only understood one of the two extended
   community transitivity types.  As a result, such implementations
   would treat the use of the other transitivity type in a "ships in the
   night" fashion.  The procedures in this document govern how multiple
   transitivity types for bandwidth should operate.

   In circumstances where networks have deployed a mixture of
   implementations supporting this document's procedures for both
   transitivity types, and older implementations that only understand
   one transitivity type, inconsistent behavior could result.  A prime
   example is when a route received by a BGP speaker contains both a
   transitive and a non-transitive Link Bandwidth Extended Community and
   that BGP speaker performs an operation that updates only one of the
   Link Bandwidth Extended Communities, the other community may have an
   inconsistent value.  As a result, downstream BGP speakers that may
   receive such routes may perform inappropriate weighted load
   balancing.

Mohapatra, et al.         Expires 9 April 2026                  [Page 7]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   To mitigate such issues, when operators are aware that older
   implementations are present in their networks, they may wish to take
   actions to address such inconsistencies.  One option would be to
   filter either at advertisement time on the older BGP speaker the
   unsupported transitivity type of Link Bandwidth Extended Community -
   if the implementation is capable of such filtering.  Alternatively, a
   receiving BGP speaker, knowing that the sending speaker is incapable
   of doing such operations, could strip the Link Bandwidth Extended
   Community type that is unsupported by the sender.

   Ideally this operational consideration is short-lived until all the
   routers in the network have been upgraded to implementations that
   consistently support the procedures in this document.

8.  Contributors

   Kaliraj Vairavakkalai
   Juniper Networks, Inc.
   1133 Innovation Way,
   Sunnyvale, CA 94089
   United States of America
   Email: kaliraj@juniper.net

   Natrajan Venkataraman
   Juniper Networks, Inc.
   1133 Innovation Way,
   Sunnyvale, CA 94089
   United States of America
   Email: natv@juniper.net

   Rex Fernando
   Cisco Systems
   170 W. Tasman Drive
   San Jose, CA 95134
   United States of America
   Email: rex@cisco.com

9.  Acknowledgments

   The authors would like to thank Yakov Rekhter, Srihari Sangli and Dan
   Tappan for proposing unequal cost load balancing as one possible
   application of the extended community attribute.  The authors would
   like to thank Jeff Haas for all the discussions and providing text
   for operational considerations.

Mohapatra, et al.         Expires 9 April 2026                  [Page 8]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   The authors would like to thank Bruno Decraene, Robert Raszuk, Joel
   Halpern, Aleksi Suhonen, Randy Bush, Stephane Litkowski, Mankamana
   Mishra, Moshiko Nayman, Keon Vafai, Ketan Talaulikar, Yingzhen Qu,
   Anoop Ghanwani, Dongjie (Jimmy) and John Scudder for their comments
   and contributions.

10.  References

10.1.  Normative References

   [IEEE.754-2019]
              IEEE, "IEEE Standard for Floating-Point Arithmetic", 22
              July 2019, <https://ieeexplore.ieee.org/document/8766229>.

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119,
              DOI 10.17487/RFC2119, March 1997,
              <https://www.rfc-editor.org/info/rfc2119>.

   [RFC4360]  Sangli, S., Tappan, D., and Y. Rekhter, "BGP Extended
              Communities Attribute", RFC 4360, DOI 10.17487/RFC4360,
              February 2006, <https://www.rfc-editor.org/info/rfc4360>.

   [RFC6793]  Vohra, Q. and E. Chen, "BGP Support for Four-Octet
              Autonomous System (AS) Number Space", RFC 6793,
              DOI 10.17487/RFC6793, December 2012,
              <https://www.rfc-editor.org/info/rfc6793>.

10.2.  Informative References

   [draft-ietf-bess-ebgp-dmz]
              Mohanty, S., "Cumulative DMZ Link Bandwidth and load-
              balancing", 20 July 2025,
              <https://tools.ietf.org/html/draft-ietf-bess-ebgp-dmz>.

Appendix A.  Document History

   BGP Link Bandwidth Extended Community has evolved over several
   versions of the IETF draft.  In the earlier versions up to draft-
   ietf-idr-link-bandwidth-08, only the non-transitive version of Link
   Bandwidth Extended Community was supported.  However, starting from
   draft-ietf-idr-link-bandwidth-09, both transitive and non-transitive
   versions of Link Bandwidth Extended Community are supported.

Mohapatra, et al.         Expires 9 April 2026                  [Page 9]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   A BGP speaker (Sender or Receiver) needs to be upgraded to support
   the procedures defined in this document to provide full
   interoperability for both transitive and non-transitive versions of
   Link Bandwidth Extended Community.  In order to simplify
   implementations, it is not a goal to provide interoperability by
   upgrading only the RR.

Authors' Addresses

   Pradosh Mohapatra
   Google LLC
   Email: pradosh@google.com

   Reshma Das (editor)
   Juniper Networks, Inc.
   1133 Innovation Way,
   Sunnyvale, CA 94089
   United States of America
   Email: dreshma@juniper.net

   Satya Mohanty (editor)
   Zscaler
   120 Holger Way,
   San Jose, CA 95134
   United States of America
   Email: smohanty@zscaler.com

   Serge Krier
   Cisco Systems
   Pegasus Parc, De Kleetlaan 6a
   Belgium
   Email: sekrier@cisco.com

   Rafal Jan Szarecki
   Google LLC
   1160 N Mathilda Ave,
   Sunnyvale, CA 94089
   United States of America
   Email: rszarecki@gmail.com

Mohapatra, et al.         Expires 9 April 2026                 [Page 10]
Internet-Draft    BGP Link Bandwidth Extended Community     October 2025

   Akshay Gattani
   Arista Networks
   5453 Great America Parkway
   Santa Clara, CA 95054
   United States of America
   Email: akshay@arista.com

Mohapatra, et al.         Expires 9 April 2026                 [Page 11]