Skip to main content

TRILL Distributed Layer 3 Gateway Framework
draft-hao-trill-irb-03

The information below is for an old version of the document.
Document Type
This is an older version of an Internet-Draft whose latest revision state is "Replaced".
Authors Hao Weiguo , Yizhou Li , Donald E. Eastlake 3rd
Last updated 2014-02-13
Replaced by draft-ietf-trill-irb, RFC 7956
RFC stream (None)
Formats
Stream Stream state (No stream defined)
Consensus boilerplate Unknown
RFC Editor Note (None)
IESG IESG state I-D Exists
Telechat date (None)
Responsible AD (None)
Send notices to (None)
draft-hao-trill-irb-03
TRILL                                                        Weiguo Hao
                                                              Yizhou Li
                                                        Donald Eastlake
Internet Draft                                                   Huawei
Intended status: Informational                        February 14, 2014
Expires: August 2014

                TRILL Distributed Layer 3 Gateway Framework
                        draft-hao-trill-irb-03.txt

Status of this Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79. This document may not be modified,
   and derivative works of it may not be created, and it may not be
   published except as an Internet-Draft.

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79. This document may not be modified,
   and derivative works of it may not be created, except to publish it
   as an RFC and to translate it into languages other than English.

   This document may contain material from IETF Documents or IETF
   Contributions published or made publicly available before November
   10, 2008. The person(s) controlling the copyright in some of this
   material may not have granted the IETF Trust the right to allow
   modifications of such material outside the IETF Standards Process.
   Without obtaining an adequate license from the person(s) controlling
   the copyright in such materials, this document may not be modified
   outside the IETF Standards Process, and derivative works of it may
   not be created outside the IETF Standards Process, except to format
   it for publication as an RFC or to translate it into languages other
   than English.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six
   months and may be updated, replaced, or obsoleted by other documents
   at any time.  It is inappropriate to use Internet-Drafts as
   reference material or to cite them other than as "work in progress."

Hao & Li               Expires August 14, 2014                [Page 1]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html

   This Internet-Draft will expire on August 14, 2014.

Copyright Notice

   Copyright (c) 2014 IETF Trust and the persons identified as the
   document authors. All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document. Please review these documents
   carefully, as they describe your rights and restrictions with
   respect to this document. Code Components extracted from this
   document must include Simplified BSD License text as described in
   Section 4.e of the Trust Legal Provisions and are provided without
   warranty as described in the Simplified BSD License.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document. Please review these documents
   carefully, as they describe your rights and restrictions with
   respect to this document.

Abstract

   Currently TRILL solution can only provide optimum unicast forwarding
   just for Layer2 traffic of intra-subnet forwarding, not for Layer3
   traffic(inter-subnet forwarding). In this document, a TRILL
   distributed gateway solution is introduced to provide optimum
   unicast forwarding not just for Layer 2 traffic (intra-subnet
   forwarding), but also for Layer 3 traffic (inter-subnet forwarding).
   In the TRILL distributed gateway scenario, an edge RB MUST perform
   the Layer 2 routing function for the End Systems that are on the same
   subnet and the IP routing for the End Systems that are on the
   different subnets of same tenant. ESADI extension can be used for
   synchronizing <MAC, IP> correspondence among edge RBridges. To
   reduce the number of ESADI session among edge RBridges, Management
   Data Label for ESADI is suggested to be used.

Hao & Li               Expires August 14, 2014                [Page 2]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

Table of Contents

   1. Introduction ................................................ 3
   2. Conventions used in this document............................ 4
   3. Problem Statement ........................................... 4
   4. Requirements When Edge RB Acts as Default GW................. 6
   5. Protocol extension to support <MAC, IP> correspondence
   synchronization ................................................ 8
   6. Management Data Labels for ESADI............................. 8
   7. Security Considerations...................................... 9
   8. IANA Considerations ......................................... 9
   9. Normative References......................................... 9
   10. Informative References...................................... 9
   11. Acknowledgments ........................................... 10

1. Introduction

   The IETF has standardized the TRILL (Transparent Interconnection of
   Lots of Links) protocol [RFC6325] that provides a solution for least
   cost transparent routing in multi-hop networks with arbitrary
   topologies and link technologies, using [IS-IS] [RFC6165]
   [RFC6326bis] link-state routing and a hop count. TRILL switches are
   sometimes called RBridges (Routing Bridges).

   Currently, TRILL only provides optimum unicast forwarding for Layer
   2 LAN traffic (intra-subnet forwarding), not for Layer 3 traffic
   (inter-subnet forwarding).

   In this document, a TRILL distributed layer 3 gateway solution is
   introduced to provide optimum unicast forwarding not just for Layer
   2 traffic (intra-subnet forwarding), but also for Layer 3 traffic
   (inter-subnet forwarding). In the TRILL distributed gateway solution,
   the edge RBridge provides a per tenant virtual switching and routing
   instance with address isolation and Layer 3 tunnel encapsulation
   across the core. The edge RBridge supports bridging among end
   stations that belong to same subnet and routing among end stations
   that belongs to different subnets of same routing domain.

   This document is organized as follows: Section 3 describes why an
   distributed gateway solution is needed. Section 4 gives forwarding
   procedures. Section 5 describes TRILL protocol extensions to support
   TRILL distributed gateway solution.

   Familiarity with [RFC6325] and [ESADI] is assumed in this document.

Hao & Li               Expires August 14, 2014                [Page 3]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

2. Conventions used in this document

   End Station: VM or physical server, whose address is either a
   destination or the source of a data frame.

   ND: IPv6's Neighbor Discovery [RFC4861]

   VN:  Virtual Network. Each virtual network is identified by a unique
   12-bit VLAN ID or 24-bit Fine Grained Label [FGL] in TRILL network.

   VRF:  Virtual Routing and Forwarding. In IP-based computer networks,
   Virtual Routing and Forwarding (VRF) is a technology that allows
   multiple instances of a routing table to co-exist within the same
   router at the same time.

3. Problem Statement
                        --------                         ---------
                        | GW1   |                         | GW2   |
                        |       |                         |       |
                        ---------                         ---------
                          |                                 |
                          |                                 |
                        ---------                         ---------

                        | AGG1  |                         | AGG2  |
                        |       |                         |       |
                        ---------                         ---------

                          |                                 |
                 __________|_________________________________|_______________________

                  |         |           |                    |                     |
                 __|_________|___________|___________________ |____________________ |

                 | |                   | |                  | |                   | |
                 | |                   | |                  | |                   | |

               ---------            ---------             ---------              ---------
               | TOR1  |            | TOR2  |             | TOR3  |              | TOR4  |
               |       |            |       |             |       |              |       |
               ---------            ---------             ---------              ---------
                |    |               |    |                |    |                 |    |
                |    |               |    |                |    |                 |    |
               __|_  _|___           ____  ____           ____  ____             ____  ____
               |E |  |E |            |E |  |E |           |E |  |E |             |E |  |E |
               |S1|  |S2|            |S3|  |S4|           |S5|  |S6|             |S7|  |S8|
               ----  ----            ----  ----           ----  ----             ----  ----

                       Figure 1 A typical DC network

Hao & Li               Expires August 14, 2014                [Page 4]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   Figure-1 depicts a Data Center Network (DCN) using TRILL where edge
   RB functionality resides in physical Top of Rack (ToR) switches.
   Centralized gateway (GW) nodes are provided not only for north-south
   bound L3 forwarding but also for east-west bound inter-subnet L3
   forwarding. If two end stations of same tenant are on two different
   subnets and need to communicate with each other, their packets need
   to be forwarded all the way to a centralized layer 3 GW so one of
   the GW devices can perform L3 forwarding. This is generally sub-
   optimal because the two end stations may be connected to the same
   TOR where L3 switching could have been performed locally. If an edge
   RB has distributed gateway capability, then it can perform optimum
   L2 forwarding for intra-subnet traffic and optimum L3 forwarding for
   inter-subnet traffic, delivering optimum forwarding for unicast
   packets in all important cases.

                   +---------------------------------------------+
                   |                                             |
                   |      +-----------+         +-----------+    |
                   |      | Tenant n  |---------|  VRF n    |    |
                   |   +------------+ |     +------------+  |    |
                   |   |  +-----+   | |     |            |  |    |
                   |   |  | VN1 |   | |     |            |  |    |
                   |   |  +-----+   | |     |    VRF 1   |  |    |
                   |   |     ..     +-------+            |  |    |
                   |   |  +-----+   | |     |            |  |    |
                   |   |  | VNm |   | |     |            |  |    |
                   |   |  +-----+   | |     |            |  |    |
                   |   |  Tenant 1   |-+     |            |  |    |
                   |   +------------+       |            |  |    |
                   |   +------------+       +------------+       |
                   |                                             |
                   |                 Edge RB                     |
                   +---------------------------------------------+
                   Figure 2 Edge RB Model as default GW

   In a data center network (DCN), each tenant may include one or more
   IP subnets. Each IP subnet corresponds to one layer 2 virtual
   network and in normal cases each tenant corresponds to one routing
   domain (RD). One layer 2 virtual network (VN) maps to a unique IP
   subnet within a VRF context. Each layer 2 virtual network in a TRILL
   campus is identified by a unique 12-bit VLAN ID or 24-bit Fine
   Grained Label [FGL]. Different routing domains may have overlapping
   address space but need distinct and separate routes. The end systems
   that belongs to the same subnet communicate through L2 forwarding,

Hao & Li               Expires August 14, 2014                [Page 5]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   end systems of same tenant that belongs to different subnet
   communicate through L3 forwarding.

   The above figure 2 depicts the model where there are N VRFs
   corresponding to N tenants with each tenant having up to M
   segments/subnets (virtual network).

4. Requirements When Edge RB Acts as Default GW

   In the TRILL distributed gateway scenario, an edge RBridge must
   perform Layer 2 routing for the End Systems that are on the same
   subnet and the IP routing for the End Systems that are on the
   different subnets of same tenant. For Layer 3 traffic, an edge
   RBridge must act as default GW for the connected end systems that
   belong to each routing domain.

   Each GW should establish a gateway interface and VRF for each
   routing domain. Each Layer 2 VN maps to a unique IP subnet within a
   VRF context. Because the end systems in each routing domain may
   spread over multiple edge RBs, all these edge RBs should act as
   default GWs and have same gateway IP and MAC address for the
   connected end systems that belong to same routing domain. The
   default GW must satisfy following requirements:

   1, Support <MAC, IP> correspondence learning on each default GW for
   layer 2 connecting end systems. An edge RBridge can learn IP/MAC
   correspondence of locally attached end stations by inspecting the
   ARP message or other data frame. An end system uses the ARP/ND
   protocol to discover other end system MAC addresses if they are on
   the same subnet; An end system sends a packet to a known gateway if
   the destination of the packet is on different subnet from the sender
   end system and the end system uses ARP/ND protocol to find the
   gateway MAC address. When the default GW receives ARP/ND request
   packet from an access link, if destination IP in the packet equals
   the IP address of the default GW, it returns an ARP reply with self
   MAC and IP mapping information. After the end system acquires the
   MAC address of the GW, it will send unicast IP packets to
   destination end systems with destination MAC equals to the MAC of
   default GW, the default GW will perform L2 termination and find
   routing table entry with destination IP to perform L3 forwarding for
   the unicast packet.

   2, Support <MAC, IP> correspondence synchronization for each routing
   domain among default GWs.

Hao & Li               Expires August 14, 2014                [Page 6]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   For each tenant, there may be multiple L2 VNs and the end systems in
   each L2 VN may spread over multiple edge RBs. These edge RBs can
   only acquire the ARP/ND table for locally attached end systems. To
   support inter-subnet communication between locally attached end
   stations and remote end stations, an edge RBridge with attached end
   stations for several tenants should have <MAC, IP, L2 VNID > mapping
   information for all remote end stations of those tenants that are
   attached to all other edge RBs.

   After ARP/ND table synchronization is finished, all edge RBs keep
   all ARP/ND tables and install an IP forwarding table for all end
   systems in each VRF. After that, these edge RBs can support inter-
   subnet L3 forwarding for all end systems in each routing domain.

   3, Support L2 forwarding for intra-subnet traffic and L3 forwarding
   for inter-subnet traffic on each default GW.

   When ingress edge RB receives packets from a local attached end
   station, the RB performs following process:

   1. The RB will check the destination MAC, if the destination MAC
      equals to default GW's MAC, the GW will perform L3 forwarding
      process. Otherwise, the RB will perform L2 forwarding process and
      jump to step 4.

   2. The RB will find IP forwarding table by destination IP to get the
      MAC and VN ID(VLAN ID or FGL) of destination end station.

   3. The RB will modify source MAC, destination MAC and VN ID of the
      packet. Source MAC is modified to GW's MAC, destination MAC is
      modified to destination end station's MAC, VN ID is modified to
      destination end station's VN ID.

   3.4. The RB will perform L2 forwarding process by destination MAC
      in destination L2 VN ID and will get remote nickname by finding
      MAC table entry in destination L2 VN. Then it performs TRILL
      encapsulation and goes through optimal TRILL forwarding to the
      egress RB. After decapsulation at the egress RB, the packet will
      reach to destination end station.

   So when edge RBs support default GW function, optimum unicast
   forwarding will be performed not just for L2 traffic (intra-subnet
   forwarding), but also for L3 traffic (inter-subnet forwarding).In
   the TRILL IRB solution, edge RBridges are connected to each other
   via one or multiple RBridge hops, however they are always a single
   IP hop away.

Hao & Li               Expires August 14, 2014                [Page 7]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

5. Protocol extension to support <MAC, IP> correspondence
   synchronization

   Edge RBs that belong to same routing domain should synchronize their
   ARP/ND tables with each other. One routing domain may include
   multiple subnets and each subnet maps to a L2 VN ID. A possible
   method to synchronize ARP/ND tables among edge RBs  described in
   [ESADI].

   ESADI is a Data Label scoped way for RBridges to announce and learn
   end station MAC addresses. There is a separate ESADI instance for
   each Data Label (VLAN or FGL). The ESADI protocol can be extended to
   announce and learn end station ARP/ND tables amongst all edge RBs
   for each routing domain where edge RB acts as a default GW for local
   attached end stations.

   The Interface Addresses APPsub-TLV is used to indicate that a set of
   addresses on the same end-station interface and to associate that
   interface with the TRILL switch by which the interface is reachable.
   The TLV supports multiple address families and can be used to
   declare MAC and IPV4/IPV6 correspondence on each edge RBridge to
   TRILL campus.

   When an edge RBridge learns IP/MAC correspondence of a locally
   attached end station 1 by inspecting the ARP message or other data
   frame, it will use Interface Addresses APP sub-TLV and flood such
   information to all other edge RBs belonging to same routing domain.
   Edge RBs in the same routing domain must establish ESADI sessions
   for each layer 2 network beforehand. When an edge RBridge receives
   Interface Addresses APPsub-TLV, it retrieves IPv4 and MAC mapping
   information of the end station and installs that information in its
   IP routing table in the corresponding VRF. After that, the end
   stations attached to the receiving edge RBridges can communicate to
   end station 1 through layer 2 and layer 3 forwarding procedures.

6. Management Data Labels for ESADI

   As ESADI is a Data Label (VLAN or FGL) scoped solution, each edge
   RBridge needs to establish ESADI session for each L2 VN in a routing
   domain. Therefore the number of ESADI session is huge and is a big
   burden for each RBridge's CPU. So we suggest a Management Data Label
   for ESADI to be used for a set of VNs or domains.

   Every RBridge should be configured with a globally unique management
   data label. Rbridges establishes ESADI session using this management

Hao & Li               Expires August 14, 2014                [Page 8]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   Data Label. In extreme case, we can use one management ESADI session
   for all routing domains. With this approach CPU consumption can be
   greatly reduced on every RBridge. The correspondence of management
   Data Label and L2 VNs can be configured on every RBridge. The
   operator must make sure the configuration is consistent for all
   RBridges. A new TLV is suggested to be defined in ESADI to
   synchronize ARP/ND tables for multiple L2 VN in one ESADI session.

7. Security Considerations

   For general TRILL Security Considerations, see [RFC6325].

8. IANA Considerations

   This document requires no IANA actions. RFC Editor: Please remove
   this section before publication.

9. Normative References

   [1]  [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate

         Requirement Levels", BCP 14, RFC 2119, March 1997.

10. Informative References

   [1]  [RFC6325] Perlman, R., Eastlake 3rd, D., Dutt, D., Gai, S.,
         and A. Ghanwani, "Routing Bridges (RBridges): Base Protocol

         Specification", RFC 6325, July 2011.

   [2]   [rfc6326bis] - Eastlake, D., Banerjee, A., Dutt, D., Perlman,
         R., and A. Ghanwani, "TRILL Use of IS-IS", draft-ietf-
         isisrfc6326bis-00.txt, work in progress.

   [3]   [ESADI] - Zhai, H., F. Hu, R. Perlman, D. Eastlake, J. Hudson,
         "TRILL(Transparent Interconnection of Lots of Links): The
         ESADI (End Station Address Distribution Information) Protocol",
         draft-ietf-trill-esadi-02.txt, work in progress.

   [4]  [FGL] Eastlake, D., M. Zhang, P. Agarwal, R. Perlman, D.
         Dutt, "TRILL (Transparent Interconnection of Lots of Links):
         Fine-Grained Labeling", draft-ietf-trill-fine-labeling, in RFC
         Ediotr's queue.

   [5]  [DIRECTORY] -  L.Dunar., D. Eastlake,  " TRILL: Directory
         Assistance Mechanisms", draft-dunbar-trill-scheme-for-
         directory-assist-04.txt, work in progress.

Hao & Li               Expires August 14, 2014                [Page 9]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   [6]  [RFC6165] Banerjee,A., Ward, D., Dutt, D.,

         , "Extensions to IS-IS for Layer-2 Systems", RFC 6165, April
         2011.

11. Acknowledgments

   The authors wish to acknowledge the important contributions of

   Zhenbin Li.

Hao & Li               Expires August 14, 2014               [Page 10]
Internet-Draft   TRILL Distributed Gateway Solution      February 2014

   Authors' Addresses

   Weiguo Hao
   Huawei Technologies
   101 Software Avenue,
   Nanjing 210012
   China
   Phone: +86-25-56623144
   Email: haoweiguo@huawei.com

   Yizhou Li
   Huawei Technologies
   101 Software Avenue,
   Nanjing 210012
   China
   Phone: +86-25-56625375
   Email: liyizhou@huawei.com

   Donald E. Eastlake
   Huawei Technologies
   155 Beaver Street
   Milford, MA 01757 USA

   Phone: +1-508-333-2270
   EMail: d3e3e3@gmail.com

Hao & Li               Expires August 14, 2014               [Page 11]