MBONED Working Group                               Hugh LaMaster
Internet Draft                                     Steve Shultz
                                                   NASA ARC/NREN
                                                   John Meylor
                                                   David Meyer
                                                   Cisco Systems

Category                                           Informational
draft-ietf-mboned-mix-01.txt                       June, 1999

               Multicast-Friendly Internet Exchange (MIX)

1. Status of this Memo

   This document is an Internet-Draft and is in full conformance with
   all provisions of Section 10 of RFC 2026.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet- Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

LaMaster, Shultz, Meylor, Meyer                                 [Page 1]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

2. Abstract

   This document describes an architecture for a Multicast-friendly
   Internet eXchange (MIX), and the actual implementation at the NASA
   Ames Research Center Federal Internet eXchange (FIX-West, or FIX).
   The MIX has three objectives: native IP multicast routing, scalable
   interdomain policy-based route exchange, and to allow a variety of
   IGP protocols and topologies for intra-domain use. In support of
   these objectives, the MIX architecture defines the following
   components: a peer-peer routing protocol, a method for multicast
   forwarding, a method for exchanging information about active sources,
   and a medium which provides native multicast.  This document
   describes the protocols and configurations necessary to provide a
   current, working multicast-friendly internet exchange, or MIX.

   This memo is a product of the MBONE Deployment Working Group (MBONED)
   in the Operations and Management Area of the Internet Engineering
   Task Force. Submit comments to <mboned@ns.uoregon.edu> or the
   authors.

   Acknowledgments

   Thanks to the NASA HPCC program for supporting the NREN staff portion
   of this project; thanks to William P. Jones of the NASA ARC Gateway
   Facility for making the gateway facility available for housing this
   project.

3. Copyright Notice

   Copyright (C) The Internet Society (1999).  All Rights Reserved.

4. Introduction

   The MIX objective was to use current technology to implement a
   scalable, high-performance, efficient, native IP multicast
   architecture.

   Past experience at ARC, NASA WANs, and at FIX-West, had shown that
   mrouted/DVMRP "Mbone" tunnels were an inefficient of routing
   multicast through an exchange point.  Specifically, at FIX-West, the
   large number of tunnels often resulted in unicast traffic loads on
   the FIX FDDI that were 10 times the underlying multicast load.  In
   addition, some WANs had multiple tunnels criss-crossing the same
   physical links, resulting in wasted WAN bandwidth.  And, the separate
   workstation and router infrastructure for the "Mbone" tunnels created
   numerous problems.  Maintenance of Unix system and tunnel

LaMaster, Shultz, Meylor, Meyer                                 [Page 2]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

   configurations was often ad hoc, because some of the network
   operators lacked the necessary expertise.  And the hardware and
   software configuration and performance of the tunnel infrastructure
   was often out of step with the underlying router-based unicast
   structure. In addition, use of a single, shared, distance-vector IGP
   in the inter-domain space led to instability.

   Therefore, it was desired to implement a new multicast internet
   exchange from the ground up, using current technology, and
   significantly improving performance, efficiency, and reliability.

   Four elements were identified as being necessary for the MIX
   architecture in order to meet the objectives.  These were to define a
   peer-peer routing protocol, a method for multicast forwarding, a
   method for exchanging information about distant sources and groups,
   and a non-switched broadcast medium.

   NASA Ames Research Center hosts the Federal Internet eXchange (FIX-
   West, or, "the FIX") as well as hosting the Ames Internet eXchange
   (AIX), which is connected at high speed to the MAE-West, and, which
   also shares the same address space as the MAE-West. These facilities
   are co-located at the Ames Telecommunications Gateway Facility.  It
   was felt that this would be an excellent location to test the
   viability of the native multicast technologies. The Multicast-
   friendly Internet eXchange (MIX) is co-located adjacent to the FIX
   for easy access from the existing FIX routers.

   Choices were made for each element, and the MIX was implemented
   adjacent to the existing NASA ARC FIX gateway facility.  At the time
   of writing, there are eight direct participants in the MIX, peering
   and exchanging routes and multicast traffic natively, and the
   performance and reliability have already far exceeded the tunneled
   infrastructure the MIX replaced.

5. Requirements and Technology

   In order to meet the objectives for this multicast exchange, all
   peering partners had to agree mutually to standardize on the
   following four elements.  These are:

    - the protocol to be used for multicast route exchange
    - the method for performing multicast forwarding
    - the method for identifying active sources
    - the physical medium for the multicast exchange

   The elements chosen to implement the MIX were BGP4+ (also known as
   "MBGP") for routing and route exchange [BGP4+], PIM-SM for multicast

LaMaster, Shultz, Meylor, Meyer                                 [Page 3]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

   forwarding on the exchange, the MSDP protocol for information on
   sources and groups, and, FDDI for the multicast medium.

5.1. Routing

   Two of the objectives of the MIX were to provide an EGP for scalable
   interdomain policy-based route exchange, and to allow a variety of
   IGP protocols and topologies for intra-domain use.  As with unicast
   interdomain routing, BGP could be used as the EGP to exchange routes
   for multicast.  However, the unicast and multicast routing paths and
   policies would have to be completely congruent.  In practice, this is
   sometimes not the case.  It is possible, however, to take advantage
   of the extensions in BGP4+ to deal with these policy and path
   incongruencies.

   BGP4+ [BGP4+] describes extensions to (unicast) BGP that allow use of
   the existing BGP machinery to provide the necessary scalability,
   policy control, and route stability features and mechanisms to be
   applied to both unicast and multicast routes consistently.

   BGP4+ allows routes to be marked "unicast forwarding", "multicast
   forwarding", or "both unicast and multicast forwarding".  In this
   way, BGP4+ supports different multicast and unicast forwarding paths
   and policies.  This removes the dependency on unicast-only routing.

   The ability of BGP4+ to support separate paths and policies for
   multicast is important for meeting the objectives of the exchange in
   various ways.  It allows for a participant's multicast routing policy
   to be independent of its established unicast routing policy.  This is
   important in order that the exchange can support providers migrating
   to BGP4+ as an IDMR.  This is because it allows for the exchange of
   routes previously exchanged via DVMRP, even though those routes would
   not meet the existing unicast routing policy.  It allows for
   different policy in the interim.  For example, routes may be
   exchanged for BGP4+ multicast forwarding even though they would not
   be permitted under existing unicast routing policy.  BGP4+ also
   provides for the possibility that even after full migration is
   complete, a separate multicast routing policy can be applied.

   The exchange architecture imposes no requirements on the IGP or the
   multicast forwarding protocol or topology used internal to an AS.

LaMaster, Shultz, Meylor, Meyer                                 [Page 4]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

5.2. Multicast Forwarding

   The first requirement for the multicast forwarding protocol is that
   it be able to use routes exchanged via BGP4+.  In addition, there is
   a requirement that the protocol only forward data upon explicit joins
   from participating peers. For these reasons, PIM-SM was selected.

   The use of PIM on a shared LAN has certain consequences. It is
   necessary for all MIX participants to agree on certain configuration
   conventions affecting PIM forwarding on multi-access LANs.  In
   particular, it is necessary to establish a standard protocol "metric
   preference" (also known as "distance" or process "precedence") to be
   used by all peers for the PIM Assert process, because the PIM Assert
   process [PIM-SM] uses the "metric preference" [PIM-SM] as a mechanism
   by which the multicast forwarder is chosen.  If all parties are not
   following the convention, there may be black holes, in which a route
   appears to be valid, but traffic does not flow, or, there may be
   multicast loops, which can have deleterious consequences.

   For the MIX, a standard set of metric preferences are applied to the
   BGP4+ routes as the convention for the PIM forwarding mechanism.

5.3. Active Sources

   There are two current methods for distributing information about
   active sources to participating AS's.  The AS's may be dense-mode
   regions, or, they may contain PIM-SM RP's.  One method is to use
   dense-mode to flood data packets to dense-mode regions and to
   sparse-mode RPs co-located on the exchange.  The second method is to
   use a protocol that allows each AS to share information about the
   sources contained within it without flooding data.

   Recently, a new protocol, MSDP [MSDP] has been proposed that, when
   combined with PIM-SM, allows independent AS's to share information
   about distant sources and groups without flooding. Instead of
   flooding all data, only <S,G> information is flooded, and then, only
   to systems, such as PIM-SM RP's, which require the information.  MSDP
   allows each AS to run its own sparse-mode region, independent of all
   other sparse-mode regions.

   MSDP peering is established by all MIX participants. Most MIX-
   connected AS's are now running sparse-mode internally, or are
   actively migrating.

LaMaster, Shultz, Meylor, Meyer                                 [Page 5]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

5.4. Medium

   A primary objective for a multicast exchange is to provide support
   for native multicast among multiple peering partners.

   There exist a number of unresolved issues regarding use of layer-2
   switched media for multicast at interexchange points, and, until
   these issues are resolved, running native multicast on such media can
   be problematic.

   Fortunately, BGP4+ permits unicast and multicast to be carried on
   different media, permitting a multicast medium to be used
   independently of the unicast medium if necessary.

   A FDDI concentrator was selected for the Ames MIX to provide the
   native multicast exchange medium. It was router-efficient, because it
   permitted the medium to do the multicast packet replication, with a
   single copy from a router being replicated to all neighbors.  And
   FDDI was considered operationally convenient by most of the
   participants.  Unicast traffic continues to be routed over the
   existing unicast exchange media.

   Any medium which supports native multicast at layer 2 can be used
   efficiently.  Other multicast exchanges are using switches with fast
   and gigabit ethernet ports and the switch configured with multicast
   as broadcast.  Use of switching technology to handle layer 3
   multicast requirements for inter-router communication is still
   unresolved.  At some exchanges, native multicast is handled over ATM
   point-point VC's.

6. The NASA Ames Research Center Multicast-Friendly Internet Exchange

   The Ames Multicast-friendly Internet eXchange, or MIX, began with the
   first beta-test trials in March 1998, and became operational,
   exchanging BGP4+ routes externally and using BGP4+ between multiple
   AS's, in May 1998.  NREN implemented BGP4+ and internal BGP4+ and
   began trial external peerings in the same time frame, evolving from
   the first trials, to full deployment by October. As of June 1999,
   there were 10 AS's peering using PIM-SM, BGP4+, and MSDP to actively
   exchange multicast on the MIX FDDI.  One of the AS's, AS10888, acts
   as a gateway between the DVMRP-based "Mbone" and the BGP4+ area.  A
   router within AS10888 has been located on the MIX by NREN. The
   physical and logical topologies are as follows:

LaMaster, Shultz, Meylor, Meyer                                 [Page 6]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

                          AS10888---R----"MBone"
                                    |
                               MIX  |  multicast_exchange
                             ----------------
                                /         \
                               /           \
                  bgp4+_peer---R             R---bgp4+_peer
                               \           /
                                \         /
                              ---------------
                           FIX unicast_exchanges

7. Topology, Architecture, and Special Considerations

   BGP4+

    -PIM Asserts and Metric preference

   The PIM Assert mechanism requires that all routing protocols
   "compete" to see which router is allowed for forward onto the shared
   medium.  To first order, the protocol metric preference is used to
   determine the forwarder.  All MIX peers must coordinate routing
   protocol parameters so that one router does not inadvertantly win PIM
   asserts over a neighbor which has a functional path.  This requires
   that BGP4+ routes have preference over other routes, such as BGP,
   OSPF, and DVMRP.  In particular, it was necessary to standardize
   protocol metric preferences, and give BGP4+ routes the lowest,
   preferred, dynamic routing protocol metric preferences.  For this
   reason, the standard set of BGP4+ metric preferences was chosen to be
   less than any other dynamic unicast routing protocol metric
   preferences. Any MIX routers which are using DVMRP must use a DVMRP
   metric preference higher than the BGP4+ metric preferences, rather
   than what many people have used previously as the DVMRP metric
   preference, of 0.

    -Default

   One transitional requirement is the necessity to have routes to
   "Mbone" sources, that is, sources within the global DVMRP routing
   region.  Currently, the mechanism used is to have a single router in
   AS10888 on the MIX originate MBGP default to all external peers.

LaMaster, Shultz, Meylor, Meyer                                 [Page 7]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

   DVMRP routing

    -DVMRP route redistribution

   At present, all BGP4+ routes tagged with a particular community are
   redistributed at the MIX into DVMRP within AS10888.  This is to
   provide DVMRP region users access to sources originating within AS's
   that are being routed via BGP4+ exclusively. Unless a particular
   community string is set, it is assumed that redistribution is not
   desired.  In the reverse direction, instead of sending DVMRP routes
   into BGP4+, BGP4+ default is originated from the intermediary router.

   In addition, local, stub-region DVMRP routes are redistributed into
   BGP4+ internally by several of the peers.  As long as the regions
   remain stub regions, there is no danger, but, the possibility of a
   backdoor into the Mbone presents an ever-present threat of loops
   unless care is taken to redistribute only the routes which are known
   to be owned within the AS.

8. Conclusions and Recommendations

    -Provide support for native multicast
    -Use BGP4+ as a method of exchanging routes for
     inter-domain multicast
    -Use PIM-SM with MSDP
    -Concurrent use of BGP4+ and DVMRP for inter-domain
     routing is not recommended.  It is strongly
     recommended to use BGP4+ exclusively for inter-domain
     route exchange.

9. Security Considerations

   There are no security considerations unique to the multicast
   exchange.

LaMaster, Shultz, Meylor, Meyer                                 [Page 8]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

10. References

    [DVMRP]   T. Pusateri, "Distance Vector Multicast Routing
                 Protocol", <draft-ietf-idmr-dvmrp-v3-07.txt>,
                 August 1998.

    [BGP4+]   T. Bates, R. Chandra, D. Katz, Y. Rekhter,
                 "Multiprotocol Extensions for BGP-4", RFC 2283,
                 February 1998.

    [BGP4+2]  T. Bates, R. Chandra, D. Katz, Y. Rekhter,
                 "Multiprotocol Extensions for BGP-4", Internet Draft,
                 <draft-ietf-idr-bgp4-multiprotocol-v2-01.txt>,
                 August 1998.

    [PIM-SM]  D. Estrin, D. Farinacci, A. Helmy, D. Thaler, S. Deering,
                 M. Handley, V. Jacobson, C. Liu, P. Sharma, L. Wei,
                 "Protocol Independent Multicast-Sparse Mode (PIM-SM):
                 Protocol Specification", RFC 2362, June 1998.

    [PIM-DM]  S. Deering, D. Estrin, D. Farinacci, V. Jacobson, A.
   Helmy,
                 D. Meyer, L. Wei, "Protocol Independent Multicast
                 Version 2 Dense Mode Specification", Internet Draft,
                 <draft-ietf-pim-v2-dm-01.txt>, November 1998.

    [MSDP]    D. Farinacci, Y. Rekhter, P. Lothberg, H. Kilmer, J. Hall,
                 "Multicast Source Discovery Protocol (MSDP)",
                 <draft-farinacci-msdp-00.txt>, June 1998.

11. Author's Address

   Hugh LaMaster
   Steve Shultz
   NASA Ames Research Center
   Mail Stop 233-21
   Moffett Field, CA 94035-1000
   email: hlamaster@arc.nasa.gov
          shultz@arc.nasa.gov

   David Meyer
   John Meylor
   Cisco Systems
   San Jose, CA
   email: dmm@cisco.com

LaMaster, Shultz, Meylor, Meyer                                 [Page 9]


Internet Draft        draft-ietf-mboned-mix-02.txt            June, 1999

          jmeylor@cisco.com

LaMaster, Shultz, Meylor, Meyer                                [Page 10]