An Open Congestion Control Architecture for high performance fabrics
draft-zhuang-tsvwg-open-cc-architecture-00

Document Type Active Internet-Draft (individual)
Last updated 2019-11-03
Stream (None)
Intended RFC status (None)
Formats plain text xml pdf htmlized bibtex
Stream Stream state (No stream defined)
Consensus Boilerplate Unknown
RFC Editor Note (None)
IESG IESG state I-D Exists
Telechat date
Responsible AD (None)
Send notices to (None)
TSVWG                                                          Y. Zhuang
Internet-Draft                                                    W. Sun
Intended status: Informational                                    L. Yan
Expires: May 6, 2020                       Huawei Technologies Co., Ltd.
                                                        November 3, 2019

  An Open Congestion Control Architecture for high performance fabrics
               draft-zhuang-tsvwg-open-cc-architecture-00

Abstract

   This document describes an open congestion control architecture of
   high performance fabrics for the cloud operators and algorithm
   developers to deploy or develop new congestion control algorithms as
   well as make appropriate configurations for traffics on smart NICs in
   a more efficient and flexible way.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on May 6, 2020.

Copyright Notice

   Copyright (c) 2019 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (https://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of

Zhuang, et al.             Expires May 6, 2020                  [Page 1]
Internet-Draft           open congestion control           November 2019

   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Conventions . . . . . . . . . . . . . . . . . . . . . . . . .   3
   3.  Abbreviations . . . . . . . . . . . . . . . . . . . . . . . .   3
   4.  Observations in storage network . . . . . . . . . . . . . . .   4
   5.  Requirements of the open congestion control architecture  . .   5
   6.  Open Congestion Control (OpenCC) Architecture Overview  . . .   5
     6.1.  Congestion Control Platform and its user interfaces . . .   6
     6.2.  Congestion Control Engine (CCE) and its interfaces  . . .   7
   7.  Interoperability Consideration  . . . . . . . . . . . . . . .   7
     7.1.  Negotiate the congestion control algorithm  . . . . . . .   7
     7.2.  Negotiate the congestion control parameters . . . . . . .   8
   8.  Security Considerations . . . . . . . . . . . . . . . . . . .   8
   9.  Manageability Consideration . . . . . . . . . . . . . . . . .   8
   10. IANA Considerations . . . . . . . . . . . . . . . . . . . . .   8
   11. References  . . . . . . . . . . . . . . . . . . . . . . . . .   8
     11.1.  Normative References . . . . . . . . . . . . . . . . . .   8
     11.2.  Informative References . . . . . . . . . . . . . . . . .   8
   Appendix A.  Experiments  . . . . . . . . . . . . . . . . . . . .   9
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  12

1.  Introduction

   The datacenter networks (DCNs) nowadays is not only providing traffic
   transmission for tenants using TCP/IP network protocol stack, but
   also is required to provide RDMA traffic for High Performance
   Computing (HPC) and distributed storage accessing applications which
   requires low latency and high throughput.

   Thus, for datacenter application nowadays, the requirements of
   latency and throughput are more critical than the normal internet
   traffics, while network congestion and queuing caused by incast is
   the point that increases the traffic latency and affect the network
   throughput.  With this, congestion control algorithms aimed for low
   latency and high bandwidth are proposed such as DCTCP[RFC8257], [BBR]
   for TCP, [DCQCN] for [RoCEv2].

   Besides, the CPU utilization on NICs is another point to improve the
   efficiency of traffic transmission for low latency applications.  By
   offloading some protocol processing into smart NICs and bypassing
   CPU, applications can directly write to hardware which reduces the
   latency of traffic transmission.  RDMA and RoCEv2 is currently a good
   example to show the benefit of bypassing kernel/CPU while TCP
Show full document text