Skip to main content

Adaptive Routing Notification
draft-wh-rtgwg-adaptive-routing-arn-03

The information below is for an old version of the document.
Document Type
This is an older version of an Internet-Draft whose latest revision state is "Expired".
Expired & archived
Authors Haibo Wang , Hongyi Huang , Xuesong Geng , Xiaohu Xu , Yinben Xia
Last updated 2025-03-17 (Latest revision 2024-09-13)
RFC stream (None)
Formats
Stream Stream state (No stream defined)
Consensus boilerplate Unknown
RFC Editor Note (None)
IESG IESG state Expired
Telechat date (None)
Responsible AD (None)
Send notices to (None)

This Internet-Draft is no longer active. A copy of the expired Internet-Draft is available in these formats:

Abstract

Large-scale supercomputing and AI data centers utilize multipath to implement load balancing and/or improve transport reliability. Adaptive routing (AR), widely used in direct topologies such as dragonfly, is growing popular in commodity data centers to dynamically adjust routing policies based on path congestion and failures. When congestion or failure occurs, the sensing node can not only apply AR locally but also send the congestion/failure information to other nodes in a timely and accurate manner to enforce AR on other nodes, thus avoiding exacerbating congestion on the reported path. This document specifies Adaptive Routing Notification (ARN), a general mechanism to proactively disseminate congestion detection and congestion elimination information for remote nodes to perform re-routing policies.

Authors

Haibo Wang
Hongyi Huang
Xuesong Geng
Xiaohu Xu
Yinben Xia

(Note: The e-mail addresses provided for the authors of this Internet-Draft may no longer be valid.)