Draft:Datacenter Congestion Control

Review waiting, please be patient.

This may take 2 months or more, since drafts are reviewed in no specific order. There are 2,840 pending submissions waiting for review.

If the submission is accepted, then this page will be moved into the article space.
If the submission is declined, then the reason will be posted here.
In the meantime, you can continue to improve this submission by editing normally.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Reviewer tools

Instructions · What links here · Datacenter Congestion Control (talk: + · bio) · (log) · Copyvios report · reFill · Citation Bot · (Search: Google, Wikipedia) · Submitted 31 days ago by 216.228.125.129 (talk: D · +) · Last edited 31 days ago by Citation bot

Datacenter congestion control is the set of techniques and mechanisms used to manage network traffic within datacenters to prevent network congestion and guarantee efficient transmission. When multiple servers send data simultaneously through shared network infrastructure, congestion can occur. Congestion control algorithms determine how fast each sender should transmit data, when to slow down, and when it's safe to speed up again.

Importantly, datacenter congestion control algorithms operate in an environment that is fundamentally different than traditional internet congestion control, which is mostly handled using TCP (Transmission Control Protocol). TCP is designed for internet, which has high latency and unpredictable conditions. Datacenters network must meet much lower latency (micro seconds instead of milli seconds), high bandwidth, and can count on having more predictable network topologies. Datacenter congestion control mechanisms therefore must react much faster and more precisely than their internet counterparts. Also, since Round-trip times within a datacenter can be as low as a few microseconds, congestion can build up within microseconds.

Methods for Datacenter Congestion control

Data Center TCP (DCTCP)

DCTCP^[1] takes a fundamentally different approach from traditional TCP. Instead of treating congestion as a binary event DCTCP provides multi-bit feedback about the extent of congestion. It leverages Explicit Congestion Notification (ECN), a feature where switches can mark packets when their queues exceed a certain threshold, rather than dropping them. The sender tracks the fraction of packets marked with ECN and adjusts transmission rate in a way that is proportional to congestion level.

TIMELY

TIMELY^[2] uses delay as the primary congestion signal. In datacenter networks, increases in round-trip time (RTT) correlate strongly with growing queue lengths at switches. TIMELY senders measures RTT at microsecond granularity and use a rate-based control algorithm to increase sending rate when RTT is low and stable.

DCQCN (Data Center Quantized Congestion Notification)

DCDQN^[3] is designed for RDMA over Converged Ethernet (RoCE) networks. It control transmission rates at the network interface card level,

and signals congestion by marking Explicit Congestion Notification (ECN). It also uses a feedback mechanism where receivers send explicit congestion notification packets back to senders. DCQCN reduces the rate immediately when congestion is detected, then gradually increases it like the TCP additive increase approach.

ADPG (Reinforcement Learning for Datacenter Congestion Control)

ADPG^[4], rather than designing explicit rules for adjusting rates, this approach uses a reinforcement learning (RL) algorithm to trains an agent that learns optimal congestion control policies through experience. The RL agent uses packet loss, latency measurements, and response patterns to select an action (raising or lowering the sending rate) that would lead to the best outcomes in terms of throughput and latency. This learning-based approach outperforms fixed rules by discovering complex control policies that are hard-to-find for human designers.

References

^ Alizadeh, Mohammad; Greenberg, Albert; Maltz, David A.; Padhye, Jitendra; Patel, Parveen; Prabhakar, Balaji; Sengupta, Sudipta; Sridharan, Murari (2010-08-30). "Data center TCP (DCTCP)". SIGCOMM Comput. Commun. Rev. 40 (4): 63–74. doi:10.1145/1851275.1851192. ISSN 0146-4833.
^ Mittal, Radhika; Lam, Vinh The; Dukkipati, Nandita; Blem, Emily; Wassel, Hassan; Ghobadi, Monia; Vahdat, Amin; Wang, Yaogong; Wetherall, David; Zats, David (2015-08-17). "TIMELY: RTT-based Congestion Control for the Datacenter". SIGCOMM Comput. Commun. Rev. 45 (4): 537–550. doi:10.1145/2829988.2787510. ISSN 0146-4833.
^ Zhu, Yibo; Eran, Haggai; Firestone, Daniel; Guo, Chuanxiong; Lipshteyn, Marina; Liron, Yehonatan; Padhye, Jitendra; Raindel, Shachar; Yahia, Mohamad Haj; Zhang, Ming (2015-08-17). "Congestion Control for Large-Scale RDMA Deployments". SIGCOMM Comput. Commun. Rev. 45 (4): 523–536. doi:10.1145/2829988.2787484. ISSN 0146-4833.
^ Tessler, Chen; Shpigelman, Yuval; Dalal, Gal; Mandelbaum, Amit; Haritan Kazakov, Doron; Fuhrer, Benjamin; Chechik, Gal; Mannor, Shie (2022-01-20). "Reinforcement Learning for Datacenter Congestion Control". SIGMETRICS Perform. Eval. Rev. 49 (2): 43–46. doi:10.1145/3512798.3512815. ISSN 0163-5999.

[1] Alizadeh, Mohammad; Greenberg, Albert; Maltz, David A.; Padhye, Jitendra; Patel, Parveen; Prabhakar, Balaji; Sengupta, Sudipta; Sridharan, Murari (2010-08-30). "Data center TCP (DCTCP)". SIGCOMM Comput. Commun. Rev. 40 (4): 63–74. doi:10.1145/1851275.1851192. ISSN 0146-4833.

[2] Mittal, Radhika; Lam, Vinh The; Dukkipati, Nandita; Blem, Emily; Wassel, Hassan; Ghobadi, Monia; Vahdat, Amin; Wang, Yaogong; Wetherall, David; Zats, David (2015-08-17). "TIMELY: RTT-based Congestion Control for the Datacenter". SIGCOMM Comput. Commun. Rev. 45 (4): 537–550. doi:10.1145/2829988.2787510. ISSN 0146-4833.

[3] Zhu, Yibo; Eran, Haggai; Firestone, Daniel; Guo, Chuanxiong; Lipshteyn, Marina; Liron, Yehonatan; Padhye, Jitendra; Raindel, Shachar; Yahia, Mohamad Haj; Zhang, Ming (2015-08-17). "Congestion Control for Large-Scale RDMA Deployments". SIGCOMM Comput. Commun. Rev. 45 (4): 523–536. doi:10.1145/2829988.2787484. ISSN 0146-4833.

[4] Tessler, Chen; Shpigelman, Yuval; Dalal, Gal; Mandelbaum, Amit; Haritan Kazakov, Doron; Fuhrer, Benjamin; Chechik, Gal; Mannor, Shie (2022-01-20). "Reinforcement Learning for Datacenter Congestion Control". SIGMETRICS Perform. Eval. Rev. 49 (2): 43–46. doi:10.1145/3512798.3512815. ISSN 0163-5999.

[1]

[2]

[3]

[4]