{"id":864,"date":"2026-05-07T15:50:02","date_gmt":"2026-05-07T07:50:02","guid":{"rendered":"https:\/\/connectword.dpdns.org\/?p=864"},"modified":"2026-05-07T15:50:02","modified_gmt":"2026-05-07T07:50:02","slug":"openai-introduces-mrc-multipath-reliable-connection-a-new-open-networking-protocol-for-large-scale-ai-supercomputer-training-clusters","status":"publish","type":"post","link":"https:\/\/connectword.dpdns.org\/?p=864","title":{"rendered":"OpenAI Introduces MRC (Multipath Reliable Connection): A New Open Networking Protocol for Large-Scale AI Supercomputer Training Clusters"},"content":{"rendered":"<p>Training frontier AI models is not just a compute problem \u2014 it is increasingly a networking problem. And OpenAI just introduced its solution.<\/p>\n<p>OpenAI announced the release of <strong>MRC (Multipath Reliable Connection)<\/strong>, a novel networking protocol developed over the past two years in partnership with AMD, Broadcom, Intel, Microsoft, and NVIDIA. The specification was published through the Open Compute Project (OCP), enabling the broader industry to use and build on it.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Why Networking is the Hidden Bottleneck in AI Training<\/strong><\/h3>\n<p>To understand why MRC matters, you need to understand what happens inside a supercomputer during model training. When training large AI models, a single step can involve many millions of data transfers. One transfer arriving late can ripple through the entire job, potentially causing GPUs to sit idle.<\/p>\n<p>Network congestion, link failures, and device failures are the most common sources of delay and jitter in transfers \u2014 and these problems get more frequent, and harder to solve, as the size of the cluster increases. This is the compounding infrastructure challenge OpenAI set out to fix.<\/p>\n<p>According to OpenAI, more than 900 million people use ChatGPT every week. 
Sustaining and improving those models at that scale means every second of GPU idle time represents real cost and capability loss. OpenAI states its goal as \u201cnot just to build a fast network, but also to build one that delivers very predictable performance, even in the presence of failures, to keep training jobs moving.\u201d<\/p>\n<h3 class=\"wp-block-heading\"><strong>What MRC Actually Does: Three Core Mechanisms<\/strong><\/h3>\n<p>MRC is not a ground-up invention. It extends RDMA over Converged Ethernet (RoCE) \u2014 an InfiniBand Trade Association (IBTA) standard that enables hardware-accelerated remote direct memory access among GPUs and CPUs. It draws on techniques developed by the Ultra Ethernet Consortium (UEC) and extends them with SRv6-based source routing to support large-scale AI networking fabrics.<\/p>\n<p>RoCE is a protocol that allows one machine to read or write memory on another machine directly over an Ethernet network, bypassing the CPU for maximum throughput. SRv6 (Segment Routing over IPv6) takes this further \u2014 the sending machine encodes the exact route the packet should follow directly inside the packet header, so switches no longer need to run complex routing calculations. This reduces the processing load on switches and saves power \u2014 a meaningful factor at data center scale.<\/p>\n<h4 class=\"wp-block-heading\"><strong>1. Adaptive Packet Spraying to Eliminate Congestion<\/strong><\/h4>\n<p>Instead of sending each transfer over a single network path, MRC spreads packets across hundreds of paths simultaneously, reducing congestion in the core of the network. With traditional RoCEv2, packets are confined to a single path from point A to point B, which contributes to congestion. To overcome this, MRC introduces Intelligent Packet-Spray Load Balancing, so that if a path is unusable, packets can traverse other paths in the network. 
This enables higher bandwidth utilization, reduced tail latency, and fine-grained load balancing at the packet level.<\/p>\n<h4 class=\"wp-block-heading\"><strong>2. Microsecond-Level Failure Recovery via SRv6 Static Source Routing<\/strong><\/h4>\n<p>When network paths, links, or switches fail, MRC can detect the problem and route around it on a microsecond timescale. Conventional network fabrics can take seconds or even tens of seconds to stabilize after failures. A key architectural decision makes this possible: the switches don\u2019t need to recompute routes or do anything other than blindly follow the static routes they were configured with. All routing intelligence lives at the NIC level, not the switch level. This is a deliberately unconventional design \u2014 disabling dynamic routing in the switches entirely to prevent two adaptive mechanisms from interfering with each other.<\/p>\n<p>Before MRC, if a link between a GPU\u2019s network interface and a tier-0 switch failed, the training job would fail. With MRC, the job survives with reasonable performance. If an 8-port network interface loses one port, the maximum rate is reduced by one eighth. MRC detects this, recalculates paths to avoid the failed plane, and immediately tells peers not to use that plane for inbound traffic. Most failed links recover within a minute, at which point MRC brings the plane back into use.<\/p>\n<h4 class=\"wp-block-heading\"><strong>3. Multi-Plane Networks with Fewer Switch Tiers and Lower Cost<\/strong><\/h4>\n<p>This is where MRC changes cluster architecture fundamentally. Instead of treating each network interface as one 800Gb\/s link, it is split into multiple smaller links. For example, one interface can connect to eight different switches. A switch that can connect 64 ports at 800Gb\/s can instead connect 512 ports at 100Gb\/s. This makes it possible to build a network fully connecting about 131,000 GPUs with only two tiers of switches. 
A conventional 800Gb\/s network would require three or four tiers.<\/p>\n<p>The savings compound further: the research team reports that for full bisection bandwidth, the two-tier multi-plane design requires <strong>2\/3 of the optics and 3\/5 the number of switches<\/strong> compared to a three-tier network. Fewer switch tiers also mean lower latency \u2014 the longest path traverses only three switches rather than five or seven \u2014 and a smaller blast radius when any individual component fails.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Hardware: Which NICs and Switches Run MRC<\/strong><\/h3>\n<p>As per the <a href=\"https:\/\/cdn.openai.com\/pdf\/resilient-ai-supercomputer-networking-using-mrc-and-srv6.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">research paper<\/a>, MRC is already running in production on specific, named hardware. It is implemented across 400 and 800Gb\/s RDMA NICs \u2014 including NVIDIA ConnectX-8, AMD Pollara, AMD Vulcano, and Broadcom Thor Ultra \u2014 with SRv6 switch support on NVIDIA Spectrum-4 and Spectrum-5 (running Cumulus and SONiC) and Broadcom Tomahawk 5 via Arista EOS. On the protocol side, AMD contributed the NSCC congestion control algorithm, now part of the UEC Congestion Control specification, along with IB\/RDMA transport semantic layer extensions that allow MRC to integrate with existing RDMA programming models while adding the multipath capabilities that set it apart from traditional transports.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Already in Production: From Stargate to Fairwater<\/strong><\/h3>\n<p>MRC is not just a prototype. It is already deployed across all of OpenAI\u2019s largest NVIDIA GB200 supercomputers used to train frontier models, including the site with Oracle Cloud Infrastructure (OCI) in Abilene, Texas, and in Microsoft\u2019s Fairwater supercomputers. MRC has been used to train multiple OpenAI models, leveraging hardware from NVIDIA and Broadcom. 
Microsoft\u2019s Fairwater supercomputers are located in Atlanta and Wisconsin.<\/p>\n<p>MRC has been used specifically to train frontier large language models for <strong>ChatGPT and Codex<\/strong>. During the training of a recent frontier model, OpenAI had to reboot four tier-1 switches. With MRC, the company did not need to coordinate the reboot with the teams running training jobs in the cluster.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h3>\n<ul class=\"wp-block-list\">\n<li><strong>OpenAI Introduces MRC<\/strong> \u2014 OpenAI partnered with AMD, Broadcom, Intel, Microsoft, and NVIDIA to release MRC (Multipath Reliable Connection) through the Open Compute Project (OCP).<\/li>\n<li><strong>Packet Spraying Kills Congestion<\/strong> \u2014 MRC spreads packets across hundreds of paths simultaneously, reducing core congestion and tail latency during large-scale GPU training.<\/li>\n<li><strong>Microsecond Failure Recovery<\/strong> \u2014 MRC detects link and switch failures and reroutes traffic in microseconds, keeping training jobs alive through failures that would previously have caused full job termination.<\/li>\n<li><strong>Two-Tier Topology for About 131,000 GPUs<\/strong> \u2014 By splitting 800Gb\/s interfaces into eight 100Gb\/s planes, MRC supports supercomputers with over 100,000 GPUs using only two tiers of switches instead of three or four.
<\/li>\n<li><strong>Already used for ChatGPT and Codex<\/strong> \u2014 MRC is already deployed across OpenAI\u2019s largest NVIDIA GB200 supercomputers and has been used to train frontier large language models for ChatGPT and Codex.<\/li>\n<\/ul>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out the <strong><a href=\"https:\/\/cdn.openai.com\/pdf\/resilient-ai-supercomputer-networking-using-mrc-and-srv6.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a><\/strong> and <strong><a href=\"https:\/\/openai.com\/index\/mrc-supercomputer-networking\/\" target=\"_blank\" rel=\"noreferrer noopener\">Technical details<\/a><\/strong>.<\/p>
\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/05\/07\/openai-introduces-mrc-multipath-reliable-connection-a-new-open-networking-protocol-for-large-scale-ai-supercomputer-training-clusters\/\">OpenAI Introduces MRC (Multipath Reliable Connection): A New Open Networking Protocol for Large-Scale AI Supercomputer Training Clusters<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Training frontier AI models is&hellip;<\/p>\n","protected":false},"author":1,"featured_media":29,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-864","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/864","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=864"}],"version-history":[{"count":0,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v
2\/posts\/864\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/media\/29"}],"wp:attachment":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=864"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=864"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=864"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}