AI RESEARCH

OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol

arXiv CS.AI

ArXi:2605.28717v1 Announce Type: new Modern datacenter RDMA is bottlenecked at the network interface, not the wire. A NIC running RoCE or InfiniBand holds per-connection state for every (application, remote-endpoint) pair - hundreds of megabytes at 1024-application fanout - and pays a four-traversal PCIe round trip on a 64-byte operation, inflating latency an order of magnitude beyond the wire. Both follow from the Queue Pair over PCIe abstraction RDMA inherits from InfiniBand.