Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters
NVIDIA DGX SuperPOD: Next Generation Scalable Infrastructure for AI Leadership
Fabric-Scheduled Ethernet as an Effective Backend Interconnect for Large AI Compute Clusters
RDG for a Scalable, High-performance Kubernetes Cluster over NVIDIA Ethernet Fabric
Maintaining large-scale AI capacity at Meta
Scale-Out Ethernet fabric deployment for RoCE accelerated K8s cluster