More about HKUST
Network Transport for AI-Centric Networking
PhD Thesis Proposal Defence Title: "Network Transport for AI-Centric Networking" by Mr. Hao WANG Abstract: Driven by the increasing complexity of machine learning (ML) applications, such as autonomous driving, generative artificial intelligence (GAI), and game AI, the size of ML models is increasing explosively, from ResNet50 with 23M parameters to GPT-3 with 175B parameters. ML practitioners usually leverage distributed training systems to parallelize the training process for large models and datasets, and communication network could become the bottleneck in this case. To train large models, people start to use customized AI clusters, e.g., xAI’s Colossus with 100,000 GPUs, to replace the traditional data centers. A paradigm shift occurs from general-purpose networking to AI-centric networking. Traditional network transport mechanisms, however, are often inadequate to meet the unique latency, throughput, scalability, and reliability requirement of AI-centric applications. Additionally, current transports overlook the inherent characteristics of AI-centric applications, e.g., loss tolerance and traffic predictability. This thesis seeks to handle the paradigm shift by proposing specialized network transports tailored for AI-centric networking. We first propose our domain-specific network transport protocol for distributed training, which includes message semantics, reliability, rate control, flow scheduling, and load balancing. Next, to further enhance throughput and scalability, we introduce our transport that leverages in-network computing with data-plane memory scheduling on programmable switches. Finally, we discuss the emerging AI-centric networking protocols in the industry, such as UEC (Ultra Ethernet Consortium) and their relationship to ours, focusing on congestion control and load balancing. Date: Monday, 9 December 2024 Time: 4:00pm - 6:00pm Venue: Room 3494 Lifts 25/26 Committee Members: Prof. Kai Chen (Supervisor) Prof. Qiong Luo (Chairperson) Dr. Yangqiu Song Dr. Binhang Yuan