More about HKUST
A Survey on the GPU Multiplexing for DNN Inference and Training: A Bottom-up Approach
PhD Qualifying Examination
Title: "A Survey on the GPU Multiplexing for DNN Inference and Training: A
Bottom-up Approach"
by
Mr. Haoxuan YU
Abstract:
The intermittent and uneven resource demands of deep neural network (DNN)
workloads lead to suboptimal utilization of GPUs as multi-dimensional
resources, a problem exacerbated by the rapid advancement of GPU capabilities.
To enhance utilization of dominant GPUs, GPU multiplexing has become a widely
adopted strategy, effectively reducing the total cost of ownership (TCO) of GPU
clusters. However, the software infrastructure provided by hardware vendors
lacks native support for GPU multiplexing, posing challenges for fine-grained
GPU resource allocation and performance isolation.
This survey reviews recent research efforts on GPU multiplexing, with a focus
on resource utilization and performance isolation. We analyze device drivers,
programming toolkits, machine learning (ML) frameworks, and cluster management
from the bottom up, exploring opportunities for improving GPU multiplexing from
multiple perspectives. We hope this survey sheds light on system optimization
for GPU multiplexing and facilitates future designs of GPU multiplexing
software stacks.
Date: Tuesday, 20 August 2024
Time: 3:00pm - 5:00pm
Venue: Room 5510
Lifts 25/26
Committee Members: Prof. Song Guo (Supervisor)
Dr. Wei Wang (Chairperson)
Prof. Kai Chen
Prof. Qian Zhang