More about HKUST
Flow Scheduling for Parallel Computing Applications in Datacenters
PhD Thesis Proposal Defence Title: "Flow Scheduling for Parallel Computing Applications in Datacenters" by Mr. Li CHEN Abstract: Distributed and parallel computing systems are cornerstones of this era of Big Data, machine learning, and arti ficial intelligence. This type of computing systems span over hundreds or thousands of machines in datacenter(s), so as to cope with the ever-expanding data volume and the increasing complexity of models/problems. For example, most of the recent interesting applications, such as web search, recommendation systems, and deep networks, run on clusters of thousands of machines for both small companies and large enterprises. As such scale, the communication between machines is a bottleneck issue, and the scheduling of communication sessions within applications (or network flows) is a key factor in the acceleration of these applications. This thesis focus on flow scheduling problems in datacenters. Specifically, we look at three important scheduling problems in real-world applications in datacenters: 1. Scheduling general flows for applications without knowledge of flow size, such as database query/response. We adopted the Multi-Level-Feedback queues in operating systems to network flows, and developed a queueing theory model to determine the optimal parameter settings. 2. Scheduling flows with or without completion time constraints (deadlines), which impacts user-facing applications, such as web search. We constructed a systematic solution for this problem, and derive a congestion window update function using Lyapunov Optimization techniques in control theory. 3. Scheduling groups of flows with semantic relationships (coflows) with application transparency, which emerges from data processing pipelines, such as Hadoop MapReduce. We used unsupervised learning to identify relationships between flows, and we designed an error-tolerant scheduling to mitigate the impact of mis-identification. We present the proposed solutions for each problem, and demonstrate the effectiveness via extensive simulations and experiments. Date: Friday, 2 December 2016 Time: 2:00pm - 4:00pm Venue: Room 3494 (lifts 25/26) Committee Members: Dr. Kai Chen (Supervisor) Prof. Gary Chan (Chairperson) Dr. Wei Wang Prof. Qian Zhang **** ALL are Welcome ****