More about HKUST
Task-aware Scheduling for Data Center Networks
PhD Qualifying Examination Title: "Task-aware Scheduling for Data Center Networks" by Miss Jingjie JIANG Abstract: As the most prevalent computing platform for data parallel applications, data centers have received extensive attentions recently. Leveraging the scale of economy, data centers are much more cost-efficient than traditional enterprise networks. The ambulant redundancy of both computation servers and network devices guarantee the reliability of data centers. The symmetric network typologies in most data centers further provide many candidate paths to ensure network connectivity. In addition, data centers usually adopt uniform management strategies across the network, which can elastically control the network and improve the security of stored data. Tasks in a data center, such as web search queries or MapReduce jobs, usually consist of plenty of flows transferring data among different virtual machines across the network. The completion time of these flows significantly influence the user perceived performance and network efficiency. In this paper, we investigate the state-of-art flow scheduling schemes in data center networks. Before the introduction of task-awareness, both academic research and industry standard have focused on reducing the completion time of each individual flow, aiming to minimize the average completion time of all the flows in a network. This is equivalent to maximizing the overall throughput of a network. Nevertheless, different flows within the same task are correlated with one another, in the sense that a task is not considered to be completed till the completion of all its constituent flows. Therefore, the task completion time is far more relevant than the completion time of any individual flow within. We need to coordinate and schedule the flows with the task semantics into consideration. Task-aware scheduling is a novel concept, and there exists many challenges and open problems to solve. In this paper, we comprehensively investigate existing literature regarding flow scheduling in data center networks. By comparing the scheduling strategies with and without task-awareness, we reiterate the importance of task-aware scheduling. We further examine the feasibility and limitations of several tentative task-aware scheduling schemes. Based on the in-depth analysis, we discuss some promising techniques that can be integrated to further improve the task performance in data center networks. Date: Tuesday, 10 June 2014 Time: 4:30pm - 6:30pm Venue: Room 3501 Lifts 25/26 Committee Members: Prof. Bo Li (Supervisor) Dr. Lei Chen (Chairperson) Dr. Kai Chen Dr. Lin Gu **** ALL are Welcome ****