More about HKUST
Communication-Efficient Algorithms for Tracking Distributed Data Streams
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "Communication-Efficient Algorithms for Tracking Distributed Data Streams" By Mr. Qin Zhang Abstract We investigate several basic problems in the distributed streaming model. In the this model, we have k sites, each receiving a stream of elements over time. There is a designated coordinator who would like to track, that is, maintain continuously at all times, some function f of all the elements received from the k sites. There is a two-way communication channel between each site and the coordinator, and the goal is to track f with minimum communication. This model is motivated by applications in distributed databases, network monitoring and sensor networks. In this thesis, we design algorithms to track some fundamental and useful functions, including random sampling, heavy hitters and quantiles. We also show that our algorithms have optimal communication costs in the worst case (up to some polylogarithmic factors in a few cases). In addition, we observed that for some problems considering the worst-case communication cost is meaningless, so we propose to use competitive analysis and give online tracking algorithms whose performance is competitive against the optimal offline algorithm that knows the entire stream in advance. Although this thesis is primarily concerned with the theory of distributed streaming, we expect that our study will also have a significant impact on real-world distributed streaming systems. Date: Tuesday, 18 May 2010 Time: 10:00am – 12:00noon Venue: Room 3494 Lifts 25/26 Chairman: Prof. Guohua Chen (CBME) Committee Members: Prof. Mordecai Golin (Supervisor) Prof. Ke Yi (Supervisor) Prof. Sunil Arya Prof. Siu Wing Cheng Prof. Jeffrey Chasnov (MATH) Prof. Tak Wah Lam (Comp. Sci., HKU) **** ALL are Welcome ****