More about HKUST
Distributed Algorithms for Computing Statistical Information on Massive Data
PhD Qualifying Examination Title: "Distributed Algorithms for Computing Statistical Information on Massive Data" by Mr. Zengfeng Huang Abstract: Consider a distributed system with k nodes, where each node holds data set, and the goal is to design communication-efficient algorithms for computing functions over the union of the k data sets. In this survey, we focus on computing some most important statistical information of the underlying data, in particular item frequencies, heavy hitters, quantiles, top-m items, and random samples. We will consider both a flat network structure and more complicated tree networks. We also consider the case where the inputs are not static sets, but k data streams, and the goal is to continuously track these functions over the data that has arrived at all nodes so far. Date: Friday, 7 May 2010 Time: 2:00pm - 4:00pm Venue: Room 3304 lifts 17/18 Committee Members: Dr. Ke Yi (Supervisor) Prof. Siu-Wing Cheng (Chairperson) Dr. Sunil Arya Prof. Mordecai Golin **** ALL are Welcome ****