More about HKUST
Random Sampling on Big Data: Techniques and Applications
Speaker: Professor Ke YI Department of Computer Science and Engineering Hong Kong University of Science and Technology Title: "Random Sampling on Big Data: Techniques and Applications" Date: Monday, 14 Oct 2019 Time: 4:00pm - 5:00pm Venue: Lecture Theater F (near lift no. 25/26), HKUST Abstract: Random sampling is a powerful tool for big data analytics. It can be used whenever complete accuracy is not required, while offering order-of-magnitude improvements in query efficiency. Random sampling has been extensively studied in both the statistics and computer science literature. This talk will take a "sample" of this huge literature. In particular, we will discuss random sampling over streaming and distributed data, importance sampling, merge-reduce sampling, and sampling for approximate query processing. ***************** Biography: Ke Yi is a Professor in the Department of Computer Science and Engineering, Hong Kong University of Science and Technology. He obtained his Bachelor's degree from Tsinghua University (2001) and PhD from Duke University (2006), both in computer science. His research spans theoretical computer science and database systems. He has received a Google Faculty Research Award (2010), the Young Investigator Research Award from HKUST (2012), a SIGMOD Best Demonstration Award (2015), and the SIGMOD Best Paper Award (2016). He currently serves as an Associate Editor of ACM Transactions on Database Systems and IEEE Transactions on Knowledge and Data Engineering.