More about HKUST
Efficient Aggregation under Differential Privacy
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "Efficient Aggregation under Differential Privacy" By Miss Juanru FANG Abstract: Differential privacy (DP) has become the mainstream privacy standard due to its strong protection of individual users' information. It guarantees that for any neighboring datasets, the output of the mechanism is practically identical. There are two common definitions of neighborhoods in differential privacy. The first is record-level DP, where neighboring datasets differ by exactly one record; the second is user-level DP, where two datasets are neighbors if they differ by a single user's data. Some commonly used aggregation functions include SUM, MAX, AVERAGE, etc. In our first work, we note that many frequently encountered aggregation functions are monotonic or can be derived from monotonic functions. Based on this insight, we design a general user-level DP mechanism that estimates monotonic functions with strong optimality guarantees. While our general mechanism may run in super-polynomial time, we show how to instantiate an approximate version that runs in polynomial time, making it efficient for certain common monotonic functions. From our first work, we observe that existing user-level DP mechanisms still have high computational costs due to the complex relationships between users and records. To address this challenge, random sampling has proven to be an effective approach for reducing computational costs. However, all existing sampling-based mechanisms are designed for record-level DP. Therefore, in our second work, we take the first step towards investigating the application of random sampling under user-level DP and propose an accurate and efficient sampling-based mechanism for the sum estimation function. A limitation of the first two works is that they focus on the central DP model, where all users trust the data curator. However, this assumption is often impractical. Thus, in our third work, we shift to the distributed setting and focus on the shuffle DP model. We explore the subgraph counting problem under edge DP, which is a specific case of aggregation function estimation in the context of record-level DP. We propose a general sampling-based mechanism, apply it to several common subgraphs, and analyze the performance guarantees. Date: Thursday, 29 May 2025 Time: 3:00pm - 5:00pm Venue: Room 2128A Lift 19 Chairman: Dr. Zhihong GUO (CHEM) Committee Members: Prof. Ke YI (Supervisor) Prof. Qiong LUO Dr. Dimitris PAPADOPOULOS Prof. Yuan YAO (MATH) Dr. Hubert CHAN (HKU)