More about HKUST
Distributed Oblivious Compaction and Applications to Joins
The Hong Kong University of Science and Technology
Department of Computer Science and Engineering
MPhil Thesis Defence
Title: "Distributed Oblivious Compaction and Applications to Joins"
By
Mr. Xian WANG
Abstract:
Cloud service providers, such as Amazon and Google, employ cryptographic
primitives and trusted execution environments (TEEs) to safeguard private
data. TEEs play a crucial role in enabling privacy-preserving computations;
however, they are vulnerable to access-pattern and side-channel attacks when
solely relying on encryption. Recent advancements in TEE-based oblivious
databases aim to mitigate these vulnerabilities by protecting memory access
patterns and isolating software side channels. Significantly, recent
research (VLDB'20; USENIX-SEC'25) has introduced effective oblivious
primitives for database operations, including join and sort operations.
Nevertheless, these single-server solutions encounter scalability challenges
due to the limited size of the Enclave Page Cache (EPC) and the overhead
associated with intensive page swaps, particularly when dealing with large
datasets. To address these constraints, research initiatives such as Opaque
(NSDI'17) and Jodes (VLDB'25) have investigated distributed oblivious
primitives for database joins within TEEs. Despite these advancements,
challenges persist, including performance bottlenecks, large constants in
computational complexity, and a lack of full parallelism.
This thesis proposes scalable and distributed oblivious algorithms that
prioritize parallelism, algorithmic complexity, practical constants, and
communication costs. We introduce the first distributed oblivious compaction
and expansion algorithms across multiple servers. Our work also includes
improved distributed oblivious sorting and aggregation tree algorithms that
are more efficient. Additionally, we apply the above algorithms and
implement new scalable and distributed oblivious joins. These algorithms
have smaller constants and communication overhead and are fully
parallelized. The improved aggregation tree algorithm offers O(log N)
complexity and O(1) communication overhead, and the remaining algorithms
have O(N log N) complexity and O(N) communication overhead. Our extensive
evaluations show speedups of 5.8 to 27.7x for compaction, up to 11.0x for
sorting, and 3.55x for aggregation trees. Our joins surpass baselines by
1.43 to 89.2x, demonstrating superior scalability.
Date: Friday, 27 June 2025
Time: 2:00pm - 4:00pm
Venue: Room 3494
Lifts 25/26
Chairman: Dr. Shuai WANG
Committee Members: Dr. Dimitris PAPADOPOULOS (Supervisor)
Prof. Cunsheng DING