Research Aims

Fundamental
Data Science Research
Transformation and Innovation of Knowledge
High-Impact  Solutions for 
Data  Science 
Apply Data Science in Key Domains

Research Topics

01

Scalable Data Management System

Develop “off-the-shelve” general-purpose big data management solutions that can be used to support abundant Data Science applications, including DBMS, Spatio-temporal Data Management, DB4AI, etc.  

02

Artificial ​Intelligence

Study the intersection of AI/LLM and the challenges related to data, aiming to address critical issues such as data quality, searching-based generation, bias, privacy, and ethics. 

03

AI Engineering and Implementation

Explore the transformative potential of AI/LLM technologies in various domains, leveraging advanced algorithms and models to revolutionize information processing.

Hardware

Providing researchers with world-class research platforms and facilities

Dell DSS8440 GPU Servers: 
5 servers featuring 40 Nvidia Ampere A100 GPUs, optimized for robust data processing and analysis.

Dell PowerEdge R940 Servers:   
6 servers, each with 4 Intel Xeon Gold 6248 processors and 1.5TB memory, coupled with high-speed 2.4TB SAS hard drives.

Dell PowerEdge R740xd2 Storage Servers: 
2 servers, each with Intel Xeon Silver 4210R processors, 256GB memory, and extensive storage options including 1.6TB NVMe drives and 480TB HDDs.

Supermicro 420GP-TNR:
2 servers with 8 Nvidia RTX4090 GPU with 24GB memory

Selected Publications

[1]  Xi Zhao, Zhonghan Chen, Kai Huang, Ruiyuan Zhang, Bolong Zheng, Xiaofang Zhou, "​Efficient Approximate Maximum Inner Product Search over Sparse Vectors", ICDE 2024.
[2] Yao Tian, Yan Tingyun, Ruiyuan Zhang, Kai Huang, Bolong Zheng, Xiaofang Zhou, "A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams", SIGMOD 2024.
[3] Ziyi Liu, Lei Li, Mengxuan Zhang, Wen Hua, Xiaofang Zhou, “Approximate Skyline Index for Constrained Shortest Pathfinding with Theoretical Guarantee”, ICDE 2024.
[4] Jing Zhao, Lei Li, Mengxuan Zhang, Zihan Luo, Xi Zhao, Xiaofang Zhou, “A Just-In-Time Framework for Continuous Routing”,  ICDE 2024.
[5] Shiwen Wu, Qiyu Wu, Honghua Dong, Wen Hua, Xiaofang Zhou, "Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution", PVLDB 2024. ​ 
[6] Hanmo LIU, Shimin DI, Lei CHEN, "Incremental Tabular Learning on Heterogeneous Feature Space", SIGMOD 2023.
[7] Jiajia Li, Xing Xiong, Lei Li, Dan He, Chuanyu Zong, Xiaofang Zhou, "Finding Top-k Optimal Routes with Collective Spatial Keywords on Road Networks", ICDE 2023.
[8] Ziyi Liu, Lei Li, Mengxuan Zhang, Wen Hua, Xiaofang Zhou, "FHL-Cube: Multi-Constraint Shortest Path Querying with Flexible Combination of Constraints", PVLDB 2022.
[9] Yao Tian, Xi Zhao, Xiaofang Zhou, "DB-LSH: Locality-Sensitive Hashing With Query-based Dynamic Bucketing", ICDE 2022.
[10] Dan He, Thomas Zhou, Xiaofang Zhou, Jiwon Kim, "An Efficient Algorithm for Maximum Trajectory Coverage Query with Approximation Guarantee", IEEE Transactions on Intelligent Transportation Systems. 
[11] Yao Tian, Tingyun Yan, Xi Zhao, Kai Huang, Xiaofang Zhou, "A Learned Index for Exact Similarity Search in Metric Spaces", IEEE Transactions on Knowledge and Data Engineering.

See more