Research Aims

Fundamental
Data Science Research
Transformation and Innovation of Knowledge
High-Impact  Solutions for 
Data  Science 
Apply Data Science in Key Domains

Research Topics

01

Scalable Data Management System

Develop “off-the-shelve” general-purpose big data management solutions that can be used to support abundant Data Science applications, including DBMS, Spatio-temporal Data Management, DB4AI, etc.  

02

Artificial ​Intelligence

Study the intersection of AI/LLM and the challenges related to data, aiming to address critical issues such as data quality, searching-based generation, bias, privacy, and ethics. 

03

AI Engineering and Implementation

Explore the transformative potential of AI/LLM technologies in various domains, leveraging advanced algorithms and models to revolutionize information processing.

Hardware

Providing researchers with world-class research platforms and facilities

Dell DSS8440 GPU Servers: 
5 servers featuring 40 Nvidia Ampere A100 GPUs, optimized for robust data processing and analysis.

Dell PowerEdge R940 Servers:   
6 servers, each with 4 Intel Xeon Gold 6248 processors and 1.5TB memory, coupled with high-speed 2.4TB SAS hard drives.

Dell PowerEdge R740xd2 Storage Servers: 
2 servers, each with Intel Xeon Silver 4210R processors, 256GB memory, and extensive storage options including 1.6TB NVMe drives and 480TB HDDs.

Supermicro 420GP-TNR:
2 servers with 8 Nvidia RTX4090 GPU with 24GB memory

Selected Publications

1.       Jiawen Zhang, Shun Zheng, Xumeng Wen, Xiaofang Zhou, Jiang Bian, Jia Li, "ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer", NeurIPS 2024. 
2.       Ran Li, Shimin Di, Lei Chen, Xiaofang Zhou, "SimDiff: Simple Denoising Probabilistic Latent Diffusion Model for Data Augmentation on Multi-modal Knowledge Graph", SIGKDD 2024. 
3.       Weijia Zhang, Chenlong Yin, Hao Liu, Xiaofang Zhou, Hui Xiong, "Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks Approach",  ICML 2024. 
4.       Xi Zhao, Zhonghan Chen, Kai Huang, Ruiyuan Zhang, Bolong Zheng, Xiaofang Zhou, "​Efficient Approximate Maximum Inner Product Search over Sparse Vectors", ICDE 2024. 
5.       Yao Tian, Yan Tingyun, Ruiyuan Zhang, Kai Huang, Bolong Zheng, Xiaofang Zhou, "A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams", SIGMOD 2024. 
6.       Ziyi Liu, Lei Li, Mengxuan Zhang, Wen Hua, Xiaofang Zhou, “Approximate Skyline Index for Constrained Shortest Pathfinding with Theoretical Guarantee”, ICDE 2024. 
7.       Jing Zhao, Lei Li, Mengxuan Zhang, Zihan Luo, Xi Zhao, Xiaofang Zhou, “A Just-In-Time Framework for Continuous Routing”, ICDE 2024. 
8.       Shiwen Wu, Qiyu Wu, Honghua Dong, Wen Hua, Xiaofang Zhou, "Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution", PVLDB 2024. ​  
9.       Hanmo LIU, Shimin DI, Lei CHEN, "Incremental Tabular Learning on Heterogeneous Feature Space", SIGMOD 2023. 
10.     Jiajia Li, Xing Xiong, Lei Li, Dan He, Chuanyu Zong, Xiaofang Zhou, "Finding Top-k Optimal Routes with Collective Spatial Keywords on Road Networks", ICDE 2023.  

See more