More about HKUST
A Survey on AI-based Table Column Semantic Type Annotation Methods
PhD Qualifying Examination Title: "A Survey on AI-based Table Column Semantic Type Annotation Methods" by Mr. Yushi SUN Abstract: Table column semantic type annotation is crucial for a variety of data preparation and discovery tasks within the database domain. Accurately annotating the data types of table columns holds tremendous benefits across a diverse range of applications. Specifically, precise annotation of column type enhances the quality of data integration, schema matching, data cleaning, and data visualization. Due to its great value and importance in the data preparation and integration field, there is a growing interest in developing advanced table column semantic type annotation methods. Recent methods introduce state-of-the-art AI techniques for this task to achieve improvement in annotation quality. Despite the rapid advancement in the performance of AI-based table column semantic type annotation approaches, there is a lack of a systematic review of existing approaches to summarize the limitations of existing solutions, challenges that remain unsolved, and the future directions of this task. In this survey, we first introduce the background of the table column semantic type annotation task. We then define the important concepts and the problems of the task. Based on the problem statement and the concepts defined, we classify the existing AI-based table column semantic type annotation methods and evaluate the advantages and disadvantages of each category. The survey ends with a discussion of the future directions and opportunities based on the classification and evaluation of the existing approaches. Date: Friday, 12 July 2024 Time: 2:00pm - 4:00pm Venue: Room 3494 Lifts 25/26 Committee Members: Prof. Lei Chen (Supervisor) Prof. Xiaofang Zhou (Chairperson) Dr. Dan Xu Dr. Lei Li (HKUST-GZ)