More about HKUST
Vision-Language Pre-training for Medical Imaging
PhD Qualifying Examination Title: "Vision-Language Pre-training for Medical Imaging" by Mr. Xiaoyu ZHENG Abstract: Data plays a crucial role in advancing current Artificial Intelligence (AI) technologies. Various data modalities such as images, texts, videos and audios are collected and utilized for training AI systems to obtain better performance. In medical domain, a doctor will also take multi-modal factors (e.g. blood test results, medical images, previous drug history) into consideration for diagnosis and treatment. Among them, medical imaging examination result is one of the most decisive factors, thus, many AI systems in early stage aims to predict the labels of corresponding medical images for classification, segmentation and object detection by feeding the images and its annotation from doctors to a deep learning model. However, the human annotation in medical domain often requires trained experts, which makes it hard to generate very large-scale labelled data for supervised model training. In recent years, self-supervised learning is considered as a promising approach to boost the AI model performance in medical domain. The self-supervised learning aim to learn robust representations form data itself, which doesn't require data labeling during the training stage. In medical imaging domain, one advantage for self-supervised learning is that the images are often paired with their reports consisting of the image description and the diagnosis result, which makes the vision-language pre-training realizable. By aligning the text representations and image representations into the same space, the pre-training models can even achieve a better performance compared with supervised methods and some models can also gain the zero-shot capability. In this survey, we present the mainstream vision-language pretraining frameworks and datasets for 2D, 3D and video medical imaging. Beyond that, we also provide a summary of downstream datasets and tasks. Meanwhile, the challenges and future directions are also discussed. Date: Tuesday, 30 July 2024 Time: 4:00pm - 6:00pm Venue: Room 5508 Lifts 25/26 Committee Members: Dr. Hao Chen (Supervisor) Dr. Qifeng Chen (Chairperson) Dr. Junxian He Dr. Dan Xu