More about HKUST
An Exploration of Three Approaches to Automating Data Augmentation for Machine Learning
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "An Exploration of Three Approaches to Automating Data Augmentation for Machine Learning" By Mr. Tsz Him CHEUNG Abstract: Data Augmentation is an effective way to improve the generalization of deep learning models. However, current augmentation methods rely on manual operations, such as flipping and cropping for image data. These methods are often designed based on human expertise or trial and error. Meanwhile, Automated Data Augmentation (AutoDA) is a promising research direction that treats data augmentation as a learning task and discovers the most effective ways to augment the data. This thesis examines recent AutoDA research and introduces four new methods to enhance AutoDA using composition-based, mixing-based, and generation-based approaches. Existing composition-based AutoDA methods apply a fixed policy to the entire dataset and mainly focus on image classification tasks. In response, we propose AdaAug to learn class- and instance-adaptive augmentation policies and MODALS to learn modality-agnostic latent space augmentation strategies. For mixing-based AutoDA, we introduce TransformMix to learn transformation and mixing augmentation strategies from data. Compared to previous mixing-based methods, TransformMix considers the saliency information of input images and produces more compelling mixed images that contain accurate and important information for the target tasks. Regarding generation-based AutoDA, we present AutoGenDA to address imbalanced classification tasks. Specifically, AutoGenDA identifies and transfers label-invariant changes across data classes through image captions and text-guided generative models. We also propose an automated search strategy to adapt the AutoGenDA augmentation to each data class, leading to better generalization. Date: Thursday, 16 January 2025 Time: 10:00am - 12:00noon Venue: Room 5506 Lifts 25/26 Chairman: Dr. Jin QI (IEDA) Committee Members: Prof. Dit-Yan YEUNG (Supervisor) Prof. Albert CHUNG Prof. Raymond WONG Dr. Can YANG (MATH) Prof. Sinno Jialin PAN (CUHK)