More about HKUST
An Exploration of Three Approaches to Automating Data Augmentation for Machine Learning
PhD Thesis Proposal Defence Title: "An Exploration of Three Approaches to Automating Data Augmentation for Machine Learning" by Mr. Tsz Him CHEUNG Abstract: Data Augmentation is an effective way to improve the generalization of deep learning models. However, the current augmentation methods primarily rely on manual operations, such as flipping and cropping for image data. These methods are often designed based on human expertise or trial and error. Meanwhile, Automated Data Augmentation (AutoDA) is a promising research direction that treats the data augmentation process as a learning task and discovers the most effective ways to augment the data. In this thesis, we examine recent AutoDA research and introduce four new methods to enhance AutoDA using composition-based, mixing-based, and generation-based approaches. Existing composition-based AutoDA methods apply a fixed policy to the entire dataset and mainly focus on image classification tasks. In response, we propose AdaAug to learn class- and instance-adaptive augmentation policies and MODALS to learn modality-agnostic latent space augmentation strategies. For mixing-based AutoDA, we introduce TransformMix to learn transformation and mixing augmentation strategies from data. Compared to previous mixing-based methods, TransformMix takes into account the saliency information of the input images and produces more compelling mixed images that contain accurate and important information for the target tasks. Regarding generation-based AutoDA, we present AutoGenDA to address imbalanced classification tasks. Specifically, AutoGenDA identifies and transfers label-invariant changes across data classes through image captions and text-guided generative models. We also propose an automated search strategy to adapt the AutoGenDA augmentation to each data class, leading to better generalization. Date: Tuesday, 27 August 2024 Time: 4:00pm - 6:00pm Venue: Room 3494 Lifts 25/26 Committee Members: Prof. Dit-Yan Yeung (Supervisor) Prof. Nevin Zhang (Chairperson) Prof. Albert Chung Dr. Qifeng Chen