More about HKUST
Label-efficient Learning by Exploiting Unlabeled Data with Higher-quality Supervision and Wider Applicability
PhD Thesis Proposal Defence
Title: "Label-efficient Learning by Exploiting Unlabeled Data with
Higher-quality Supervision and Wider Applicability"
by
Miss Huimin WU
Abstract:
This thesis seeks to explore label-efficient learning techniques that aim to
mitigate the reliance on large-scale human labels in training deep learning
models. Our focus is on developing strategies that leverage unlabeled data,
which are often redundant, and gradually decrease the number of human labels
required for training predictive models. Specifically, our investigation covers
semi- supervised learning (with 20% labeled data), barely-supervised learning
(with only a few labeled data points), and self-supervised learning that
requires no human labels at all. However, the quality of the constructed
supervision for the purpose of exploiting unlabeled data is the primary
challenge we face. As such, our primary objective is to construct
higher-quality supervision that is more expressive, accurate, and generic.
Moreover, the current state-of-the-art techniques are limited in terms of their
scope of use. Thus, we endeavor to enhance the range of applicability of the
constructed method.
Firstly, we enhance semi-supervised medical image segmentation by seeking more
expressive forms of supervision without the aid of modality or task priors.
Traditional methods of supervising segmentation are often limited to one-hot
vectors or their soft variants. Alternatively, we propose utilizing contrastive
loss to train a more compact and better-separated feature space. This approach
is more robust to noise and exhibits better generalization abilities for test
data. Furthermore, our proposed method is not reliant on specific modalities or
tasks, making it more adaptable to diverse application scenarios.
Then, we reduce the number of human labels and explore the barely supervised
learning setting, which has received limited attention. This setting is
characterized by the presence of only a few labeled data points, making it
challenging to achieve high-quality supervision. Standard semi-supervised
methods constructed with fewer human annotations often produce unsatisfactory
results. Additionally, prior research on barely supervised learning has limited
applicability due to the assumption of structural similarity between the data
as the supervision. To address these challenges, we propose an online
confidence thresholding technique that can generate more accurate pseudo
labels. Without resorting to ground truth similarity, this algorithm can be
applied to a broader range of realistic segmentation problems.
Thirdly, we explore the topic of self-supervised learning, a type of learning
that does not rely on human labels. Instead, it utilizes data as a form of
supervision. Presently, pre-training strategies lack data generality.
Therefore, the objective of this work is centered around finding data-generic
supervision capable of being applied to any data modality. To achieve this
objective, we propose randomized quantization as contrastive learning
augmentation. Our method has demonstrated better performance than prior
data-agnostic self-supervised techniques. We have validated its effectiveness
over a vast range of data modalities, including vision, audio, 3D point clouds,
and DABS, a public benchmark for data-agnostic self-supervised learning.
In our last work, we enhance the downstream application generality of
self-supervised learning techniques. Current self-supervised learning
techniques are mainly designed for semantic tasks. In order to expand the
application scenarios of these methods, we propose to adapt general-purpose
large-scale pre-trained models on natural videos to multi-view geometrical
tasks with an empirical study on optical flow estimation. Unlike previous flow
estimation methods that rely on complicated architectural components
specialized to geometry tasks, our overall architecture does not have any
task-specific inductive bias, which significantly simplifies the architectural
design. The strong performance validates its effectiveness.
Date: Thursday, 30 May 2024
Time: 2:00pm - 4:00pm
Venue: Room 5510
Lifts 25/26
Committee Members: Prof. Tim Cheng (Supervisor)
Dr. Xiaomeng Li (Supervisor)
Dr. Xiaojuan Ma (Chairperson)
Dr. Shuai Wang