More about HKUST
Robust Scene Understanding in Challenging Scenarios
The Hong Kong University of Science and Technology
Department of Computer Science and Engineering
PhD Thesis Defence
Title: "Robust Scene Understanding in Challenging Scenarios"
By
Mr. Tuan Anh VU
Abstract:
In recent years, computer vision and graphics fields have witnessed significant
progress with the emergence of novel techniques and architectures to address
complex challenges. This thesis presents novel methodologies and advancements
to enhance the accuracy and robustness of current methodologies in scene
understanding. The study focuses on developing algorithms for 2D transparent
object segmentation in challenging indoor environments, aiming to significantly
improve detection capabilities in scenarios with cluttered backgrounds and
variable lighting conditions.
Additionally, it explores 3D test time augmentation methods for classification
and segmentation, targeting both static and dynamic objects in indoor and
outdoor scenes to enhance model resilience and accuracy. By implementing these
methods, the research aims to provide robust solutions that adapt to variations
in data, thereby improving the overall performance of 3D models.
Furthermore, the research combines 3D reconstruction with motion flow
estimation to achieve a comprehensive understanding of dynamic objects, such as
humans and animals, in indoor settings. This approach aims to accurately track
and predict object movements, enhancing the analysis of dynamic scenes.
The study also utilizes text-to-image diffusion models for 2D open vocabulary
camouflage instance segmentation, addressing the detection and segmentation of
camouflaged objects in outdoor and underwater environments. This method
leverages advanced diffusion models to identify and segment camouflaged
instances from diverse vocabularies, improving the detection and analysis of
such objects in challenging settings.
Lastly, the creation of a comprehensive dataset for camouflaged animals in
videos aims to improve classification, detection, and segmentation algorithms.
This dataset includes outdoor and underwater videos of camouflaged animals,
facilitating the development of robust algorithms capable of accurately
analyzing these animals in their natural habitats. This effort contributes to
wildlife conservation and environmental monitoring by providing a valuable
resource for further research.
In summary, this research pushes the boundaries of scene understanding by
offering more accurate and robust solutions for interpreting complex scenes. By
addressing specific challenges associated with different environments and
object types, it aims to significantly advance the field of computer vision.
Date: Tuesday, 13 August 2024
Time: 10:00am - 12:00noon
Venue: Room 5510
Lifts 25/26
Chairman: Prof. Hongbo FU (AMC)
Committee Members: Prof. Sai Kit YEUNG (Supervisor)
Prof. Huamin QU
Dr. Dan XU
Dr. Stanley Chun Kwan LAU (OCES)
Dr. Di LIN (TJU)