More about HKUST
From Pixels to Perception: Generative and Adaptive Methodologies for Degraded Visual Scenarios
PhD Thesis Proposal Defence Title: "From Pixels to Perception: Generative and Adaptive Methodologies for Degraded Visual Scenarios" by Mr. Qiang WEN Abstract: Enhancing images and videos in degraded visual scenarios is a vital task in computer vision, with applications across diverse fields such as medical imaging, robotics, and photography. Existing works have focused heavily on improving pixel-level quality in the final enhanced images through techniques like denoising, deblurring, and deraining. However, increasing evidence shows that while these pixel-level enhancement methods produce visually appealing results for human sense, they may not be optimal for downstream-task perception, such as object detection and semantic segmentation, which are essential for applications like robotics and autonomous driving. In this proposal, we aim to explore image and video enhancement from pixel-level enhancement for human sense to perception-level enhancement tailored for downstream computer vision tasks. This will be achieved by developing innovative enhancement systems leveraging cutting-edge methodologies, such as generative models and domain adaptation.Specifically, this proposal explores four representative tasks: waterdrop removal for driving scenes in rainy scenarios, low-light image enhancement with generative models, enhancing HDR imaging with joint denoising and deblurring, adapting large multi-modal models to see and read in the dark. With extensive experiments, it gradually highlights the distinction between pixel-level enhancement for human sense and perception-level enhancement for downstream computer vision tasks, providing a clearer direction for future research on enhancement objectives. Date: Tuesday, 6 May 2025 Time: 2:00pm - 4:00pm Venue: Room 2408 Lifts 17/18 Committee Members: Dr. Qifeng Chen (Supervisor) Prof. Nevin Zhang (Chairperson) Prof. Qiong Luo