More about HKUST
High-fidelity Image and Video Restoration and Enhancement by Recovering RAW Sensor Data
The Hong Kong University of Science and Technology
Department of Computer Science and Engineering
PhD Thesis Defence
Title: "High-fidelity Image and Video Restoration and Enhancement by Recovering
RAW Sensor Data"
By
Mr. Yazhou XING
Abstract:
Image and video restoration and enhancement have long posed significant
challenges in computer vision and computational photography. The task of
recovering high-fidelity images and videos from corrupted or low-quality pure
RGB signals is highly complex and ill-posed. Alternatively, leveraging camera
RAW sensor data, which captures unprocessed signals with a linear relationship
to scene irradiance and typically ranges from 12 to 14 bits, can greatly
enhance the performance of restoration and enhancement tasks. Accessing RAW
data, however, can be quite hard due to their memory-demanding property: RAW
images may be discarded during the process of data storing, transferring, and
sharing.
This thesis aims to close this research gap by the recovery of RAW data for
robust image and video enhancement and restoration. We begin this thesis with a
general solution to recover RAW sensor data from sRGB images, dubbed Invertible
Image Signal Processing (InvISP). Unlike synthesizing RAW data from sRGB
images, our innovative InvISP enables the rendering of visually appealing sRGB
images while also facilitating the recovery of nearly perfect RAW data.
Then, we study another fundamental problem in RAW data recovery: high dynamic
range (HDR) videos reconstruction. We present an online learning-based system
designed to reduce overexposure artifacts in HDR video imaging. Our system
leverages the temporal instabilities of autoexposure, eliminating the need for
complex acquisition mechanisms such as alternating exposures or costly
processing commonly associated with HDR imaging.
Lastly, we explore the effect of a special form of raw images, uncorrupted
complete backgrounds, for the realistic compositing of portrait photographs or
videos. By unifying foreground alpha matte generation and post-blending
harmonization, we enable the realistic composition of portrait images and
deliver temporally stable results in videos.
Date: Thursday, 11 April 2024
Time: 10:00am - 12:00noon
Venue: Room 5501
Lifts 25/26
Chairman: Prof. Shiheng WANG (ACCT)
Committee Members: Prof. Qifeng CHEN (Supervisor)
Prof. Pedro SANDER
Prof. Long CHEN
Prof. Ling SHI (ECE)
Prof. Jinwei GU (CUHK)