More about HKUST
Object Segmentation in Neural Radiance Field
The Hong Kong University of Science and Technology Department of Computer Science and Engineering MPhil Thesis Defence Title: "Object Segmentation in Neural Radiance Field" By Mr. Yichen LIU Abstract: First, we present one of the first learning-based 3D instance segmentation pipelines in Neural Radiance Field (NeRF), dubbed as Instance-NeRF. Taking a NeRF pretrained from multi-view RGB images as input, Instance-NeRF can learn 3D instance segmentation of a given scene, represented as an instance field component of the NeRF model. To this end, we adopt a 3D proposal-based mask prediction network on the sampled volumetric features from NeRF, which generates discrete 3D instance masks. The coarse 3D mask prediction is then projected to image space to match 2D segmentation masks from different views generated by existing panoptic segmentation models, which are used to supervise the training of the instance field. Notably, beyond generating consistent 2D segmentation maps from novel views, Instance-NeRF can query instance information at any 3D point, which greatly enhances NeRF object segmentation and manipulation. Next, we introduce the Segment Anything for NeRF in High Quality (SANeRF-HQ) to achieve high-quality 3D segmentation of any target object in a given scene. SANeRF-HQ utilizes Segment Anything Model (SAM) for open-world object segmentation guided by user-supplied prompts, while leveraging NeRF to aggregate information from different x viewpoints. To overcome the aforementioned challenges, we employ density field and RGB similarity to enhance the accuracy of segmentation boundary during the aggregation. Emphasizing on segmentation accuracy, we evaluate our method on multiple NeRF datasets where high-quality ground-truths are available or manually annotated. SANeRFHQ shows a significant quality improvement over state-of-the-art methods in NeRF object segmentation, provides higher flexibility for object localization, and enables more consistent object segmentation across multiple views. Date: Monday, 3 June 2024 Time: 10:00am - 12:00noon Venue: Room 5510 Lifts 25/26 Chairman: Prof. Pedro SANDER Committee Members: Prof. Chi-Keung TANG (Supervisor) Prof. Dit-Yan YEUNG