More about HKUST
Robust Registration and Semantic Understanding of 3D Point Clouds
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "Robust Registration and Semantic Understanding of 3D Point Clouds" By Mr. Xuyang BAI Abstract In recent decades, with the prevalence of affordable RGB-D cameras and LiDAR (Light Detection and Ranging) scanners, point cloud representation has become increasingly practical and popular in many computer vision applications, such as structure from motion (SfM), simultaneous localization and mapping (SLAM). Since a collection of point clouds is usually limited to only one perspective, several acquisitions are required to cover the whole area of interest. In order to register these acquisitions from different viewpoints, point cloud registration is applied as a fundamental step for finding an optimal transformation between partially overlapped point cloud fragments and recovering a complete underlying geometry. Subsequently, with the reconstructed 3D model represented as point clouds, performing semantic scene understanding is necessary for many applications, such as autonomous driving and augmented reality. In this thesis, we present our contributions to these two problems, namely, point cloud registration and point cloud semantic understanding. First, we present two methods for efficient and robust point cloud registration based on the key idea of decomposing the point cloud registration pipeline into three learnable sub-modules. In the first method, we design a keypoint detector sub-module and a keypoint descriptor sub-module for efficient local feature extraction, emphasizing the importance of a reliable keypoint detector and demonstrating the superiority of joint learning of both detection and description tasks. In the second method, we propose a correspondence filtering sub-module to improve the robustness towards large outlier ratio. We explicitly incorporate the spatial consistency constrained by rigid transformations for pruning outlier correspondences. Second, we study the problem of semantic understanding on registered point clouds. Specifically, we propose a LiDAR-camera fusion solution for 3D perception. Our studies investigate the inherent difficulties of LiDAR-camera fusion and reveal a crucial aspect to robust fusion, namely, the soft-association mechanism. The proposed module is integrated into object detection and multiple object tracking frameworks and can be easily extended to other tasks such as LiDAR semantic segmentation. In summary, we developed three learning-based methods for robust point cloud registration and semantic parsing on the registered point clouds. The proposed methods have been extensively evaluated on standardized benchmarks, where their superior performance and strong generalization ability have been demonstrated. Date: Monday, 23 May 2022 Time: 4:00pm - 6:00pm Zoom Meeting: https://hkust.zoom.us/j/3966929732 Chairperson: Prof. Weiping LI (MATH) Committee Members: Prof. Chiew Lan TAI (Supervisor) Prof. Qifeng CHEN Prof. Pedro SANDER Prof. Kai TANG (MAE) Prof. Chi Wing FU (CUHK) **** ALL are Welcome ****