More about HKUST
High-fidelity Surface Reconstruction and Semantic Understanding
PhD Thesis Proposal Defence Title: "High-fidelity Surface Reconstruction and Semantic Understanding" by Mr. Shiwei LI Abstract: Capturing 3D models of real world objects and scenes from multi-view images is becoming increasingly popular in recent years, thanks to the rapid development of consumer cameras, mobile phones and flying drones. Surface reconstruction is one of the core steps in 3D reconstruction, which recovers the actual geometry and is crucial to the final reconstruction quality. Subsequently, with the reconstructed 3D model represented as the mesh surface, semantic understanding (e.g., shape classification or segmentation) on meshes is desirable for many applications. In this thesis, we study and contribute to these two problems. First, we present methods regarding the high-fidelity surface reconstruction. The term "high-fidelity" contains a double meaning, namely the topological accuracy and the geometric accuracy. On one hand, the topological accuracy concerns about the structural correctness and completeness of the reconstructed surface. For example, thin structures often fail to be retained in the reconstruction due to incomplete and noisy point clouds. To address this problem, we leverage the spatial curve representation for thin and elongated structures, and present a novel surface reconstruction method using both curves and point clouds. On the other hand, the geometric accuracy measures the holistic similarity between the reconstructed model to the ground truth model, and can be optimized by minimizing the reprojection error in surface refinement. Such optimization is iterative and requires repeated computation of gradients over all surface regions, which is the bottleneck affecting adversely the computational efficiency of the refinement. Therefore, we present a flexible and efficient framework for mesh surface refinement in multi-view stereo, dubbed Adaptive Resolution Control (ARC). The ARC evaluates an optimal trade-off between the geometry accuracy and the performance via curve analysis, and accelerates the stereo refinement by severalfold by culling out most insignificant regions, while still maintaining a similar level of geometry details that the state-of-the-art methods could achieve. Second, we present methods regarding the semantic understanding of reconstructed surface. The mesh surface texture-mapped by images, is a photo-realistic and standalone representation that renders the reality of objects or scenes. We present a convolutional network architecture for direct feature learning on mesh surfaces through their atlases of texture maps. Since the parameterization of texture map is unpredictable, and depends on the surface topologies, we therefore introduce a novel cross-atlas convolution to recover the original mesh geodesic neighborhood, so as to achieve the equivariance property to arbitrary parameterization. The proposed module is integrated into classification and segmentation architectures, which takes the input texture map of a mesh, and infers the output predictions. In sum, this thesis provides methods for high-fidelity and efficient surface reconstruction, as well as the semantic parsing on the reconstructed mesh surface. We are successful to concatenate these components and create a pipeline for surface reconstruction and the subsequent semantic parsing in a fully automatic manner. Date: Monday, 4 March 2019 Time: 2:00pm - 4:00pm Venue: Room 2408 (lifts 17/18) Committee Members: Prof. Long Quan (Supervisor) Prof. Huamin Qu (Chairperson) Dr. Pedro Sander Prof. Chiew-Lan Tai **** ALL are Welcome ****