More about HKUST
High-fidelity Surface Reconstruction and Semantic Understanding
PhD Thesis Proposal Defence
Title: "High-fidelity Surface Reconstruction and Semantic Understanding"
by
Mr. Shiwei LI
Abstract:
Capturing 3D models of real world objects and scenes from multi-view
images is becoming increasingly popular in recent years, thanks to the
rapid development of consumer cameras, mobile phones and flying drones.
Surface reconstruction is one of the core steps in 3D reconstruction,
which recovers the actual geometry and is crucial to the final
reconstruction quality. Subsequently, with the reconstructed 3D model
represented as the mesh surface, semantic understanding (e.g., shape
classification or segmentation) on meshes is desirable for many
applications. In this thesis, we study and contribute to these two
problems.
First, we present methods regarding the high-fidelity surface
reconstruction. The term "high-fidelity" contains a double meaning,
namely the topological accuracy and the geometric accuracy. On one hand,
the topological accuracy concerns about the structural correctness and
completeness of the reconstructed surface. For example, thin structures
often fail to be retained in the reconstruction due to incomplete and
noisy point clouds. To address this problem, we leverage the spatial curve
representation for thin and elongated structures, and present a novel
surface reconstruction method using both curves and point clouds. On the
other hand, the geometric accuracy measures the holistic similarity
between the reconstructed model to the ground truth model, and can be
optimized by minimizing the reprojection error in surface refinement. Such
optimization is iterative and requires repeated computation of gradients
over all surface regions, which is the bottleneck affecting adversely the
computational efficiency of the refinement. Therefore, we present a
flexible and efficient framework for mesh surface refinement in multi-view
stereo, dubbed Adaptive Resolution Control (ARC). The ARC evaluates an
optimal trade-off between the geometry accuracy and the performance via
curve analysis, and accelerates the stereo refinement by severalfold by
culling out most insignificant regions, while still maintaining a similar
level of geometry details that the state-of-the-art methods could achieve.
Second, we present methods regarding the semantic understanding of
reconstructed surface. The mesh surface texture-mapped by images, is a
photo-realistic and standalone representation that renders the reality of
objects or scenes. We present a convolutional network architecture for
direct feature learning on mesh surfaces through their atlases of texture
maps. Since the parameterization of texture map is unpredictable, and
depends on the surface topologies, we therefore introduce a novel
cross-atlas convolution to recover the original mesh geodesic
neighborhood, so as to achieve the equivariance property to arbitrary
parameterization. The proposed module is integrated into classification
and segmentation architectures, which takes the input texture map of a
mesh, and infers the output predictions.
In sum, this thesis provides methods for high-fidelity and efficient
surface reconstruction, as well as the semantic parsing on the
reconstructed mesh surface. We are successful to concatenate these
components and create a pipeline for surface reconstruction and the
subsequent semantic parsing in a fully automatic manner.
Date: Monday, 4 March 2019
Time: 2:00pm - 4:00pm
Venue: Room 2408
(lifts 17/18)
Committee Members: Prof. Long Quan (Supervisor)
Prof. Huamin Qu (Chairperson)
Dr. Pedro Sander
Prof. Chiew-Lan Tai
**** ALL are Welcome ****