More about HKUST
Accurate, Scalable and Parallel Structure from Motion
PhD Thesis Proposal Defence Title: "Accurate, Scalable and Parallel Structure from Motion" by Mr. Siyu ZHU Abstract: Structure from motion (SfM) is a photogrammetric technique for estimating three-dimensional structures from two-dimensional images. Thanks to the rapid development of portable photo acquisition equipments cameras and the explosion of on-line image collections, large-scale SfM has achieved extraordinary progress in the past few years. However, large-scale SfM is still challenging in three aspects, namely accuracy, scalability and efficiency. The target of this work is to handle highly accurate and consistent large-scale SfM problems in a parallel and scalable manner. First, we tackle the accurate and consistent Structure from Motion (SfM) problem, in particular camera registration, far exceeding the memory of a single computer in parallel. Different from the previous methods which drastically simplify the parameters of SfM, we propose a camera clustering algorithm to divide a large SfM problem into smaller sub-problems in terms of camera clusters with overlapping while preserving as many connectivity among cameras and tracks as possible. We next exploit a hybrid formulation leveraging the relative motions from local incremental SfM into a global motion averaging framework to produce superior accurate and consistent initial camera poses. Our scalable formulation in terms of camera clusters is highly applicable to the whole SfM pipeline including track generation, local SfM, 3D point triangulation and bundle adjustment, and able to reconstruct camera poses of a city-scale data-set containing 665K high-resolution images with the state-of-the-art accuracy and robustness evaluated on both the benchmark and Internet data-sets. Then we propose a divide-and-conquer algorithm to solve large-scale motion averaging problems in a highly parallel scheme. First, we partition the full camera set into clusters in which local SfM is performed to provide robust relative motions and initialize global camera poses. Then the full motion averaging problem is decoupled into several sub-problems with respect to their local coordinate frame encoded by a similarity transformation for independent optimization in parallel. Finally, we can merge sub-problems globally without caching the whole reconstruction in memory at once. A hierarchical system is subsequently proposed not only able to solve large-scale motion averaging problems including one consisted of 665K images in an inherently parallel scheme but also simplifies challenging translation averaging to a well-posed similarity averaging problem. Experiments on benchmark and Internet data-sets confirm that our unified system improves accuracy over the state-of-the-art methods with comparable efficiency. Global bundle adjustment usually converges to a non-zero residual and produces sub-optimal camera poses for local areas, which leads to loss of details for high-resolution reconstruction. Instead of trying harder to optimize everything globally, we argue that we should live with the non-zero residual and adapt the camera poses to local areas. To this end, we propose a segment-based approach to readjust the camera poses locally and improve the reconstruction for fine geometry details. The key idea is to partition the globally optimized structure from motion points into well-conditioned segments for re-optimization, reconstruct their geometry individually, and fuse everything back into a consistent global model. This significantly reduces severe propagated errors and estimation biases caused by the initial global adjustment. The results on several datasets demonstrate that this approach can significantly improve the reconstruction accuracy, while maintaining the consistency of the 3D structure between segments. To the best of my knowledge, ours is the first pipeline able to reconstruct highly accurate and consistent camera poses from 665K high-resolution images in a parallel manner. Date: Wednesday, 22 March 2017 Time: 1:30pm - 3:30pm Venue: Room 1504 (lifts 25/26) Committee Members: Prof. Long Quan (Supervisor) Prof. Huamin Qu (Chairperson) Dr. Pedro Sander Prof. Chiew-Lan Tai **** ALL are Welcome ****