More about HKUST
Urban Scene Parsing with Images and Scan Data
PhD Thesis Proposal Defence Title: "Urban Scene Parsing with Images and Scan Data" by Mr. Honghui Zhang ABSTRACT: Urban scene parsing, segmenting interested objects and identifying their categories in urban scenes, is a fundamental issue in many applications, like 3D city modeling and autonomous vehicles navigation. Different from the general scene parsing task, urban scene parsing is a typical representative of the constrained scene parsing task, which is an active research area. In this proposal, we investigate the methods for the urban scene parsing task with images and scan data, from small scale to large scale. With both images and scan data, we propose a novel joint image and scan data scene parsing method which can be applied in large scale urban scenes. The proposed method can automatically obtain necessary training data from the input data, which is usually obtained through manually labeling in previous work. With the automatically obtained training data, we use an associative Hierarchical CRF to jointly optimize the segmentation of images and scan point cloud simultaneously. With only images, we propose a nonparametric scene parsing method which exploits the partial similarity between images, and a parametric scene parsing method, the supervised label transfer method. The partial similarity based nonparametric method involves no training process and reduces the inference problem in the scene parsing to a matching problem. By contrast, the supervised label transfer method transforms the inference problem in the scene parsing to a supervised matching problem. Last but not least, we propose an efficient Iterative Passive-Aggressive learning algorithm to learn the parameters involved in the random field models to formulate the scene parsing task, from some given training data. The parameters are iteratively updated by solving a structured output optimization problem, sharing similar updating form as the projected sub-gradient methods but without using any predefined step-size. The proposed methods are evaluated and compared with some state-of-the-art methods on several public datasets and the real Google Street View data, with encouraging performance achieved. Date: Friday, 27 April 2012 Time: 3:00pm - 5:00pm Venue: Room 5510 lifts 25/26 Committee Members: Prof. Long Quan (Supervisor) Dr. Chiew-Lan Tai (Chairperson) Dr. Pedro Sander Prof. Chi-Keung Tang **** ALL are Welcome ****