More about HKUST
Object Detection from Videos
The Hong Kong University of Science and Technology Department of Computer Science and Engineering Final Year Thesis Oral Presentation Title: "Object Detection from Videos" by Mr. Rui PENG Abstract: Object detection from videos is an emerging area in very large scale visual recognition. While previous state-of-the-art image object detection algorithms can be directly employed in video object detection in a frame-by-frame manner to produce acceptable results, this straightforward solution fails to exploit the rich temporal information inherent video input. In this thesis, I propose a comprehensive video object detection pipeline consisting of a number of flexible modules to produce end-to-end video object detection results. Working in tandem with the Fast-RCNN on VGG16 architecture, our pipeline scores a mAP of 42.1% in the ILSVRC2015 "Object Detection from Video" contest track, placing us the **fifth* in one of the best known worldwide big-data competition in visual recognition. Along the direction of incorporating temporal information, we present the Fusion-Net, the main network architecture of our detector. Fusion-Net utilizes the convolution layers (or RNN) to operate on feature maps to provide a comprehensive fusion effect of features along the temporal axis. Preliminary experiments show it is capable of pushing the mAP frontier at least by a margin of 0.5%. Date : 9 May 2016 (Monday) Time : 2:30pm to 3:30pm Venue : Room 5510 (lift 25/26) Advisor : Prof. C.K. TANG 2nd Reader : Dr. Pedro SANDER