More about HKUST
The Application of Foundation Models in Robotics
PhD Qualifying Examination
Title: "The Application of Foundation Models in Robotics"
by
Mr. Siyuan ZHOU
Abstract:
Foundation models that are pre-trained on diverse data at scale have
demonstrated substantial potential across numerous tasks in both vision and
language domains. Yet, developing foundation models for general-purpose robots
remains a significant challenge. General-purpose robots must be capable of
performing seamlessly across any task in any environment. However, current
robotic systems are constrained in specific tasks and specific environments,
resulting in generalization issues -- models struggle to perform effectively
when encountering unseen tasks or new environments.
Motivated by the remarkable open-set performance and common sense reasoning of
foundation models, such as Large Language Model (LLM) and Vision Language Model
(VLM), we devote this survey to exploring how these foundation models integrate
seamlessly into robotic systems. We start with a thorough overview of both
robotic systems and foundation models in the vision and language fields. Next,
we discuss current works on leveraging existing foundation models for robotic
tasks. Finally, we consider several promising directions for future research.
Date: Thursday, 23 May 2024
Time: 10:00am - 12:00noon
Venue: Room 3494
Lifts 25/26
Committee Members: Prof. Dit-Yan Yeung (Supervisor)
Prof. Fangzhen Lin (Chairperson)
Dr. Long Chen
Dr. Qifeng Chen