Towards Prevalence of On-Device AI with Full Runtime Adaptability

Speaker: Professor Wei Gao
University of Pittsburgh

Title: Towards Prevalence of On-Device AI with Full Runtime Adaptability

Date: Thursday, 20 February 2025

Time: 10:00am - 11:00am

Venue: Room 3523 (CSE conference Room, via lift 25/26), HKUST

Abstract:

With the recent democratization of AI, there is a pressing need of supporting AI on mobile and embedded devices at the edge, to allow intelligent and prompt decision making autonomously on these devices. To meet the devices’ constraints in computing capacity, current software solutions to on-device AI reduce the ML model’s complexity, but have major weaknesses in adapting to the changes of online data patterns and environmental contexts, resulting in significant reduction of model performance in difficult learning tasks. In this talk, I will present our recent research on achieving such full runtime adaptability, as a key enabler for prevalence of on-device AI in practical systems. I will first present how we leverage explainability in AI to adaptively involve the most appropriate model structures for on-device computations, so as to support real-time inference, runtime training and LLM fine-tuning on devices with extreme resource constraints. Afterwards, I will further show how such on-device AI techniques can be applied to various application domains, including smart healthcare and embodied AI systems, to achieve high system performance with heterogeneous data characteristics and diverse environmental settings.


Biography:

Wei Gao is currently an Associate Professor in the Department of Electrical and Computer Engineering, University of Pittsburgh. His research interests lie in the intersection between AI and computer systems, with a focus on the design and deployment of on-device AI models and algorithms on mobile, embedded and networked devices. He also has strong interests in applying the computationally efficient AI models into practical application domains to make societal impacts and benefit the human welfare. The integrated AI and sensing systems developed by his team have been applied to more than 400 patients at Children's Hospital of Pittsburgh and helped enormous families with low incomes during the COVID-19 pandemic. He has published more than 80 research papers at both top AI and system conference venues, including ICLR, AAAI, ASPLOS, MobiCom, MobiSys, SenSys, etc, and received multiple best paper awards or nominations.