More about HKUST
Acceleration Techniques and Systems for Training Deep Neural Networks
The Hong Kong University of Science and Technology Department of Computer Science and Engineering MPhil Thesis Defence Title: "Acceleration Techniques and Systems for Training Deep Neural Networks" By Mr. Rui PAN Abstract: The large training cost of modern deep neural networks, such as transformers, has been a major roadblock in scaling up model and dataset sizes, slowing down the progress of developing more intelligent systems. To accelerate the optimization process of modern transformer models, efforts have been put into the different aspects of the training procedure, including optimization-level, dataset-level, and system-level improvements. In this thesis, we briefly review developments arising from those different research directions, providing an overall picture of current acceleration techniques. On top of that, we delve deeper into the optimization-level improvement with prompt learning, a branch of approaches that distinguishes itself from others in enabling black- box learning of large-scale models. We introduce a novel combination of metaheuristics and black-box prompt learning, bringing forth a series of new prompt-tuning algorithms. Within our paradigm, we develop and examine six typical methods: hill climbing, simulated annealing, genetic algorithms with/without crossover, tabu search, and harmony search, demonstrating their effectiveness under white-box and black-box prompt learning settings. In particular, we show that these methods can be employed to discover more human-understandable prompts that were previously unknown in both reasoning and image generation tasks, opening the door to a cornucopia of possibilities in prompt optimization. This enables the optimization of black-box models to be more efficient, allowing diverse task-tailored models to be developed with minimum computational resources. Date: Monday, 22 July 2024 Time: 10:00am - 12:00noon Venue: Room 3494 Lifts 25/26 Chairman: Prof. Xiaofang ZHOU Committee Members: Dr. Yangqiu SONG (Supervisor) Prof. Dit-Yan YEUNG