More about HKUST
Neural Question Generation and Evaluation with Large Language Models
The Hong Kong University of Science and Technology Department of Computer Science and Engineering MPhil Thesis Defence Title: "Neural Question Generation and Evaluation with Large Language Models" By Mr. Chun Yi Louis BO Abstract: This thesis explores the application of Large Language Models (LLMs) in Neural Question Generation (NQG) and Evaluation, addressing the challenges of generating high-quality, thought-provoking questions for educational purposes. Traditional rule-based Automatic Question Generation (AQG) methods, although effective for simple texts, often lack flexibility and diversity. Despite continuous advancements in the field, a significant gap remains between expert-generated and machine-generated questions. Furthermore, traditional evaluation metrics are limited by the quality of reference questions and often fail to account for creativity and diversity. This research leverages the capabilities of LLMs, such as GPT-4o and Claude 3, to enhance question generation and evaluation. Two novel qualitative evaluation methods are proposed: a question-answering approach and a rubric-based approach. Both methods show significant improvements over traditional metrics by correlating more closely with human annotations. In particular, the proposed metrics outperformed the best existing metrics by up to +40% absolute Pearson correlation for the grammatically and relevance criteria, and up to +7% for the answerability criterion. Furthermore, the thesis explores the capabilities of LLMs in generating educational questions. The findings indicate that models such as GPT-4o can generate questions that are not only grammatically correct but also relevant and answerable, thereby surpassing current specialized QG models both qualitatively and quantitatively. The study also investigates a self-reflective framework, allowing LLMs to improve their own generated questions based on feedback. Results show promising improvements in grammaticality, although challenges remain in enhancing the answerability of more complex questions. Date: Friday, 9 August 2024 Time: 10:00am - 12:00noon Venue: Room 5501 Lifts 25/26 Chairman: Prof. Raymond WONG Committee Members: Dr. Yangqiu SONG (Supervisor) Prof. Gary CHAN