More about HKUST
Crowd-based Debugging with Extractive Summarization of Crowd Posts
PhD Thesis Proposal Defence Title: "Crowd-based Debugging with Extractive Summarization of Crowd Posts" by Mr. Fuxiang CHEN Abstract: Debugging is hard. Even though root causes to bugs are found, fixing them is non-trivial and requires a significant amount of time. For example, a previous study has reported that the median time to fix a single bug is 200 days. Stack Overflow, a question and answering forum for developers, has attracted numerous contributions, in the order of millions (both in asking questions and providing answers within short period of time), since its establishment. This makes Stack Overflow an invaluable source of information for developers. In this thesis, we first propose mining the QA site, Stack Overflow, to leverage the huge mass of crowd knowledge to help developers in debugging their code. Our approach finds defective code fragments by detecting code clones before using them to triangulate source code anomalies. The defective code fragments (with the crowd’s explanation - problem-cause description) are then coupled with the crowd’s suggested solution (with the crowd’s explanation as well - solution description) and reported back to developers. Often, the problem-case and the solution for a given software bug involve a large amount of textual information and code snippets. It has also been reported that the irrelevance and redundancy of Stack Overflow answers may inhibit developers’ ability to retreive information from Stack Overflow efficiently. Thus, to aid developers in their comprehension tasks, we next propose providing both problem-cause and solution summaries of the Stack Overflow answer posts. Our technique comprises of an ensemble of two models of extractive summarization techniques involving detecting salient sentences by making use of Mutual Reinforcement Principle and leveraging a Deep Neural Network architecture which aims to reduce sentence redendancy as one of its objectives. Date: Friday, 13 April 2018 Time: 10:30am - 12:30pm Venue: Room 3494 (lifts 25/26) Committee Members: Dr. Sunghun Kim (Supervisor) Prof. Andrew Horner (Chairperson) Prof. Shing-Chi Cheung Prof. Frederick Lochovsky **** ALL are Welcome ****