More about HKUST
MOOC Data Analytics: Probabilistic Topic Modeling of Discussion Forum Data
The Hong Kong University of Science and Technology
Department of Computer Science and Engineering
Final Year Thesis Oral Presentation
Title: "MOOC Data Analytics: Probabilistic Topic Modeling of Discussion
Forum Data"
By
Lei SUN
Abstract
Massive Open Online Course(MOOC) has expanded over Internet in the last few
years. Finding that the learning experience is not that satisfying for some
students, researchers are now trying to explore more features from the data
collected on MOOC platform to analyze enrolled students' performance during
the course session, so that more specific adjustment in terms of teaching
content can be conducted in time. Besides obvious features that can imply
students' learning progress like assignment grade and video attendance, the
forum contents of students will be given special attention as features that
can characterize users in this project. Techniques including Probabilistic
topic modeling is applied to extract dominant topics behind discussion
forum posts. Specifically, Latent Dirichlet allocation(LDA) model is used
to analyze user's forum posts and to represent their posts using topic
vectors, so that the similarity between users can be calculated using
KL-Divergence, Euclidean distance and other distance measure, and produce
clusters based on the similarity measure. The cluster information can help
us predict other user's performance and possibility to drop along the
course session. Besides user's cluster, we will utilize topic modeling to
learn the similarity between different forum threads and try to aggregate
threads based on their similarity. The aggregated threads can also provide
with new statistics illustrating the performance of users who involved in
those threads. Other aspects involving user's topic evolution will be
covered as well.
Date: Tuesday, 28 April 2015
Time: 11:20 - 12:00noon
Venue: Room 5560
Lifts 27/28
Committee Members: Prof. Dit-Yan Yeung (Supervisor)
Dr. Raymond Wong (Reader)