More about HKUST
Contextual Word Embeddings and Polysemy in Hierarchical Topic Modeling
The Hong Kong University of Science and Technology
Department of Computer Science and Engineering
Final Year Thesis Oral Defense
Title: "Contextual Word Embeddings and Polysemy in Hierarchical Topic
Modeling"
by
WANG Zhiwei
Abstract:
Hierarchical topic modeling (HTD) is a variant of topic modeling task
which has attracted researchers' attention in recent years. Hierarchical
latent tree analysis (HLTA) is a recently proposed method for HTD that has
shown superior performance than state-of-the- art methods. However, the
model of HLTA has a tree structure and cannot represent different meanings
of a word appropriately. We proposed a method using contextual word
embeddings to add context information to the text corpus before running
HLTA. Some selected words are replaced by their specific word senses in
the corpus. The empirical evaluation shows that the proposed method
outperforms the original HLTA on the test dataset.
Date : 8 May 2021 (Saturday)
Time : 16:00-16:40
Zoom Link:
https://hkust.zoom.us/j/92133587886?pwd=N3orUFNBNFpSVjZiV1JTM0Y5TVNZZz09
Meeting ID : 921 3358 7886
Passcode : 663031
Advisor : Prof. ZHANG Nevin Lianwen
2nd Reader : Dr. SONG Yangqiu