More about HKUST
Eigentriphone Modeling in Automatic Speech Recognition
PhD Thesis Proposal Defence Title: "Eigentriphone Modeling in Automatic Speech Recognition" by Mr. Yu Ting KO ABSTRACT: In triphone-based acoustic modeling, it is difficult to robustly model infrequent triphones due to their lack of training samples. Naive maximum-likelihood (ML) estimation of infrequent triphone models produces poor triphone models and eventually affects the overall performance of an automatic speech recognition (ASR) system. Among different techniques proposed to solve the infrequent triphone problem, the most widely used method in current ASR systems is state tying because of its effectiveness in reducing model size and achieving good recognition results. However, state tying inevitably introduces quantization errors since triphones tied to the same state are not distinguishable in that state. On the other hand, speaker adaptation techniques have been well developed over the past decades. Speaker adaptation aims at adapting acoustic models to the characteristics of a particular speaker with a limited amount of speaker specific data. Motivated by the idea of these speaker adaptation techniques, we would like to solve the infrequent triphone problem from an adaptation point of view. In this proposal, we propose a new context-dependent modeling method called eigentriphones modeling. In contrast to the state tying method, all the triphone models are distinct from each other and thus may be more discriminative. The rational behind our method is that a basis is derived over the frequent triphones and each infrequent triphone is modeled as a point in the space spanned by the basis. The eigenvectors in the basis represent the most important context-dependent characteristics among the triphones and thus the infrequent triphones can be robustly modeled with few training samples. The proposed eigentriphone modeling was empirically evaluated on the Wall Street Journal 5K task and the TIMIT phoneme recognition task. It is shown that our proposed method consistently performs better than the most common state tying method. Future works for completion of the thesis are also given in the proposal. Date: Monday, 13 May 2013 Time: 10:00am - 12:00noon Venue: Room 3405 lifts 17/18 Committee Members: Dr. Brian Mak (Supervisor) Prof. Dit-Yan Yeung (Chairperson) Prof. Siu-Wing Cheng Dr. Raymond Wong **** ALL are Welcome ****