Title: 國語韻律訊息之偵測及應用
An Initial Study on Mandarin Prosodic Information Detection and Its Application
Authors: 李淑凌
Li, Shu-Ling
陳信宏
Chen, Xin-Hong
Institute of Communications Engineering (電信工程研究所)
Keywords: Prosodic States; Recurrent Neural Networks; Vector Quantization; Telecommunication; Electronic Engineering
Issue Date: 1996
Abstract: This thesis proposes a method for detecting the prosodic states of speech signals. A recurrent neural network (RNN) first classifies each frame of an input utterance into one of three broad classes: syllable initial, syllable final, or silence. The RNN outputs then drive a finite-state machine (FSM) that divides the utterance into segments of four states: three stable states, I (initial), F (final), and S (silence), and one transient state, T (transition). Several acoustic cues are extracted from the vicinity of the final segments and used to model the prosodic states of the periods between final segments. Two prosodic-state modeling schemes are studied. The first uses vector quantization (VQ) to directly classify the acoustic cues of two contiguous final segments into 8 or 16 prosodic states. The second trains an RNN with linguistic features as target outputs and obtains prosodic states by vector-quantizing the outputs of its hidden layer. These prosodic states admit linguistically meaningful interpretations. Finally, two outputs of the RNN that provide word-boundary cues are integrated into an MRNN-based continuous Mandarin word recognizer. Experimental results show that these cues improve word recognition performance.
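The segmentation step described in the abstract can be illustrated with a minimal sketch. It is not the thesis's actual FSM, whose transition rules are not given in this record. It assumes each frame comes with three class scores (initial, final, silence) from the classifier, and that a frame whose best score falls below a hypothetical `threshold` is treated as transition (T). The function names, the threshold value, and the sample scores are all illustrative assumptions.

```python
from itertools import groupby

# Stable classes named in the abstract: I (initial), F (final), S (silence).
CLASSES = ("I", "F", "S")


def label_frames(posteriors, threshold=0.6):
    """Assign each frame a state label.

    Assumption: a frame is a stable state only if its best class score
    reaches `threshold`; otherwise it is labeled "T" (transition).
    """
    labels = []
    for frame in posteriors:
        best = max(range(len(CLASSES)), key=lambda k: frame[k])
        labels.append(CLASSES[best] if frame[best] >= threshold else "T")
    return labels


def segment(labels):
    """Merge runs of identical labels into (label, first_frame, last_frame)."""
    segments = []
    start = 0
    for label, run in groupby(labels):
        length = len(list(run))
        segments.append((label, start, start + length - 1))
        start += length
    return segments


# Made-up per-frame scores in the order (initial, final, silence).
posteriors = [
    (0.10, 0.10, 0.80),
    (0.20, 0.10, 0.70),
    (0.50, 0.10, 0.40),
    (0.90, 0.05, 0.05),
    (0.45, 0.50, 0.05),
    (0.10, 0.85, 0.05),
    (0.10, 0.80, 0.10),
]

print(segment(label_frames(posteriors)))
```

With these sample scores, the low-confidence frames between silence, initial, and final become short T segments, which mirrors the role the abstract gives to the transient state.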
URI: http://140.113.39.130/cdrfb3/record/nctu/#NT854436001
http://hdl.handle.net/11536/62505
Appears in Collections: Thesis