標題: Robust endpoint detection algorithm based on the adaptive band-partitioning spectral entropy in adverse environments
作者: Wu, BF
Wang, KC
電控工程研究所
Institute of Electrical and Control Engineering
關鍵字: adaptive processing;endpoint detection;multiband analysis;spectral entropy
公開日期: 1-九月-2005
摘要: In speech processing, endpoint detection in noisy environments is difficult, especially in the presence of nonstationary noise. Robust endpoint detection is one of the most important areas of speech processing. Generally, the feature parameters used for endpoint detection are highly sensitive to the environment. Endpoint detection is severely degraded at low signal-to-noise ratios (SNRs) since those feature parameters cannot adequately describe the characteristics of a speech signal. As a result, this study seeks the banded structure on speech spectrogram to distinguish a speech from a nonspeech, especially in adverse environments. First, this study proposes a feature parameter, called band-partitioning spectral entropy (BSE), which exploits the use of the banded structure on speech spectrogram. A refined adaptive band selection (RABS) method is extended from the adaptive band selection method proposed by Wu et al., which adaptively selects useful bands not corrupted by noise. The successful RABS method is strongly depended on an on-line detection with minimal processing delay. In this paper, the RABS method is combined with the BSE parameter. Finally, a novel robust feature parameter, adaptive band-partitioning spectral entropy (ABSE), is presented to successfully detect endpoints in adverse environments. Experimental results indicate that the ABSE parameter is very effective under various noise conditions with several SNRs. Furthermore, the proposed algorithm outperforms other approaches and is reliable in a real car.
URI: http://dx.doi.org/10.1109/TSA.2005.851909
http://hdl.handle.net/11536/13353
ISSN: 1063-6676
DOI: 10.1109/TSA.2005.851909
期刊: IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Volume: 13
Issue: 5
起始頁: 762
結束頁: 775
顯示於類別:期刊論文


文件中的檔案:

  1. 000231517400004.pdf