標題: A new approach for audio classification and segmentation using Gabor wavelets and Fisher Linear Discriminator
作者: Lin, RS
Chen, LH
資訊工程學系
Department of Computer Science
關鍵字: audio classification and segmentation;spectrogram;audio content-based retrieval;Fisher Linear discriminator;Gabor wavelets
公開日期: 1-九月-2005
摘要: Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified as one of two classes or five classes. Two classes contain speech and music; pure speech, pure music, song, speech with music background, and speech with environmental noise background are for five classes. Finally, a merge technique is provided to do segmentation. In order to make the proposed method robust for a variety of audio sources: we use Fisher Linear Discriminator to obtain features with the highest discriminative ability Experimental results show that the proposed method can achieve over 98% accuracy rate for speech and music discrimination, and more than 95% for a five-way discrimination. By checking the class types of adjacent clips, we can also identify more than 95% audio scene breaks in audio sequence.
URI: http://dx.doi.org/10.1142/S0218001405004289
http://hdl.handle.net/11536/13337
ISSN: 0218-0014
DOI: 10.1142/S0218001405004289
期刊: INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE
Volume: 19
Issue: 6
起始頁: 807
結束頁: 822
顯示於類別:期刊論文


文件中的檔案:

  1. 000232497900005.pdf