標題: A masking-threshold-adapted weighting filter for excitation search
作者: Chang, WW
Wang, CT
交大名義發表
電信工程研究所
National Chiao Tung University
Institute of Communications Engineering
公開日期: 1-三月-1996
摘要: Most LPC-based audio coders improve reproduction quality by using predictor coefficients to embody perceptual masking in noise spectral shaping. Since the predictor coefficients were originally derived to characterize sound production models, they cannot precisely describe the human ear's nonlinear responses to frequency and loudness. In this paper, we report on new approaches to exploiting the masking threshold in the design of a perceptual noise-weighting filter for excitation searches. To track the nonstationary evolution of a masking threshold, an autoregressive spectral analysis with finite order has been shown to be capable of providing sufficient accuracy. In seeking faster response, an artificial neural network was also trained to extract autoregressive modeling parameters of the masking threshold from typical audio signals via mapping. Furthermore, we propose the concept of sinusoidal excitation representation to better track the intrinsic characteristics of prediction error signals. Simulation results indicate that the combined use of a multisinusoid excitation model and a masking-threshold-adapted weighting filter allows the implementation of an LPC-based audio coder that delivers near transparent quality at the rate of 96 kb/s.
URI: http://dx.doi.org/10.1109/89.486062
http://hdl.handle.net/11536/1410
ISSN: 1063-6676
DOI: 10.1109/89.486062
期刊: IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Volume: 4
Issue: 2
起始頁: 124
結束頁: 132
顯示於類別:期刊論文


文件中的檔案:

  1. A1996UC34700006.pdf