標題: Bayesian group sparse learning for music source separation
作者: Chien, Jen-Tzung
Hsieh, Hsin-Lung
電機資訊學士班
Undergraduate Honors Program of Electrical Engineering and Computer Science
關鍵字: Bayesian sparse learning;Signal reconstruction;Subspace approach;Group sparsity;Nonnegative matrix factorization;Single-channel source separation
公開日期: 2013
摘要: Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative signals with the sparseness constraint. The signals are adequately represented by a set of basis vectors and the corresponding weight parameters. NMF has been successfully applied for blind source separation and many other signal processing systems. Typically, controlling the degree of sparseness and characterizing the uncertainty of model parameters are two critical issues for model regularization using NMF. This paper presents the Bayesian group sparse learning for NMF and applies it for single-channel music source separation. This method reconstructs the rhythmic or repetitive signal from a common subspace spanned by the shared bases for the whole signal and simultaneously decodes the harmonic or residual signal from an individual subspace consisting of separate bases for different signal segments. A Laplacian scale mixture distribution is introduced for sparse coding given a sparseness control parameter. The relevance of basis vectors for reconstructing two groups of music signals is automatically determined. A Markov chain Monte Carlo procedure is presented to infer two sets of model parameters and hyperparameters through a sampling procedure based on the conditional posterior distributions. Experiments on separating single-channel audio signals into rhythmic and harmonic source signals show that the proposed method outperforms baseline NMF, Bayesian NMF, and other group-based NMF in terms of signal-to-interference ratio.
URI: http://hdl.handle.net/11536/22418
http://dx.doi.org/10.1186/1687-4722-2013-18
ISSN: 1687-4722
DOI: 10.1186/1687-4722-2013-18
期刊: EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING
顯示於類別:期刊論文


文件中的檔案:

  1. 000321954800001.pdf