Single-channel speech separation by including spectral structure information within non-negative matrix factorization

RIS ID

102811

Publication Details

Y. Feng, C. H. Ritz, et al "Single-channel speech separation by including spectral structure information within non-negative matrix factorization," in Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on, 2015, pp. 620-624.

Abstract

This paper proposes a novel extension on Non-negative Matrix Factorization (NMF) scheme for the separation of single channel speech mixtures, where we impose a post-sparse model on the original weight matrix derived from a previously proposed coherence-constrained NMF model. The approach considers both the modeling ability of NMF basis functions for each source as well as the ability of these basis functions to achieve accurate separation performance. Compared with latest associated NMF models for source separation, the results of our model indicate promising advantages, in terms of both objective source separation measures and Perceptual Evaluation of Speech Quality (PESQ) evaluations.

Please refer to publisher version or contact your library.

Share

COinS
 

Link to publisher version (DOI)

http://dx.doi.org/10.1109/ChinaSIP.2015.7230478