Faculty of Engineering and Information Sciences - Papers: Part A

A psychoacoustic-based analysis-by-synthesis scheme for jointly encoding multiple audio objects into independent mixtures

Xiguang Zheng, University of Wollongong
Christian Ritz, University of Wollongong
Jiangtao Xi, University of Wollongong

RIS ID

86151

Publication Details

Zheng, X., Ritz, C. & Xi, J. (2013). A psychoacoustic-based analysis-by-synthesis scheme for jointly encoding multiple audio objects into independent mixtures. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 281-285). Institute of Electrical and Electronics Engineers.

Abstract

Perceptually accurate representation of audio objects obtained from multi-track audio signals is desired for applications such as interactive soundfield rendering and browsing. Presented in this work is a scalable psychoacoustic analysis-by-synthesis approach to extract the perceptually dominant time-frequency audio objects from a multi-track audio signal. The proposed compression framework exploits sparsity in the perceptual time-frequency domain, where up to eight audio objects can be efficiently encoded using only two audio mixtures with side information representing the origin of the time-frequency instances in the mixture signals. The proposed approach, judged by both objective and subjective tests, results in superior audio quality compared to existing techniques when encoding more than five audio objects.

Please refer to publisher version or contact your library.

Link to publisher version (DOI)

http://dx.doi.org/10.1109/ICASSP.2013.6637653
