University of Wollongong

A general compression approach to multi-channel three-dimensional audio

journal contribution
posted on 2024-11-16, 08:57 authored by Bin Cheng, Christian Ritz, Ian Burnett, Xiguang Zheng
This paper presents a technique for low bit rate compression of three-dimensional (3D) audio produced by multiple loudspeaker channels. The approach is based on the time-frequency analysis of the localization of spatial sound sources within the 3D space as rendered by a multi-channel audio signal (in this case 16 channels). This analysis results in the derivation of a stereo downmix signal representing the original 16 channels. Alternatively, a mono downmix signal with side information representing the location of sound sources within the 3D spatial scene can also be derived. The resulting downmix signals are then compressed with a traditional audio coder, resulting in a representation of the 3D soundfield at bit rates comparable with existing stereo audio coders while maintaining the perceptual quality produced from separate encoding of each channel. © 2006-2012 IEEE.
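
The abstract does not give the paper's localization equations, so the following is only an illustrative sketch of the general idea of reducing many loudspeaker channels to a downmix value plus a source direction for one time-frequency tile. It assumes a simple energy-weighted average of loudspeaker direction vectors, which is not necessarily the authors' method. The function names `speaker_unit_vector` and `analyse_tile`, and the loudspeaker layout in the usage note, are hypothetical.

```python
import math


def speaker_unit_vector(azimuth_deg, elevation_deg):
    """Convert a loudspeaker direction (degrees) to a 3D unit vector."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az),
            math.cos(el) * math.sin(az),
            math.sin(el))


def analyse_tile(magnitudes, speaker_dirs):
    """Summarise one time-frequency tile of a multi-channel signal.

    magnitudes   -- per-channel spectral magnitudes for this tile
    speaker_dirs -- per-channel (azimuth_deg, elevation_deg) pairs

    Returns (total_energy, azimuth_deg, elevation_deg).
    ASSUMPTION: the source direction is taken as the energy-weighted
    mean of the loudspeaker direction vectors; the paper may use a
    different localization analysis.
    """
    energies = [m * m for m in magnitudes]
    total = sum(energies)
    if total == 0.0:
        # Silent tile: no meaningful direction, report the origin angles.
        return 0.0, 0.0, 0.0

    vectors = [speaker_unit_vector(az, el) for az, el in speaker_dirs]
    x = sum(e * v[0] for e, v in zip(energies, vectors)) / total
    y = sum(e * v[1] for e, v in zip(energies, vectors)) / total
    z = sum(e * v[2] for e, v in zip(energies, vectors)) / total

    azimuth = math.degrees(math.atan2(y, x))
    elevation = math.degrees(math.atan2(z, math.hypot(x, y)))
    return total, azimuth, elevation
```

For example, with four hypothetical loudspeakers on the horizontal plane at azimuths 0, 90, 180 and 270 degrees, `analyse_tile([1.0, 1.0, 0.0, 0.0], [(0, 0), (90, 0), (180, 0), (270, 0)])` places the source at about 45 degrees azimuth and 0 degrees elevation, midway between the two active loudspeakers. In a mono-downmix scheme of the kind the abstract describes, direction values like these would play the role of the side information sent alongside the downmix.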

Funding

Encoding and Communicating Navigable Soundfields

Australian Research Council


History

Citation

B. Cheng, C. Ritz, I. Burnett & X. Zheng, "A general compression approach to multi-channel three-dimensional audio," IEEE Transactions on Audio, Speech and Language Processing, vol. 21, no. 8, pp. 1676-1688, 2013.

Journal title

IEEE Transactions on Audio, Speech and Language Processing

Volume

21

Issue

8

Pagination

1676-1688

Language

English

RIS ID

78665
