Compression of navigable speech soundfield zones

RIS ID

49512

Publication Details

zheng, X. & Ritz, C. (2011). Compression of navigable speech soundfield zones. 2011 IEEE 13rd International Workshop on Multimedia Signal Processing (MMSP) (pp. 1-6). USA: IEEE.

Abstract

This paper presents a new coding architecture for the compression of navigable speech soundfield zones. The proposed coding scheme encodes multiple speech soundfields, each representing different spatial zones, into a mono or stereo sound-field mixture signal that can be compressed with an existing speech or audio coder. The resulting compressed signals can be decoded back to individual soundfield zones. Objective and subjective testing results show that the approach successfully compresses up to 3 speech soundfields (each consisting of 4 individual speakers) at a bit rate of 48 kbps whilst maintaining the perceptual quality of each decoded soundfield zone.

Please refer to publisher version or contact your library.

Share

COinS
 

Link to publisher version (DOI)

http://dx.doi.org/10.1109/MMSP.2011.6093795