Compression of navigable speech soundfield zones
RIS ID
49512
Abstract
This paper presents a new coding architecture for the compression of navigable speech soundfield zones. The proposed coding scheme encodes multiple speech soundfields, each representing different spatial zones, into a mono or stereo sound-field mixture signal that can be compressed with an existing speech or audio coder. The resulting compressed signals can be decoded back to individual soundfield zones. Objective and subjective testing results show that the approach successfully compresses up to 3 speech soundfields (each consisting of 4 individual speakers) at a bit rate of 48 kbps whilst maintaining the perceptual quality of each decoded soundfield zone.
Publication Details
zheng, X. & Ritz, C. (2011). Compression of navigable speech soundfield zones. 2011 IEEE 13rd International Workshop on Multimedia Signal Processing (MMSP) (pp. 1-6). USA: IEEE.