University of Wollongong
Browse

Scalable speech coding spanning the 4 Kbps divide

Download (371.64 kB)
conference contribution
posted on 2024-11-13, 16:59 authored by J Lukasiak, Ian Burnett
This paper examines a scalable method for coding the LP residual. The scalable method is capable of increasing the accuracy of the reconstructed speech from a parametric representation at low rates to a more accurate waveform matched representation at higher rates. The method entails pitch length segmentation, decomposition into pulsed and noise components and modeling of the pulsed components using a fixed shape pulse model in a closed-loop, Analysis by Synthesis system. Subjective testing is presented that indicates that in addition to the AbyS modeling, the pulse parameter evolution must be constrained in synthesis. Results indicate that this proposed method is capable of producing perceptually scalable speech quality as the bit rate is increased through 4 kbps.

History

Citation

This article was published as: Lukiasiak, J & Burnett, I, Scalable speech coding spanning the 4 Kbps divide, Proceedings Seventh International Symposium on Signal Processing and Its Applications, 1-4 July 2003, vol 1, 397-400. Copyright IEEE 2003.

Language

English

RIS ID

9531

Usage metrics

    Categories

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC