University of Wollongong
Browse

Multivariate autoregressive modelling of multichannel reverberant speech

Download (476.08 kB)
conference contribution
posted on 2024-11-14, 10:17 authored by Eva Cheng, Ian Burnett, Christian RitzChristian Ritz
Recent research in speech localization and dereverberation introduced processing of the multichannel linear prediction (LP) residual of speech recorded with multiple microphones. This paper investigates the novel use of intra- and inter-channel speech prediction by proposing the use of a multichannel LP model derived from multivariate autoregression (MVAR), where current LP approaches are based on univariate autoregression (AR). Experiments were conducted on simulated anechoic and reverberant synthetic speech vowels and real speech sentences; results show that, especially at low reverberation times, the MVAR model exhibits greater prediction gains from the residual signal, compared to residuals obtained from univariate AR models for individually or jointly modelled speech channels. In addition, the MVAR model more accurately models the speech signal when compared to univariate LP of a similar prediction order and when a smaller number of microphones are deployed.

History

Citation

E. Cheng, I. S. Burnett & C. H. Ritz, "Multivariate autoregressive modelling of multichannel reverberant speech," in International Workshop on Multimedia Signal Processing, 2008, pp. 945-949.

Parent title

Proceedings of the 2008 IEEE 10th Workshop on Multimedia Signal Processing, MMSP 2008

Pagination

945-949

Language

English

RIS ID

24938

Usage metrics

    Categories

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC