Faculty of Informatics - Papers (Archive)

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction

Eva Cheng, University of Wollongong
I. Burnett, Faculty of Informatics, University of Wollongong
Christian Ritz, University of Wollongong

RIS ID

22872

Publication Details

E. Cheng, I. S. Burnett & C. H. Ritz, "Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction", in International Conference on Signal Image Technology & Internet Based Systems (SITIS '07), 2007, pp. 494-500.

Abstract

Effective and efficient access to multiparty meeting recordings requires techniques for meeting analysis and indexing. Since meeting participants are generally stationary, speaker location information may be used to identify meeting events, e.g., to detect speaker changes. Time-delay estimation (TDE) based on cross-correlation of multichannel speech recordings is a common approach for deriving speech source location information. Earlier research improved TDE by computing it from linear prediction (LP) residual signals, obtained by applying LP analysis to each speech channel individually. This paper investigates the use of LP residuals for speech TDE where the residuals are obtained by jointly modeling the multiple speech channels. Experiments conducted with a simulated reverberant room and with real room recordings show that jointly modeled LP better predicts the LP coefficients than LP applied to individual channels. The individually and jointly modeled LP exhibit similar TDE performance, and both outperform TDE applied directly to the speech signal, especially with the real recordings.
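
As a rough illustration of the residual-based approach described in the abstract, the following Python sketch computes an LP residual for each channel separately and then estimates the inter-channel delay from the peak of the residuals' cross-correlation. It covers only the individual-channel case; the jointly modeled (multichannel) LP that the paper investigates is not reproduced here. The function names, the LP order of 12, the synthetic AR(1) test signal, and the 5-sample delay are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np

def lp_coefficients(x, order):
    """Autocorrelation-method LP: solve the normal equations R a = r."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Autocorrelation at lags 0..order
    r = np.correlate(x, x, mode="full")[n - 1 : n + order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # small ridge for numerical stability
    return np.linalg.solve(R, r[1 : order + 1])

def lp_residual(x, order=12):
    """Prediction error e[n] = x[n] - sum_k a[k] * x[n-k]."""
    x = np.asarray(x, dtype=float)
    a = lp_coefficients(x, order)
    pred = np.zeros_like(x)
    for k in range(1, order + 1):
        pred[k:] += a[k - 1] * x[:-k]
    return x - pred

def estimate_delay(ref, sig, max_lag):
    """Lag (in samples) by which sig trails ref, from the cross-correlation peak."""
    n = len(ref)
    xc = np.correlate(sig, ref, mode="full")  # lags -(n-1) .. (n-1)
    lags = np.arange(-(n - 1), n)
    keep = np.abs(lags) <= max_lag
    return int(lags[keep][np.argmax(xc[keep])])

# Hypothetical two-channel test signal: an AR(1) process and a delayed, noisy copy.
rng = np.random.default_rng(0)
excitation = rng.standard_normal(4000)
x = np.zeros_like(excitation)
for i in range(1, len(x)):
    x[i] = 0.9 * x[i - 1] + excitation[i]

true_delay = 5
y = np.concatenate([np.zeros(true_delay), x[:-true_delay]])
y += 0.01 * rng.standard_normal(len(y))

delay_from_residuals = estimate_delay(lp_residual(x), lp_residual(y), max_lag=20)
delay_from_signals = estimate_delay(x, y, max_lag=20)
print(delay_from_residuals, delay_from_signals)
```

In this clean synthetic case both estimates should agree. The residual-based variant is of interest because LP analysis removes much of the short-term correlation in the signal, which tends to sharpen the cross-correlation peak; the sketch does not model reverberation, so it says nothing about the performance differences reported in the paper.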

Included in

Physical Sciences and Mathematics Commons

Link to publisher version (DOI)

http://dx.doi.org/10.1109/SITIS.2007.96
