University of Wollongong
Browse

Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and post-filtering approach

Download (3.79 MB)
journal contribution
posted on 2024-11-15, 14:43 authored by Yuexian Zou, Peng Wang, Yong Wang, Christian RitzChristian Ritz, Jiangtao XiJiangtao Xi
Speech enhancement has an increasing demand in mobile communications and faces a great challenge in a real ambient noisy environment. This paper develops an effective spatialfrequency domain speech enhancement method with a single acoustic vector sensor (AVS) in conjunction with minimum variance distortionless response (MVDR) spatial filtering and Wiener post-filtering (WPF) techniques. In remote speech applications, the MVDR spatial filtering is effective in suppressing the strong spatial interferences and the Wiener postfiltering is considered as a popular and powerful estimator to further suppress the residual noise if the power spectral density (PSD) of target speech can be estimated properly. With the favorable directional response of the AVS together with the trigonometric relations of the steering vectors, the closed-form estimation of the signal PSDs is derived and the frequency response of the optimal Wiener post-filter is determined accordingly. Extensive computer simulations and a real experiment in an anechoic chamber condition have been carried out to evaluate the performance of the proposed algorithm. Simulation results show that the proposed method offers good ability to suppress the spatial interference while maintaining comparable log spectral deviation and perceptual evaluation of speech quality performance compared with the conventional methods with several objective measures. Moreover, a single AVS solution is particularly attractive for hands-free speech applications due to its compact size.

History

Citation

Y. Zou, P. Wang, Y. Wang, C. Ritz & J. Xi, "Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and post-filtering approach," EURASIP Journal of Audio, Speech, and Music Processing, vol. 17, (2014) pp. 1-23, 2014.

Journal title

Eurasip Journal on Audio, Speech, and Music Processing

Volume

2014

Language

English

RIS ID

89616

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC