University of Wollongong
Browse

Detecting multiple, simultaneous talkers through localising speech recorded by ad-hoc microphone arrays

Download (609.84 kB)
conference contribution
posted on 2024-11-15, 19:52 authored by Shahab Pasha, Christian RitzChristian Ritz, Yue-Xian Zou
This paper proposes a novel approach to detecting multiple, simultaneous talkers in multi-party meetings using localisation of active speech sources recorded with an ad-hoc microphone array. Cues indicating the relative distance between sources and microphones are derived from speech signals and room impulse responses recorded by each of the microphones distributed at unknown locations within a room. Multiple active sources are localised by analysing a surface formed from these cues and derived at different locations within the room. The number of localised active sources per each frame or utterance is then counted to estimate when multiple sources are active. The proposed approach does not require prior information about the number and locations of sources or microphones. Synchronisation between microphones is also not required. A meeting scenario with competing speakers is simulated and results show that simultaneously active sources can be detected with an average accuracy of 75% and the number of active sources counted accurately 65% of the time.

History

Citation

S. Pasha, C. Ritz & Y. X. Zou, "Detecting multiple, simultaneous talkers through localising speech recorded by ad-hoc microphone arrays," in 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2016, 2016, pp. 1-6.

Parent title

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2016

Language

English

RIS ID

112821

Usage metrics

    Categories

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC