Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings
RIS ID
114041
Abstract
Coherent-to-diffuse ratio (CDR) estimates over short time frames are used for source counting with ad-hoc microphone arrays that record speech from multiple participants in scenarios such as meetings. It is shown that CDR estimates obtained at ad-hoc dual (two-channel) microphone nodes, placed at unknown locations within an unknown reverberant room, can detect time frames with more than one active source and are informative for source counting applications. Results show that interfering sources can be detected with accuracies ranging from 69% to 89% for delays ranging from 20 ms to 300 ms, and that source counting accuracies range from 61% to 81% for two sources over the same range of delays.
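As an illustration of the quantity the abstract refers to, the following is a minimal sketch of how a CDR value might be derived from the coherence between the two channels of one node. It is not the estimator used in the paper, which the abstract does not specify. The sketch assumes the diffuse component is fully incoherent between the two microphones, so that the coherence magnitude equals CDR / (CDR + 1). The function names (channel_coherence, cdr_from_coherence, cdr_db) and the input layout (per-frame complex spectral values for a single frequency bin) are hypothetical.

```python
import math


def channel_coherence(x1, x2):
    """Coherence magnitude between two channels at one frequency bin.

    x1, x2: equal-length sequences of complex spectral values, one per
    short time frame (hypothetical input layout).
    """
    # Cross-power and auto-power, averaged (summed) over the frames.
    cross = sum(a * b.conjugate() for a, b in zip(x1, x2))
    p1 = sum(abs(a) ** 2 for a in x1)
    p2 = sum(abs(b) ** 2 for b in x2)
    if p1 == 0.0 or p2 == 0.0:
        return 0.0  # no signal energy: treat as fully incoherent
    return abs(cross) / math.sqrt(p1 * p2)


def cdr_from_coherence(coh_mag, eps=1e-12):
    """CDR (linear scale) under the incoherent-diffuse-field assumption.

    With |coherence| = CDR / (CDR + 1), solving for CDR gives
    CDR = |coherence| / (1 - |coherence|).
    """
    g = min(max(coh_mag, 0.0), 1.0 - eps)  # clip to avoid division by zero
    return g / (1.0 - g)


def cdr_db(coh_mag):
    """CDR in decibels, floored to avoid log of zero."""
    return 10.0 * math.log10(max(cdr_from_coherence(coh_mag), 1e-12))
```

Under this assumption a coherence magnitude of 0.5 corresponds to a CDR of 1 (0 dB), that is, equal coherent and diffuse power. Low CDR values in a frame indicate a weaker dominant coherent component; how such values are mapped to a source count is specific to the paper and is not reproduced here.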
Publication Details
S. Pasha, J. Donley, C. Ritz & Y. Zou, "Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings," in 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings, 2017, pp. 161-165.