University of Wollongong
Browse

Noise robust keyword spotting for user generated video blogs

Download (357.06 kB)
conference contribution
posted on 2024-11-15, 13:25 authored by M S Barakat, Christian RitzChristian Ritz, David StirlingDavid Stirling
This paper presents a template-based system for speaker independent key word spotting (KWS) in continuous speech that can help in automatic analysis, indexing, search and retrieval of user generated videos by content. Extensive experiments on clean speech confirm that the proposed approach is superior to a HMM approach when applied to noisy speech with different signal-to-noise ratio (SNR) levels. Experiments conducted to detect swear words, personal names and product names within a set of online user generated video blogs shows significantly better recall and precision results compared to a traditional ASR-based approach.

History

Citation

M. S. Barakat, C. H. Ritz & D. A. Stirling, "Noise robust keyword spotting for user generated video blogs," in IEEE International Conference on Multimedia and Expo, 2013, pp. 1-6.

Parent title

Proceedings - IEEE International Conference on Multimedia and Expo

Language

English

RIS ID

84214

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC