Scopus Harvesting Series

High Quality Audio Adversarial Examples Without Using Psychoacoustics

Wei Zong, University of Wollongong
Yang Wai Chow, University of Wollongong
Willy Susilo, University of Wollongong

Publication Name

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract

In the automatic speech recognition (ASR) domain, most, if not all, current audio adversarial examples (AEs) are generated by applying perturbations to input audio. Adversaries either constrain the norm of the perturbations or hide the perturbations below the hearing threshold based on psychoacoustics. Each approach has its own problem: norm-constrained perturbations introduce noticeable noise, while hiding perturbations below the hearing threshold can be prevented by deliberately removing inaudible components from the audio. In this paper, we present a novel method of generating targeted audio AEs. The perceptual quality of our audio AEs is significantly better than that of audio AEs generated by applying norm-constrained perturbations. Furthermore, unlike approaches that rely on psychoacoustics to hide perturbations below the hearing threshold, we show that our audio AEs can still be successfully generated even when inaudible components are removed from the audio.

Open Access Status

This publication is not available as open access

Volume

13547 LNCS

First Page

163

Last Page

177

Link to publisher version (DOI)

http://dx.doi.org/10.1007/978-3-031-18067-5_12