Scopus Harvesting Series

DANAA: Towards Transferable Attacks with Double Adversarial Neuron Attribution

Zhibo Jin, The University of Sydney
Zhiyu Zhu, The University of Sydney
Xinyi Wang, Jiangsu University
Jiayu Zhang, Suzhou Yierqi
Jun Shen, University of Wollongong
Huaming Chen, The University of Sydney

Publication Name

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract

While deep neural networks achieve excellent results in many fields, they are susceptible to adversarial samples that cause erroneous predictions. Feature-level attacks are an effective attack type: they target the features learned in the hidden layers to improve transferability across different models. However, transferability has been observed to depend heavily on the results of neuron importance estimation. In this paper, a double adversarial neuron attribution attack method, termed ‘DANAA’, is proposed to obtain more accurate feature importance estimation. In our method, the model outputs are attributed to the middle layer based on an adversarial non-linear path. The goal is to measure the weight of individual neurons and retain the features that are more important for transferability. We have conducted extensive experiments on benchmark datasets to demonstrate the state-of-the-art performance of our method. Our code is available at: https://github.com/Davidjinzb/DANAA.

Open Access Status

This publication may be available as open access

Volume

14177 LNAI

First Page

456

Last Page

470

Link to publisher version (DOI)

http://dx.doi.org/10.1007/978-3-031-46664-9_31
