Defending against Adversarial Audio via Diffusion Model

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jan 2023Embargo end date: 01 Jan 2023Publisher:arXivJournal:CoRR, volume abs/2303.01507

Authors: Shutong Wu; Jiongxiao Wang; Wei Ping; Weili Nie; Chaowei Xiao;

doi: 10.48550/arxiv.2303.01507

arXiv: 2303.01507

Defending against Adversarial Audio via Diffusion Model

- Summary
- Subjects
- Related research
  (6)
- Metrics

Abstract

Deep learning models have been widely used in commercial acoustic systems in recent years. However, adversarial audio examples can cause abnormal behaviors for those acoustic systems, while being hard for humans to perceive. Various methods, such as transformation-based defenses and adversarial training, have been proposed to protect acoustic systems from adversarial attacks, but they are less effective against adaptive attacks. Furthermore, directly applying the methods from the image domain can lead to suboptimal results because of the unique properties of audio data. In this paper, we propose an adversarial purification-based defense pipeline, AudioPure, for acoustic systems via off-the-shelf diffusion models. Taking advantage of the strong generation ability of diffusion models, AudioPure first adds a small amount of noise to the adversarial audio and then runs the reverse sampling step to purify the noisy audio and recover clean audio. AudioPure is a plug-and-play method that can be directly applied to any pretrained classifier without any fine-tuning or re-training. We conduct extensive experiments on speech command recognition task to evaluate the robustness of AudioPure. Our method is effective against diverse adversarial attacks (e.g. $\mathcal{L}_2$ or $\mathcal{L}_\infty$-norm). It outperforms the existing methods under both strong adaptive white-box and black-box attacks bounded by $\mathcal{L}_2$ or $\mathcal{L}_\infty$-norm (up to +20\% in robust accuracy). Besides, we also evaluate the certified robustness for perturbations bounded by $\mathcal{L}_2$-norm via randomized smoothing. Our pipeline achieves a higher certified accuracy than baselines.

Related Organizations

Shanghai Jiao Tong University
China (People's Republic of)
Shanghai Jiao Tong University
Arizona State University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer Science - Cryptography and Security, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Cryptography and Security (cs.CR), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Machine Learning (cs.LG)

6 Research products, page 1 of 1

H∞ network optimization for edge consensus
2021IsAmongTopNSimilarDocuments
Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence
2021IsAmongTopNSimilarDocuments
Large-Scale Computation of $\mathcal{L}_\infty$-Norms by a Greedy Subspace Method
2017IsAmongTopNSimilarDocuments
A Shared Memory Parallel Implementation of the IRKA Algorithm for $\mathcal{H}_2$ Model Order Reduction
2013IsAmongTopNSimilarDocuments
A Subspace Framework for ${\mathcal H}_\infty$-Norm Minimization
2020IsAmongTopNSimilarDocuments
AudioPure software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average