Generating Adversarial Examples with Adversarial Networks

Publication: Article, Preprint, Conference object
Date: 01 Jul 2018
Embargo end date: 01 Jan 2018
Publisher: International Joint Conferences on Artificial Intelligence Organization
Journal: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Funded by: NSF | TWC: Small: Understanding...

Authors: Chaowei Xiao; Bo Li; Jun-Yan Zhu; Warren He; Mingyan Liu; Dawn Song

doi: 10.24963/ijcai.2018/543, 10.48550/arxiv.1801.02610

arXiv: 1801.02610

Abstract

Deep neural networks (DNNs) have been found to be vulnerable to adversarial examples resulting from adding small-magnitude perturbations to inputs. Such adversarial examples can mislead DNNs to produce adversary-selected results. Different attack strategies have been proposed to generate adversarial examples, but how to produce them with high perceptual quality and more efficiently requires more research effort. In this paper, we propose AdvGAN to generate adversarial examples with generative adversarial networks (GANs), which can learn and approximate the distribution of original instances. For AdvGAN, once the generator is trained, it can generate perturbations efficiently for any instance, which could potentially accelerate adversarial training as a defense. We apply AdvGAN in both semi-whitebox and black-box attack settings. In semi-whitebox attacks, there is no need to access the original target model after the generator is trained, in contrast to traditional white-box attacks. In black-box attacks, we dynamically train a distilled model for the black-box model and optimize the generator accordingly. Adversarial examples generated by AdvGAN on different target models have a high attack success rate under state-of-the-art defenses compared to other attacks. Our attack placed first with 92.76% accuracy on a public MNIST black-box attack challenge.
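The abstract's central efficiency claim is that, after training, an adversarial example costs only one forward pass through the generator and needs no access to the target model. The following is a minimal sketch of that inference step only, not the authors' implementation. The names `make_toy_generator` and `craft_adversarial`, the tanh-and-scale bounding of the perturbation, the value `epsilon=0.3`, and the [0, 1] pixel range are all assumptions made for illustration; a real AdvGAN generator is a trained neural network, and a fixed random linear map stands in for it here.

```python
import math
import random


def make_toy_generator(dim, seed=0):
    """Return a stand-in for a trained generator G (hypothetical).

    A fixed random linear map followed by tanh plays the role of the
    trained network so that the example runs without any ML library.
    """
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(dim)]

    def generator(x):
        # One "forward pass": each output lies in [-1, 1] because of tanh.
        return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]

    return generator


def craft_adversarial(x, generator, epsilon=0.3):
    """Compute x_adv = clip(x + epsilon * G(x), 0, 1).

    Only the generator is queried; the target model is never called.
    Scaling by epsilon is an assumed way to bound the perturbation.
    """
    perturbation = [epsilon * g for g in generator(x)]
    return [min(1.0, max(0.0, xi + pi)) for xi, pi in zip(x, perturbation)]


if __name__ == "__main__":
    g = make_toy_generator(dim=4, seed=1)
    x = [0.0, 0.5, 1.0, 0.25]  # a toy "image" with pixel values in [0, 1]
    print(craft_adversarial(x, g))
```

In this sketch every coordinate of the adversarial input stays within `epsilon` of the original and inside the valid pixel range. Whether the result actually fools a classifier depends entirely on how the generator was trained, which the sketch does not cover.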

Related Organizations

Massachusetts Institute of Technology (United States)
University of Michigan–Ann Arbor (United States)
University of Michigan–Flint (United States)
University of California, Berkeley (United States)

Keywords

FOS: Computer and information sciences, Cryptography and Security (cs.CR), Computer Vision and Pattern Recognition (cs.CV), Machine Learning (stat.ML)

Related research products (3)

mnist_challenge (software on GitHub), IsRelatedTo
cifar10_challenge (software on GitHub), IsRelatedTo
fast-neural-style (software on GitHub), IsRelatedTo

Impact by BIP!

Selected citations: 553. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Top 0.1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Top 0.1%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Top 0.1%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.