Leveraging AutoEncoders and chaos theory to improve adversarial example detection

descriptionPublicationkeyboard_double_arrow_right Article , Journal , Other literature type 24 Jul 2024 Spain English Publisher:Springer Science and Business Media LLCJournal:Neural Computing and Applications, volume 36, pages 18,265-18,275 (issn: 0941-0643, eissn: 1433-3058,

Copyright policy )Funded by:EC | dAIEDGE

Authors: Pedraza, Anibal; Deniz, Oscar; Singh, Harbinder; Bueno, Gloria;

doi: 10.1007/s00521-024-10141-1

handle: 10578/45807

Leveraging AutoEncoders and chaos theory to improve adversarial example detection

- Summary
- Subjects
- Metrics

Abstract

AbstractThe phenomenon of adversarial examples is one of the most attractive topics in machine learning research these days. These are particular cases that are able to mislead neural networks, with critical consequences. For this reason, different approaches are considered to tackle the problem. On the one side, defense mechanisms, such as AutoEncoder-based methods, are able to learn from the distribution of adversarial perturbations to detect them. On the other side, chaos theory and Lyapunov exponents (LEs) have also been shown to be useful to characterize them. This work proposes the combination of both domains. The proposed method employs these exponents to add more information to the loss function that is used during an AutoEncoder training process. As a result, this method achieves a general improvement in adversarial examples detection performance for a wide variety of attack methods.

Country

Spain

Related Organizations

University of Castile-La Mancha
Spain

Keywords

Adversarial examples, Trustworthy machine learning, Lyapunov exponents, Chaos theory, AutoEncoders

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average