Terminal Brain Damage: Exposing the Graceless Degradation in Deep Neural Networks Under Hardware Fault Attacks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jan 2019Embargo end date: 01 Jan 2019 Netherlands Publisher:arXivJournal:CoRR, volume abs/1906.01017Funded by:EC | UNICORE, NWO | How to Securely Update a ..., EC | REACT

Authors: Hong, Sanghyun; Frigo, Pietro; Kaya, Yiğitcan; Giuffrida, Cristiano; Dumitras, Tudor;

doi: 10.48550/arxiv.1906.01017

arXiv: 1906.01017

handle: 1871.1/b5dde830-5d8f-441e-8b8c-a0e55320113f

Terminal Brain Damage: Exposing the Graceless Degradation in Deep Neural Networks Under Hardware Fault Attacks

- Summary
- Subjects
- Related research
  (3)
- Metrics

Abstract

Deep neural networks (DNNs) have been shown to tolerate "brain damage": cumulative changes to the network's parameters (e.g., pruning, numerical perturbations) typically result in a graceful degradation of classification accuracy. However, the limits of this natural resilience are not well understood in the presence of small adversarial changes to the DNN parameters' underlying memory representation, such as bit-flips that may be induced by hardware fault attacks. We study the effects of bitwise corruptions on 19 DNN models---six architectures on three image classification tasks---and we show that most models have at least one parameter that, after a specific bit-flip in their bitwise representation, causes an accuracy loss of over 90%. We employ simple heuristics to efficiently identify the parameters likely to be vulnerable. We estimate that 40-50% of the parameters in a model might lead to an accuracy drop greater than 10% when individually subjected to such single-bit perturbations. To demonstrate how an adversary could take advantage of this vulnerability, we study the impact of an exemplary hardware fault attack, Rowhammer, on DNNs. Specifically, we show that a Rowhammer enabled attacker co-located in the same physical machine can inflict significant accuracy drops (up to 99%) even with single bit-flip corruptions and no knowledge of the model. Our results expose the limits of DNNs' resilience against parameter perturbations induced by real-world fault attacks. We conclude by discussing possible mitigations and future research directions towards fault attack-resilient DNNs.

Accepted to USENIX Security Symposium (USENIX) 2019

Country

Netherlands

Related Organizations

University of Maryland, College Park
United States
Vrije Universiteit Amsterdam
Netherlands
Vrije Universiteit Amsterdam
University of Maryland
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, SDG 16 - Peace, Computer Science - Cryptography and Security, Cryptography and Security (cs.CR), Justice and Strong Institutions, Machine Learning (cs.LG)

3 Research products, page 1 of 1

vision software on GitHub
IsRelatedTo
XNOR-Net-PyTorch software on GitHub
IsRelatedTo
quantized.pytorch software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average