Poisoning Web-Scale Training Datasets is Practical

Publication: Article, Preprint, Conference object
Date: 19 May 2024
Embargo end date: 01 Jan 2023
Publisher: IEEE
Journal: 2024 IEEE Symposium on Security and Privacy (SP)

Authors: Nicholas Carlini; Matthew Jagielski; Christopher A. Choquette-Choo; Daniel Paleka; Will Pearce; Hyrum S. Anderson; Andreas Terzis; +2 Authors

doi: 10.1109/sp54263.2024.00179, 10.48550/arxiv.2302.10149

arXiv: 2302.10149

Abstract

Deep learning models are often trained on distributed, web-scale datasets crawled from the internet. In this paper, we introduce two new dataset poisoning attacks that intentionally introduce malicious examples to degrade a model's performance. Our attacks are immediately practical and could, today, poison 10 popular datasets. Our first attack, split-view poisoning, exploits the mutable nature of internet content to ensure that a dataset annotator's initial view of the dataset differs from the view downloaded by subsequent clients. By exploiting specific invalid trust assumptions, we show how we could have poisoned 0.01% of the LAION-400M or COYO-700M datasets for just $60 USD. Our second attack, frontrunning poisoning, targets web-scale datasets that periodically snapshot crowd-sourced content, such as Wikipedia, where an attacker needs only a time-limited window to inject malicious examples. In light of both attacks, we have notified the maintainers of each affected dataset and recommended several low-overhead defenses.
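The split-view scenario can be made concrete with a short sketch. The abstract does not say which defenses were recommended, so the content-hash check below is an assumption chosen for illustration, not the authors' method. The function names (`sha256_hex`, `build_index`, `find_mismatches`) and the example URLs are hypothetical. The idea is that the annotator records a digest of each item when the index is built, and a later client flags any URL whose downloaded bytes no longer match.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the hex-encoded SHA-256 digest of raw bytes."""
    return hashlib.sha256(data).hexdigest()


def build_index(annotator_view: dict[str, bytes]) -> dict[str, str]:
    """Map each URL to the digest of the content the annotator saw."""
    return {url: sha256_hex(content) for url, content in annotator_view.items()}


def find_mismatches(index: dict[str, str], client_view: dict[str, bytes]) -> list[str]:
    """Return URLs whose downloaded content is missing or differs from the index."""
    flagged = []
    for url, expected_digest in index.items():
        content = client_view.get(url)
        if content is None or sha256_hex(content) != expected_digest:
            flagged.append(url)
    return sorted(flagged)


# Hypothetical data: what the annotator saw when the index was published.
annotator_view = {
    "https://example.com/a.jpg": b"original image a",
    "https://example.com/b.jpg": b"original image b",
}
index = build_index(annotator_view)

# A later client downloads the same URLs, but one now serves different bytes.
client_view = dict(annotator_view)
client_view["https://example.com/b.jpg"] = b"attacker-controlled bytes"

print(find_mismatches(index, client_view))
```

A check of this kind only detects content that changed after the index was built. It would not help against frontrunning poisoning, where the malicious content is already present when the snapshot is taken.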

Related Organizations

DeepMind (United Kingdom), United Kingdom
ETH Zurich, Switzerland
Google (Canada), Canada

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Cryptography and Security, Cryptography and Security (cs.CR), Machine Learning (cs.LG)

Related research (3 research products)

coyo-dataset, software on GitHub (IsRelatedTo)
diffusers, software on GitHub (IsRelatedTo)
img2dataset, software on GitHub (IsRelatedTo)

Impact by BIP!

selected citations: 17. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.