DF40: Toward Next-Generation Deepfake Detection

Name: DF40: Toward Next-Generation Deepfake Detection
Keywords: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024Publisher:arXiv

Authors: Yan, Zhiyuan; Yao, Taiping; Chen, Shen; Zhao, Yandan; Fu, Xinghe; Zhu, Junwei; Luo, Donghao; +4 Authors

doi: 10.48550/arxiv.2406.13495

arXiv: 2406.13495

DF40: Toward Next-Generation Deepfake Detection

- Summary
- Subjects
- Related research
  (21)
- Metrics

Abstract

We propose a new comprehensive benchmark to revolutionize the current deepfake detection field to the next generation. Predominantly, existing works identify top-notch detection algorithms and models by adhering to the common practice: training detectors on one specific dataset (e.g., FF++) and testing them on other prevalent deepfake datasets. This protocol is often regarded as a "golden compass" for navigating SoTA detectors. But can these stand-out "winners" be truly applied to tackle the myriad of realistic and diverse deepfakes lurking in the real world? If not, what underlying factors contribute to this gap? In this work, we found the dataset (both train and test) can be the "primary culprit" due to: (1) forgery diversity: Deepfake techniques are commonly referred to as both face forgery and entire image synthesis. Most existing datasets only contain partial types of them, with limited forgery methods implemented; (2) forgery realism: The dominated training dataset, FF++, contains out-of-date forgery techniques from the past four years. "Honing skills" on these forgeries makes it difficult to guarantee effective detection generalization toward nowadays' SoTA deepfakes; (3) evaluation protocol: Most detection works perform evaluations on one type, which hinders the development of universal deepfake detectors. To address this dilemma, we construct a highly diverse deepfake detection dataset called DF40, which comprises 40 distinct deepfake techniques. We then conduct comprehensive evaluations using 4 standard evaluation protocols and 8 representative detection methods, resulting in over 2,000 evaluations. Through these evaluations, we provide an extensive analysis from various perspectives, leading to 7 new insightful findings. We also open up 4 valuable yet previously underexplored research questions to inspire future works. Our project page is https://github.com/YZY-stack/DF40.

arXiv admin note: text overlap with arXiv:2108.05080 by other authors

Related Organizations

PEKING UNIVERSITY
China (People's Republic of)
Peking University
PEKING UNIVERSITY
China (People's Republic of)
Peking University
China (People's Republic of)
Pekin University
China (People's Republic of)

View all View all

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

21 Research products, page 1 of 3

ICCV2023-MCNET software on GitHub
IsRelatedTo
stargan-v2 software on GitHub
IsRelatedTo
FaceSwap software on GitHub
IsRelatedTo
stylegan-xl software on GitHub
IsRelatedTo
inswapper software on GitHub
IsRelatedTo
DeepFaceLab software on GitHub
IsRelatedTo
FaceForensics software on GitHub
IsRelatedTo
few-shot-vid2vid software on GitHub
IsRelatedTo
RDDM software on GitHub
IsRelatedTo
SadTalker software on GitHub
IsRelatedTo

chevron_left
1
2
3
chevron_right

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green