
This is part 6 of the IDNet dataset of our research paper "IDNet: A Novel Identity Document Dataset via Few-Shot and Quality-Driven Synthetic Data Generation. Here's a link to the paper: https://ieeexplore.ieee.org/document/10825017 Citation: @inproceedings{xie2024idnet, title={IDNet: A Novel Identity Document Dataset via Few-Shot and Quality-Driven Synthetic Data Generation}, author={Xie, Lulu and Wang, Yancheng and Guan, Hong and Nag, Soham and Goel, Rajeev and Swamy, Niranjan and Yang, Yingzhen and Xiao, Chaowei and Prisby, Jonathan and Maciejewski, Ross and others}, booktitle={2024 IEEE International Conference on Big Data (BigData)}, pages={2244--2253}, year={2024}, organization={IEEE}} @article{guan2024idnet, title={IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection}, author={Guan, Hong and Wang, Yancheng and Xie, Lulu and Nag, Soham and Goel, Rajeev and Swamy, Niranjan Erappa Narayana and Yang, Yingzhen and Xiao, Chaowei and Prisby, Jonathan and Maciejewski, Ross and Zou, Jia}, journal={arXiv preprint arXiv:2408.01690}, year={2024}}
Successful analysis and fraud detection of identity documents (e.g., passports, driver's licenses, and identity cards) is critical to prevent identity theft and ensure security for online platforms. We created a new benchmark dataset of identity documents to facilitate privacy-preserving fraud detection. The dataset contains 597,900 images (about 400 gigabytes) of synthetically generated identity documents, including driver licenses from 10 U.S. States and passport/ID cards from 10 European countries. Due to the large dataset size, we divided it into multiple parts, all available on Zenodo. Each zip file contains the original synthetic document, four fraud patterns (text field replacement, face morphing, portrait substitution, mixed fraud patterns), and metadata. All portrait images in this dataset are synthetic and are provided by https://generated.photos/
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
