Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2024
License: CC BY SA
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2024
License: CC BY SA
Data sources: ZENODO
ZENODO
Dataset . 2024
License: CC BY SA
Data sources: Datacite
ZENODO
Dataset . 2024
License: CC BY SA
Data sources: Datacite
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Long-Tailed Anomaly Detection (LTAD) Dataset

Authors: Ho, Chih-Hui; Peng, Kuan-Chuan; Vasconcelos, Nuno;

Long-Tailed Anomaly Detection (LTAD) Dataset

Abstract

Introduction Anomaly detection (AD) aims to identify defective images and localize their defects (if any). Ideally, AD models should be able to: detect defects over many image classes; not rely on hard-coded class names that can be uninformative or inconsistent across datasets; learn without anomaly supervision; and be robust to the long-tailed distributions of real-world applications. To address these challenges, we formulate the problem of long-tailed AD by introducing several datasets with different levels of class imbalance for performance evaluation. To encourage more follow up works on long-tailed AD, we are publicly releasing the dataset split used in our paper (“Long-Tailed Anomaly Detection with Learnable Class Names” by Chih-Hui Ho, Kuan-Chuan Peng, and Nuno Vasconcelos, CVPR 2024). Files in the unzipped folder: 1. ./README.md: This Markdown file 2. ./dataset_split: Folder contains long-tail splits from three datasets. See below for details. At a Glance The size of the unzipped dataset is ~16MB Three datasets are used in this project, including [MVTec](https://www.mvtec.com/company/research/datasets/mvtec-ad), [VisA](https://github.com/amazon-science/spot-diff) and [DAGM](https://www.kaggle.com/datasets/mhskjelvareid/dagm-2007-competition-dataset-optical-inspection). Please download the datasets from their original repositories. The dataset split provided in this folder is organized as follows:```dataset_split|---dagm_lt|---mvtec_lt|---visa_lt|-----|-- exp|-----|-----|----- 100|-----|-----|----- |-----test.json|-----|-----|----- |-----train.json|-----|-----|----- 200|-----|-- step|-----|-- ...``` Each long-tailed dataset split contains a subfolder ``imbalance_type/imbalance_factor", where imbalance type can be [exponential (exp), step, reverse exponential (exp_reverse), reverse step (step_reverse)]. The definition of imbalance type and imbalance factor can be found in our paper. Each subfolder contains two json files, one for training and the other for testing. Each entry in the json file contains the meta information of an image and is similar to```{"filename": "candle/test/bad/000.JPG", "label": 1, "label_name": "defective", "clsname": "candle", "maskname": "candle/ground_truth/bad/000.png"}```- filename: location of the input image in the dataset- label: indicates whether the input image is normal (labeled as 0) or defective (labeled as 1)- label name: can be "good" or "defective"- clsname: class name of the input image- maskname (optional): location of the binary image that indicates the defect region. This is only available for test.json, because there is no defect image during training. Citation If you use the LTAD dataset in your research, please cite our contribution: @InProceedings{Ho_2024_CVPR, author = {Ho, Chih-Hui and Peng, Kuan-Chuan and Vasconcelos, Nuno}, title = {Long-Tailed Anomaly Detection with Learnable Class Names}, booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2024} } License The LTAD dataset is released under CC-BY-SA-4.0 license. For the images in the MVTec, VisA, and DAGM datasets, please refer to their websites for their copyright and license terms. Created by Mitsubishi Electric Research Laboratories (MERL), 2023-2024 SPDX-License-Identifier: CC-BY-SA-4.0

Keywords

• long-tailed distribution, • anomaly detection, • industrial inspection

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average