Code for: Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection

Chaudhry, Zan; Rotenberg, Noam; Caffo, Brian; Jones, Craig; Sair, Haris

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Software . 2026

Data sources: ZENODO

ZENODO

Software . 2026

Data sources: Datacite

ZENODO

Software . 2026

Data sources: Datacite

Code for: Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection

integration_instructionsResearch softwarekeyboard_double_arrow_right Software 11 Jan 2026Publisher:Zenodo

Authors: Chaudhry, Zan; Rotenberg, Noam; Caffo, Brian; Jones, Craig; Sair, Haris;

doi: 10.5281/zenodo.18210899 , 10.5281/zenodo.18210898

Code for: Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection

- Summary
- Metrics

Abstract

This Zenodo repository contains the code and data outputs used to generate the results found in the paper "Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection", which can be found here: doi:10.48550/arXiv.2601.10084. Adaptive Label Error Detection (ALED) is also implemented in the statlab Python package, which can be found on pypi here (or through Github, here). The jupyter notebook files provided correspond to the data displayed in the tables of the paper as follows: Table I ("PRETRAINED DENSENET121 WITH 5% MISLABELING") and Figure 1: DenseNet-Pretrained.ipynb Table II ("PRETRAINED RESNET50 WITH 5% MISLABELING") and Figure 2: ResNet-Pretrained.ipynb Table III ("RANDOMLY INITIALIZED DENSENET121 WITH 5% MISLABELING") and Figure 3: DenseNet-Untrained.ipynb Table IV ("RANDOMLY INITIALIZED RESNET50 WITH 5% MISLABELING") and Figure 4: ResNet-Untrained.ipynb Figure 5 (varying percentage of mislabeled samples): ResNet-Pretrained_Mislabel_Percentage.ipynb Figure 6 (full ALED workflow: applying ALED algorithm to identify mislabeled classes, then removing them and retraining): ResNet-Pretrained_Full_Workflow.ipynb Table V ("VARYING ALED PROJECTION NUMBER, FIXED ENSEMBLE NUMBER (10) AND LIKELIHOOD RATIO (2)"); Table VI ("VARYING ALED NUMBER OF ENSEMBLES, FIXED PROJECTION NUMBER (2) AND LIKELIHOOD RATIO (2)"); and Table VII ("VARYING ALED LIKELIHOOD RATIO, FIXED PROJECTION NUMBER (2) AND ENSEMBLE NUMBER (10)"): ResNet-Pretrained_HyperParams.ipynb All outputs of the jupyter notebooks (as shown in the last block of each file, if applicable) is provided in results.zip. To reproduce these results, the Python environment file ALED_environment.yml is provided.

Related Organizations

Johns Hopkins Medicine
United States
Johns Hopkins University
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average