
AbstractTraining of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available datasets of dermatoscopic images. We tackle this problem by releasing the HAM10000 (“Human Against Machine with 10000 training images”) dataset. We collected dermatoscopic images from different populations acquired and stored by different modalities. Given this diversity we had to apply different acquisition and cleaning methods and developed semi-automatic workflows utilizing specifically trained neural networks. The final dataset consists of 10015 dermatoscopic images which are released as a training set for academic machine learning purposes and are publicly available through the ISIC archive. This benchmark dataset can be used for machine learning and for comparisons with human experts. Cases include a representative collection of all important diagnostic categories in the realm of pigmented lesions. More than 50% of lesions have been confirmed by pathology, while the ground truth for the rest of the cases was either follow-up, expert consensus, or confirmation by in-vivo confocal microscopy.
Statistics and Probability, FOS: Computer and information sciences, Data Descriptor, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Dermoscopy, Seborrheic keratosis, Library and Information Sciences, 1710 Information Systems, Skin Diseases, Lentigo maligna, Education, Diagnosis, 1706 Computer Science Applications, Image Processing, Computer-Assisted, Humans, 2613 Statistics and Probability, Accuracy, 006, Facial lesions, Computer Science Applications, 1804 Statistics, Probability and Uncertainty, 3309 Library and Information Sciences, Actinic keratosis, Statistics, Probability and Uncertainty, Pigmentation Disorders, 3304 Education, Information Systems
Statistics and Probability, FOS: Computer and information sciences, Data Descriptor, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Dermoscopy, Seborrheic keratosis, Library and Information Sciences, 1710 Information Systems, Skin Diseases, Lentigo maligna, Education, Diagnosis, 1706 Computer Science Applications, Image Processing, Computer-Assisted, Humans, 2613 Statistics and Probability, Accuracy, 006, Facial lesions, Computer Science Applications, 1804 Statistics, Probability and Uncertainty, 3309 Library and Information Sciences, Actinic keratosis, Statistics, Probability and Uncertainty, Pigmentation Disorders, 3304 Education, Information Systems
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3K | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.01% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.01% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 0.01% |
