Semi-supervised Long-tail Endoscopic Image Classification

Run-Nan, Cao; Meng-Jie, Fang; Hai-Ling, Li; Jie, Tian; Di, Dong

Chinese Medical Sciences Journal

Article . 2022 . Peer-reviewed

Data sources: Crossref, Europe PubMed Central

Publication: Article, 01 Jan 2022, English
Publisher: Chinese Medical Sciences Journal
Journal: Chinese Medical Sciences Journal, volume 37, pages 171-180 (ISSN: 1001-9294)

doi: 10.24920/004135

pmid: 36321172

Abstract

Objective To explore the semi-supervised learning (SSL) algorithm for long-tail endoscopic image classification with limited annotations.

Method We explored semi-supervised long-tail endoscopic image classification on HyperKvasir, the largest public gastrointestinal dataset, which has 23 diverse classes. The semi-supervised learning algorithm FixMatch, which is based on consistency regularization and pseudo-labeling, was applied. After splitting the dataset into training and test sets at a ratio of 4:1, we sampled 20%, 50%, and 100% of the labeled training data to test classification with limited annotations.

Results Classification performance was evaluated by micro-average and macro-average metrics, with the Matthews correlation coefficient (MCC) as the overall measure. The SSL algorithm improved classification performance, with MCC increasing from 0.8761 to 0.8850, from 0.8983 to 0.8994, and from 0.9075 to 0.9095 with 20%, 50%, and 100% of the labeled training data, respectively. With 20% of the labeled training data, SSL improved both micro-average and macro-average performance; with 50% and 100%, SSL improved micro-average performance but hurt macro-average performance. By analyzing the confusion matrix and the labeling bias in each class, we found that the pseudo-labeling-based SSL algorithm exacerbated the classifier's preference for the head classes, resulting in improved performance on the head classes and degraded performance on the tail classes.

Conclusion SSL can improve classification performance for semi-supervised long-tail endoscopic image classification, especially when labeled data is extremely limited, which may benefit the building of assisted diagnosis systems for low-volume hospitals. However, the pseudo-labeling strategy may amplify the effect of class imbalance, which hurts classification performance on the tail classes.
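The gap between micro-average and macro-average results reported in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the abstract does not say which per-class metric was averaged, so recall is assumed here, and the two-class toy data, the classifier names, and all function names are hypothetical. The MCC function uses the standard multiclass generalization of the Matthews correlation coefficient.

```python
from collections import Counter
from math import sqrt


def per_class_recall(y_true, y_pred):
    """Return {class: recall} for every class that appears in y_true."""
    support = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: hits[c] / support[c] for c in support}


def micro_recall(y_true, y_pred):
    """Pool all samples, so frequent (head) classes dominate the score."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_recall(y_true, y_pred):
    """Average the per-class recalls, so every class counts equally."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


def multiclass_mcc(y_true, y_pred):
    """Multiclass Matthews correlation coefficient."""
    s = len(y_true)                                   # total samples
    c = sum(t == p for t, p in zip(y_true, y_pred))   # correct predictions
    t_k = Counter(y_true)                             # true count per class
    p_k = Counter(y_pred)                             # predicted count per class
    classes = set(t_k) | set(p_k)
    cov_tp = c * s - sum(t_k[k] * p_k[k] for k in classes)
    cov_tt = s * s - sum(t_k[k] ** 2 for k in classes)
    cov_pp = s * s - sum(p_k[k] ** 2 for k in classes)
    denom = sqrt(cov_tt * cov_pp)
    return cov_tp / denom if denom else 0.0


# Toy long-tail data, made up for illustration: 90 head and 10 tail images.
y_true = ["head"] * 90 + ["tail"] * 10
# Hypothetical classifier A: 9 head errors, 2 tail errors.
pred_a = ["head"] * 81 + ["tail"] * 9 + ["tail"] * 8 + ["head"] * 2
# Hypothetical classifier B, more head-biased: 2 head errors, 5 tail errors.
pred_b = ["head"] * 88 + ["tail"] * 2 + ["tail"] * 5 + ["head"] * 5

for name, pred in (("A", pred_a), ("B", pred_b)):
    print(name,
          "micro:", round(micro_recall(y_true, pred), 4),
          "macro:", round(macro_recall(y_true, pred), 4),
          "MCC:", round(multiclass_mcc(y_true, pred), 4))
```

In this toy data, the head-biased classifier B has the higher micro-average recall (0.93 versus 0.89) but the lower macro-average recall (about 0.74 versus 0.85), which is the same direction of effect the abstract attributes to pseudo-labeling at the 50% and 100% label ratios.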

Related Organizations

Institute of Automation
China (People's Republic of)
Beihang University
China (People's Republic of)
Xidian University
China (People's Republic of)
University of Chinese Academy of Sciences
China (People's Republic of)
State Key Laboratory of Complex System Management and Control
China (People's Republic of)

Keywords

Supervised Machine Learning, Algorithms

Impact by BIP!

- Selected citations: 1. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.