MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

Keywords: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Qihao Zhao; Chen Jiang; Wei Hu; Fan Zhang; Jun Liu

arXiv.org e-Print Archive

Preprint . 2023

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1109/iccv51...

Article . 2023 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2023

License: CC BY

Data sources: Datacite

DBLP

Conference object

Data sources: DBLP

DBLP

Article

Data sources: DBLP

Publication: Article, Preprint, Conference object. 01 Oct 2023. Embargo end date: 01 Jan 2023. Publisher: IEEE. Journal: 2023 IEEE/CVF International Conference on Computer Vision (ICCV)

doi: 10.1109/iccv51070.2023.01065, 10.48550/arxiv.2308.09922

arXiv: 2308.09922

Abstract

Recently, multi-expert methods have led to significant improvements in long-tailed recognition (LTR). We identify two aspects that need further enhancement to boost LTR: (1) more diverse experts; (2) lower model variance. Previous methods did not handle either well. To this end, we propose More Diverse experts with Consistency Self-distillation (MDCS) to bridge the gap left by earlier methods. Our MDCS approach consists of two core components: Diversity Loss (DL) and Consistency Self-distillation (CS). In detail, DL promotes diversity among experts by controlling their focus on different categories. To reduce the model variance, CS uses KL divergence to distill the richer knowledge of weakly augmented instances for the experts' self-distillation. In particular, we design Confident Instance Sampling (CIS) to select the correctly classified instances for CS, avoiding biased or noisy knowledge. In the analysis and ablation study, we demonstrate that, compared with previous work, our method effectively increases the diversity of experts, significantly reduces the variance of the model, and improves recognition accuracy. Moreover, DL and CS are mutually reinforcing and coupled: the diversity of experts benefits from CS, and CS cannot achieve remarkable results without DL. Experiments show that MDCS outperforms the state of the art by 1% to 2% on five popular long-tailed benchmarks: CIFAR10-LT, CIFAR100-LT, ImageNet-LT, Places-LT, and iNaturalist 2018. The code is available at https://github.com/fistyee/MDCS.
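
As a rough illustration of the Consistency Self-distillation idea described in the abstract, the following pure-Python sketch computes a KL-divergence consistency term between two views of the same instances and keeps only the instances that a Confident Instance Sampling style filter accepts. It is not the authors' implementation (see the linked repository for that). The function names, the use of a second "strong" view as the student, the choice to test correctness on the weak view, and the `temperature` parameter are all assumptions made for this example.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def confident_mask(weak_logits_batch, labels):
    """Hypothetical CIS filter: keep an instance only if the prediction on
    its weakly augmented view matches the ground-truth label (assumption)."""
    mask = []
    for logits, label in zip(weak_logits_batch, labels):
        predicted = max(range(len(logits)), key=logits.__getitem__)
        mask.append(predicted == label)
    return mask


def consistency_loss(weak_logits_batch, strong_logits_batch, labels,
                     temperature=1.0):
    """Mean KL(teacher || student) over the confident instances.

    Teacher: softmax of the weak-view logits (treated as a fixed target).
    Student: softmax of the other view's logits (assumed here to come from
    a stronger augmentation of the same instance).
    Returns 0.0 when no instance passes the filter.
    """
    mask = confident_mask(weak_logits_batch, labels)
    losses = []
    for keep, weak, strong in zip(mask, weak_logits_batch, strong_logits_batch):
        if not keep:
            continue
        teacher = softmax(weak, temperature)
        student = softmax(strong, temperature)
        losses.append(kl_divergence(teacher, student))
    return sum(losses) / len(losses) if losses else 0.0
```

In a multi-expert setting one would presumably evaluate such a term per expert and add it to the classification loss; how the terms are weighted and combined is not stated in the abstract and is left out of the sketch.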

Accepted at ICCV 2023. 13 pages

Related Organizations

Beijing University of Chemical Technology
China (People's Republic of)
Singapore University of Technology and Design
Singapore

Related research

MDCS software on GitHub
IsRelatedTo

Impact by BIP!

- selected citations: 7. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Open Access route: Green
