Active learning with error-correcting output codes

descriptionPublicationkeyboard_double_arrow_right Article 01 Oct 2019 English Publisher:Elsevier BVJournal:Neurocomputing, volume 364, pages 182-191 (issn: 0925-2312,

Copyright policy )

Authors: Shilin Gu; Yang Cai; Jincheng Shan; Chenping Hou;

doi: 10.1016/j.neucom.2019.06.064

Active learning with error-correcting output codes

- Summary
- Metrics

Abstract

Abstract In many real-world classification problems, while there is a large amount of unlabeled data, labeled data is usually hard to acquire. One way to solve these problems is active learning. It aims to select the most valuable instances for labeling and construct a superior classifier. Most existing active learning algorithms are designed for binary classification problems, only a few algorithms can deal with multi-class cases. Moreover, as most multi-class active learning methods are directly extended from binary active learning methods, it is difficult for them to fuse the output results of binary cases. In this paper, we propose a novel multi-class active learning algorithm to tackle the above problems and select the most informative instances, called active learning with error-correcting output codes (ECOCAL). We create a codeword for each class and then obtain a test code for each unlabeled instance by error-correcting output codes (ECOC) framework, which is a powerful tool to combine multiple binary classifiers to address multi-class classification problems. By calculating the variance of the distance between a test code and all codewords, the proposed algorithm is able to measure the uncertainty across multiple classes. Extensive experimental results show that the proposed method outperforms several state-of-the-art active learning methods on both binary and multi-class datasets.

Related Organizations

National University of Defense Technology
China (People's Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	5
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%