A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2017 English Publisher:WileyJournal:Complexity, volume 2,017, pages 1-17 (issn: 1076-2787, eissn: 1099-0526,

Copyright policy )

Authors: Joaquín Abellán; Javier G. Castellano; Carlos Javier Mantas;

doi: 10.1155/2017/9023970

A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

- Summary
- Subjects
- Metrics

Abstract

The knowledge extraction from data with noise or outliers is a complex problem in the data mining area. Normally, it is not easy to eliminate those problematic instances. To obtain information from this type of data, robust classifiers are the best option to use. One of them is the application of bagging scheme on weak single classifiers. The Credal C4.5 (CC4.5) model is a new classification tree procedure based on the classical C4.5 algorithm and imprecise probabilities. It represents a type of the so-calledcredal trees. It has been proven that CC4.5 is more robust to noise than C4.5 method and even than other previous credal tree models. In this paper, the performance of the CC4.5 model in bagging schemes on noisy domains is shown. An experimental study on data sets with added noise is carried out in order to compare results where bagging schemes are applied on credal trees and C4.5 procedure. As a benchmark point, the known Random Forest (RF) classification method is also used. It will be shown that the bagging ensemble using pruned credal trees outperforms the successful bagging C4.5 and RF when data sets with medium-to-high noise level are classified.

Related Organizations

University of Granada
Spain

Keywords

CC4.5 model, Electronic computers. Computer science, Random Forest (RF) classification, Learning and adaptive systems in artificial intelligence, knowledge extraction, QA75.5-76.95, data mining, robust classifiers, Theory of languages and software systems (knowledge-based systems, expert systems, etc.) for artificial intelligence

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	5
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

5

Top 10%

Average

gold

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering