
New application domains generate data which are no longer persistent but volatile: network management, web profile modeling, etc. These data arrive quickly and massively, and are visible only once. They must therefore be learnt in their order of arrival. For classification problems, online decision trees are known to perform well and are widely used on streaming data. In this paper, we propose a new decision tree method based on order statistics. The construction of an online tree usually requires summaries in the leaves. Our solution uses bounded-error quantile summaries. A robust and efficient discretization or grouping method uses these summaries to provide, at the same time, a criterion to find the best split and better density estimations. This estimation is then used to build a naïve Bayes classifier in the leaves to improve the prediction in the early learning stage.
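To make the leaf-level idea concrete, here is a minimal Python sketch. It assumes a simplified fixed-capacity summary as a stand-in for the paper's bounded-error quantile summaries, and it does not reproduce the paper's discretization/grouping split criterion; the class names SimpleQuantileSummary and NaiveBayesLeaf, and all parameters such as capacity and bins, are hypothetical illustrations.

```python
# Minimal sketch (assumption-based): per-(class, feature) summaries kept in a
# leaf, used to estimate densities for a naive Bayes prediction in that leaf.
import bisect
import math
from collections import defaultdict


class SimpleQuantileSummary:
    """Fixed-capacity sorted sample; a simplified stand-in for a
    bounded-error quantile summary (e.g. a GK-style structure)."""

    def __init__(self, capacity=100):
        self.capacity = capacity
        self.values = []

    def insert(self, x):
        bisect.insort(self.values, x)
        if len(self.values) > self.capacity:
            # Keep every other element to respect the memory bound.
            self.values = self.values[::2]

    def density(self, x, bins=10):
        """Rough density estimate from the summary's empirical distribution."""
        if len(self.values) < 2:
            return 1e-9
        lo, hi = self.values[0], self.values[-1]
        if hi == lo:
            return 1.0 if x == lo else 1e-9
        width = (hi - lo) / bins
        left = lo + int((x - lo) / width) * width
        in_bin = sum(1 for v in self.values if left <= v < left + width)
        return max(in_bin / (len(self.values) * width), 1e-9)


class NaiveBayesLeaf:
    """Leaf of an online tree: one summary per (class, feature) pair."""

    def __init__(self, n_features):
        self.n_features = n_features
        self.class_counts = defaultdict(int)
        self.summaries = defaultdict(SimpleQuantileSummary)  # key: (class, feature)

    def learn_one(self, x, y):
        self.class_counts[y] += 1
        for j, v in enumerate(x):
            self.summaries[(y, j)].insert(v)

    def predict_one(self, x):
        total = sum(self.class_counts.values())
        best, best_score = None, float("-inf")
        for c, n_c in self.class_counts.items():
            # log prior + sum of log conditional densities (naive Bayes).
            score = math.log(n_c / total)
            for j, v in enumerate(x):
                score += math.log(self.summaries[(c, j)].density(v))
            if score > best_score:
                best, best_score = c, score
        return best
```

A usage pattern would be to call learn_one on each arriving example routed to the leaf and predict_one for classification requests, which mirrors the single-pass, order-of-arrival constraint described in the abstract.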
[INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG]
