Multivariate random forests

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2011 English Publisher:WileyJournal:WIREs Data Mining and Knowledge Discovery, volume 1, pages 80-87 (issn: 1942-4787, eissn: 1942-4795,

Copyright policy )

Authors: Mark R. Segal; Yuanyuan Xiao;

doi: 10.1002/widm.12

Multivariate random forests

- Summary
- Metrics

Abstract

AbstractRandom forests have emerged as a versatile and highly accurate classification and regression methodology, requiring little tuning and providing interpretable outputs. Here, we briefly outline the genesis of, and motivation for, the random forest paradigm as an outgrowth from earlier tree‐structured techniques. We elaborate on aspects of prediction error and attendant tuning parameter issues. However, our emphasis is on extending the random forest schema to the multiple response setting. We provide a simple illustrative example from ecology that showcases the improved fit and enhanced interpretation afforded by the random forest framework. © 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011 1 80‐87 DOI: 10.1002/widm.12This article is categorized under: Algorithmic Development > Hierarchies and Trees Algorithmic Development > Ensemble Methods Technologies > Machine Learning Technologies > Prediction

Related Organizations

University of California, San Francisco
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	185
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%