descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 02 Oct 2015 English Publisher:Informa UK LimitedJournal:Journal of the American Statistical Association, volume 110, pages 1,770-1,784 (issn: 0162-1459, eissn: 1537-274X,

Authors: Ruoqing Zhu; Michael R. Kosorok; Donglin Zeng;

doi: 10.1080/01621459.2015.1036994 , 10.6084/m9.figshare.1381894 , 10.17615/v9dj-6b88 , 10.6084/m9.figshare.1381894.v1

pmid: 26903687

pmc: PMC4760114

Reinforcement Learning Trees

- Summary
- Metrics

Abstract

In this article, we introduce a new type of tree-based method, reinforcement learning trees (RLT), which exhibits significantly improved performance over traditional methods such as random forests (Breiman 2001) under high-dimensional settings. The innovations are three-fold. First, the new method implements reinforcement learning at each selection of a splitting variable during the tree construction processes. By splitting on the variable that brings the greatest future improvement in later splits, rather than choosing the one with largest marginal effect from the immediate split, the constructed tree uses the available samples in a more efficient way. Moreover, such an approach enables linear combination cuts at little extra computational cost. Second, we propose a variable muting procedure that progressively eliminates noise variables during the construction of each individual tree. The muting procedure also takes advantage of reinforcement learning and prevents noise variables from being considered in the search for splitting rules, so that toward terminal nodes, where the sample size is small, the splitting rules are still constructed from only strong variables. Last, we investigate asymptotic properties of the proposed method under basic assumptions and discuss rationale in general settings. Supplementary materials for this article are available online.

Related Organizations

University of North Carolina at Chapel Hill
United States

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	128
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%