文章基本信息

标题：Variable importance in binary regression trees and forests
作者：Hemant Ishwaran
期刊名称：Electronic Journal of Statistics
印刷版ISSN：1935-7524
出版年度：2007
卷号：1
页码：519-537
出版社：Institute of Mathematical Statistics
摘要：We characterize and study variable importance (VIMP) and pairwise variable associations in binary regression trees. A key component involves the node mean squared error for a quantity we refer to as a maximal subtree. The theory naturally extends from single trees to ensembles of trees and applies to methods like random forests. This is useful because while importance values from random forests are used to screen variables, for example they are used to filter high throughput genomic data in Bioinformatics, very little theory exists about their properties.