Regression-Enhanced Random Forests

Date
2017-01-01
Authors
Nettleton, Dan (Department Chair and Distinguished Professor)
Zhu, Zhengyuan (Director of the Center for Survey Statistics and Methodology and Professor)
Department
Statistics
Abstract

Random forest (RF) methodology is one of the most popular machine learning techniques for prediction problems. In this article, we discuss some cases where random forests may perform poorly and propose a novel generalized RF method, regression-enhanced random forests (RERFs), that can improve on RFs by borrowing the strength of penalized parametric regression. We describe the algorithm for constructing RERFs and for selecting their tuning parameters. Both a simulation study and real-data examples show that RERFs have better predictive performance than RFs in important situations often encountered in practice. Moreover, RERFs can incorporate known relationships between the response and the predictors, and may give reliable predictions in extrapolation problems, where predictions are required at points outside the domain of the training dataset. Strategies analogous to those described here can be used to improve other machine learning methods by combining them with penalized parametric regression techniques.
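
The extrapolation point in the abstract can be illustrated with a small sketch. The abstract does not spell out how the penalized regression and the forest are combined, so the two-stage scheme below (fit a penalized linear model, then fit a forest to its residuals and add the two predictions) is an assumption made for illustration, not a statement of the authors' algorithm. The ridge penalty, the single predictor, the bagged ensemble of small regression trees standing in for a random forest, and every function name and tuning value are likewise hypothetical.

```python
import random

def fit_ridge_1d(x, y, lam):
    """Fit y ~ intercept + slope * x with an L2 penalty lam on the slope only."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sxy / (sxx + lam)
    return ybar - slope * xbar, slope

def sse(vals):
    """Sum of squared deviations from the mean."""
    m = sum(vals) / len(vals)
    return sum((v - m) ** 2 for v in vals)

def fit_tree(x, y, depth):
    """Tiny one-predictor regression tree; a leaf is the mean response."""
    if depth == 0 or len(y) < 4:
        return sum(y) / len(y)
    pairs = sorted(zip(x, y))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i][0] == pairs[i - 1][0]:
            continue  # cannot split between identical x values
        cost = sse([p[1] for p in pairs[:i]]) + sse([p[1] for p in pairs[i:]])
        if best is None or cost < best[0]:
            best = (cost, i)
    if best is None:
        return sum(y) / len(y)
    i = best[1]
    cut = (pairs[i - 1][0] + pairs[i][0]) / 2
    left, right = pairs[:i], pairs[i:]
    return (cut,
            fit_tree([p[0] for p in left], [p[1] for p in left], depth - 1),
            fit_tree([p[0] for p in right], [p[1] for p in right], depth - 1))

def predict_tree(tree, x0):
    """Walk down the tree until a leaf (a plain number) is reached."""
    while isinstance(tree, tuple):
        cut, left, right = tree
        tree = left if x0 <= cut else right
    return tree

def fit_forest(x, y, n_trees, depth, rng):
    """Bagging: each tree is grown on a bootstrap resample of the data.
    With a single predictor there is no feature subsampling to do."""
    n = len(x)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        forest.append(fit_tree([x[i] for i in idx], [y[i] for i in idx], depth))
    return forest

def predict_forest(forest, x0):
    return sum(predict_tree(t, x0) for t in forest) / len(forest)

# Synthetic data (made up for this sketch): a linear trend observed on [0, 1].
rng = random.Random(0)
x = [rng.uniform(0.0, 1.0) for _ in range(80)]
y = [3.0 * xi + rng.gauss(0.0, 0.1) for xi in x]
x_new, truth = 2.0, 6.0  # a point outside the training domain

# Forest alone: predictions are averages of training responses.
rf_only = predict_forest(fit_forest(x, y, 20, 3, rng), x_new)

# Assumed two-stage combination: penalized regression, then a forest on residuals.
intercept, slope = fit_ridge_1d(x, y, lam=0.1)
resid = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
resid_forest = fit_forest(x, resid, 20, 3, rng)
rerf = intercept + slope * x_new + predict_forest(resid_forest, x_new)

print("truth:", truth, "forest only:", rf_only, "regression + forest:", rerf)
```

Because every tree prediction is an average of observed responses, the forest-only prediction cannot exceed the largest training response, whereas the parametric component can carry the fitted trend beyond the training range. That is the kind of situation the abstract refers to, under the assumptions stated above.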

Comments

This proceeding is published as Zhang, H., Nettleton, D., Zhu, Z. (2017). Regression-enhanced random forests. In JSM Proceedings, Section on Statistical Learning and Data Science. Alexandria, VA: American Statistical Association. 636–647. Posted with permission.

Copyright
2017