Journal article
Performance of a localized tree splitting criterion in tree averaging
Australian & New Zealand Journal of Statistics, Vol.46(4), pp.583-599
2004
Abstract
This paper explores the performance of the local splitting criterion devised by Bremner & Taplin for classification and regression trees when multiple trees are averaged to improve performance. The criterion is compared with the deviance used by Clark & Pregibon's method, which is a global splitting criterion typically used to grow trees. The paper considers multiple trees generated by randomly selecting splits with probability proportional to the likelihood for the split, and by bagging where bootstrap samples from the data are used to grow trees. The superiority of the localized splitting criterion often persists when multiple trees are grown and averaged for six datasets.
Tree averaging is known to be advantageous when the trees being averaged produce different predictions, and this can be achieved by choosing splits where the splitting criterion is locally optimal. The paper shows that use of locally optimal splits gives promising results in conjunction with both local and global splitting criteria, and with and without random selection of splits. The paper also extends the local splitting criterion to accommodate categorical predictors.
Details
- Title
- Performance of a localized tree splitting criterion in tree averaging
- Authors/Creators
- A.P. Bremner (Author/Creator) - Murdoch UniversityR.H. Taplin (Author/Creator) - Murdoch University
- Publication Details
- Australian & New Zealand Journal of Statistics, Vol.46(4), pp.583-599
- Publisher
- Blackwell Publishing Inc.
- Identifiers
- 991005544910707891
- Murdoch Affiliation
- Murdoch University
- Language
- English
- Resource Type
- Journal article
Metrics
41 Record Views
InCites Highlights
These are selected metrics from InCites Benchmarking & Analytics tool, related to this output
- Citation topics
- 4 Electrical Engineering, Electronics & Computer Science
- 4.61 Artificial Intelligence & Machine Learning
- 4.61.145 Classification Algorithms
- Web Of Science research areas
- Statistics & Probability
- ESI research areas
- Mathematics