An Empirical Comparison of Pruning Methods for Decision Tree Induction

Machine Learning - Tập 4 - Trang 227-243 - 1989
John Mingers1
1School of Industrial and Business Studies, University of Warwick, Coventry, England

Tóm tắt

This paper compares five methods for pruning decision trees, developed from sets of examples. When used with uncertain rather than deterministic data, decision-tree induction involves three main stages—creating a complete tree able to classify all the training examples, pruning this tree to give statistical reliability, and processing the pruned tree to improve understandability. This paper concerns the second stage—pruning. It presents empirical comparisons of the five methods across several domains. The results show that three methods—critical value, error complexity and reduced error—perform well, while the other two may cause problems. They also show that there is no significant interaction between the creation and pruning methods.

Tài liệu tham khảo