Post-processing Evolved Decision TreesShow others and affiliations
2009 (English)In: Foundations of Computational Intelligence / [ed] Ajith Abraham, Springer Verlag , 2009, p. 149-164Chapter in book (Other academic)
Abstract [en]
Although Genetic Programming (GP) is a very general technique, it is also quite powerful. As a matter of fact, GP has often been shown to outperform more specialized techniques on a variety of tasks. In data mining, GP has successfully been applied to most major tasks; e.g. classification, regression and clustering. In this chapter, we introduce, describe and evaluate a straightforward novel algorithm for post-processing genetically evolved decision trees. The algorithm works by iteratively, one node at a time, search for possible modifications that will result in higher accuracy. More specifically, the algorithm, for each interior test, evaluates every possible split for the current attribute and chooses the best. With this design, the post-processing algorithm can only increase training accuracy, never decrease it. In the experiments, the suggested algorithm is applied to GP decision trees, either induced directly from datasets, or extracted from neural network ensembles. The experimentation, using 22 UCI datasets, shows that the suggested post-processing technique results in higher test set accuracies on a large majority of the datasets. As a matter of fact, the increase in test accuracy is statistically significant for one of the four evaluated setups, and substantial on two out of the other three.
Place, publisher, year, edition, pages
Springer Verlag , 2009. p. 149-164
Keywords [en]
decision trees, genetic programming, Machine learning
Keywords [sv]
data mining
National Category
Computer and Information Sciences Computer and Information Sciences
Identifiers
URN: urn:nbn:se:hb:diva-4926DOI: 10.1007/978-3-642-01088-0Local ID: 2320/5721ISBN: 978-3-642-01087-3 (print)OAI: oai:DiVA.org:hb-4926DiVA, id: diva2:884344
2015-12-172015-12-172020-01-29Bibliographically approved