Published January 1, 2011 | Version v1
Journal article Open

Model selection in omnivariate decision trees using Structural Risk Minimization

Description

As opposed to trees that use a single type of decision node, an omnivariate decision tree contains nodes of different types. We propose to use Structural Risk Minimization (SRM) to choose between node types in omnivariate decision tree construction to match the complexity of a node to the complexity of the data reaching that node. In order to apply SRM for model selection, one needs the VC-dimension of the candidate models. In this paper, we first derive the VC-dimension of the univariate model, and estimate the VC-dimension of all three models (univariate, linear multivariate or quadratic multivariate) experimentally. Second, we compare SRM with other model selection techniques including Akaike's Information Criterion (AIC), Bayesian Information Criterion (BIC) and cross-validation (CV) on standard datasets from the UCI and Delve repositories. We see that SRM induces omnivariate trees that have a small percentage of multivariate nodes close to the root and they generalize more or at least as accurately as those constructed using other model selection techniques. (C) 2011 Published by Elsevier Inc.

Files

bib-89aefc83-1d7d-4bd1-b5bb-57fb9f3485eb.txt

Files (142 Bytes)

Name Size Download all
md5:3e8283941d92b0cbcb38ca3c7b2dcf6c
142 Bytes Preview Download