Machine Learning Algorithm : decision tree - Part 9 of 12

Abhay Kumar

Published Dec 11, 2017

Classification and Regression Tree(CART)

Classification and regression trees are machine-learning methods for constructing prediction models from data. The models are obtained by recursively partitioning the data space and fitting a simple prediction model within each partition. As a result, the partitioning can be represented graphically as a decision tree. Classification trees are designed for dependent variables that take a finite number of unordered values, with prediction error measured in terms of misclassification cost. Regression trees are for dependent variables that take continuous or ordered discrete values, with prediction error typically measured by the squared difference between the observed and predicted values.

For example, consider the widely referenced Iris data classification problem introduced by Fisher . The data file Irisdat reports the lengths and widths of sepals and petals of three types of irises (Setosa, Versicol, and Virginic). The purpose of the analysis is to learn how we can discriminate between the three types of flowers, based on the four measures of width and length of petals and sepals. Discriminant function analysis will estimate several linear combinations of predictor variables for computing classification scores (or probabilities) that allow the user to determine the predicted classification for each observation. A classification tree will determine a set of logical if-then conditions (instead of linear equations) for predicting or classifying cases instead:

The interpretation of this tree is straightforward: If the petal width is less than or equal to 0.8, the respective flower would be classified as Setosa; if the petal width is greater than 0.8 and less than or equal to 1.75, then the respective flower would be classified as Versicol; else, it belongs to class Virginic.

To view or add a comment, sign in

Machine Learning Algorithm : decision tree - Part 9 of 12

Abhay Kumar

Classification and Regression Tree(CART)

More articles by Abhay Kumar

Explore content categories

Classification and Regression Tree(CART)

More articles by Abhay Kumar

SAS Middle Tier Web Process

Machine Learning Algorithm : bayesian - Part 8 of 12

Machine Learning Algorithm : ensemble (part 7 of 12)

Machine Learning Algorithm : Associated Rule (Part 6 of 12)

Machine Learning Algorithm - Deep Learning (Part 5 of 12)

Machine Learning Algorithm - Dimensionality Reduction (Part 4 of 12)

MACHINE LEARNING ALGORITHMS - Instance Based (CAKE BASED, MEMORY BASED) Part 3 of 12

Machine Learning Algorithm - Regularization Part 2 of 12

MACHINE LEARNING ALGORITHMS - Regression - Part 1 of 12

Big Data A Real Challenge

Explore content categories