How to do it...

Perform the following steps to split the churn dataset into training and testing datasets:

Load the rpart package:

        > library(rpart)

Use the rpart function to build a classification tree model:

        > churn.rp = rpart(churn ~ ., data=trainset)

Type churn.rp to retrieve the node detail of the classification tree:

        > churn.rp

Next, use the printcp function to examine the complexity parameter:

        > printcp(churn.rp)
        Output

        Classification tree:
        rpart(formula = churn ~ ., data = trainset)

        Variables actually used in tree construction:
        [1] international_plan number_customer_service_calls
        [3] total_day_minutes total_eve_minutes 
        [5] total_intl_calls total_intl_minutes 
        [7] voice_mail_plan 

        Root node error: 342/2315 = 0.14773

        n= 2315 

                CP nsplit rel error xerror xstd
        1 0.076023 0 1.00000 1.00000 0.049920
        2 0.074561 2 0.84795 0.99708 0.049860
        3 0.055556 4 0.69883 0.76023 0.044421
        4 0.026316 7 0.49415 0.52632 0.037673
        5 0.023392 8 0.46784 0.52047 0.037481
        6 0.020468 10 0.42105 0.50877 0.037092
        7 0.017544 11 0.40058 0.47076 0.035788
        8 0.010000 12 0.38304 0.47661 0.035993

Next, use the plotcp function to plot the cost complexity parameters:

        > plotcp(churn.rp)

The cost complexity parameter plot

Lastly, use the summary function to examine the built model:

        > summary(churn.rp)

Table of Contents for How to do it...

Create new playlist

Sign In

Sign Up

Table of Contents for
How to do it...