Apply Model to Test Data Test Data Refund marital Taxable Status Income Cheat Married 80K Refund Yes NO MasT Single, Dworced Married TaxIng NO <80K >80K NO YES C Tan, Steinbach, Kumar Introduction to Data Mining 18/2004
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 ‹#› Apply Model to Test Data Refund MarSt TaxInc NO YES NO NO Yes No Single, Divorced Married < 80K > 80K Refund Marital Status Taxable Income Cheat No Married 80K ? 10 Test Data
Apply Model to Test Data Test Data Refund marital Taxable Status Income Cheat Married 80K Refund Yes NO MasT Single, Dworced Married TaxIng NO <80K >80K NO YES C Tan, Steinbach, Kumar Introduction to Data Mining 18/2004
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 ‹#› Apply Model to Test Data Refund MarSt TaxInc NO YES NO NO Yes No Single, Divorced Married < 80K > 80K Refund Marital Status Taxable Income Cheat No Married 80K ? 10 Test Data
Apply Model to Test Data Test Data Refund Marital Taxable Status Income Cheat No Married 80K Refund Yes NO MasT Single, Dworced Married TaxIng NO <80K >80K NO YES C Tan, Steinbach, Kumar Introduction to Data Mining 18/2004
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 ‹#› Apply Model to Test Data Refund MarSt TaxInc NO YES NO NO Yes No Single, Divorced Married < 80K > 80K Refund Marital Status Taxable Income Cheat No Married 80K ? 10 Test Data
Apply Model to Test Data Test Data Refund Marital Taxable Status Income Cheat No Married 80K Refund Yes NO MasT Single, Dworced Married Assign Cheat to“No” TaxIng NO <80K >80K NO YES C Tan, Steinbach, Kumar Introduction to Data Mining 18/2004
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 ‹#› Apply Model to Test Data Refund MarSt TaxInc NO YES NO NO Yes No Single, Divorced Married < 80K > 80K Refund Marital Status Taxable Income Cheat No Married 80K ? 10 Test Data Assign Cheat to “No
Decision tree classification task Tree Tid Attrib1 Attrib2 Attrib3 Class 125K Induction Medium100 algorithm Small 70K Medium Inducti bn Medium 220K Learn Model Medium 75K 10 No Small Training Set Model Apply Decision Model Tid Attrib1 Attrib2 Attrib3 Class Tree Deduction 110K 14 No Small 67K Test Set C Tan, Steinbach, Kumar Introduction to Data Mining 18/2004
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 ‹#› Decision Tree Classification Task Apply Model Induction Deduction Learn Model Model Tid Attrib1 Attrib2 Attrib3 Class 1 Yes Large 125K No 2 No Medium 100K No 3 No Small 70K No 4 Yes Medium 120K No 5 No Large 95K Yes 6 No Medium 60K No 7 Yes Large 220K No 8 No Small 85K Yes 9 No Medium 75K No 10 No Small 90K Yes 10 Tid Attrib1 Attrib2 Attrib3 Class 11 No Small 55K ? 12 Yes Medium 80K ? 13 Yes Large 110K ? 14 No Small 95K ? 15 No Large 67K ? 10 Test Set Tree Induction algorithm Training Set Decision Tree