
Decision Trees

Arun Kumar

IIT Ropar

1 / 14
Outline

1 Elements of Information Theory

2 Decision Tree Classification for Categorical Data

3 Decision Tree Regression

2 / 14
History

• Information theory was introduced in 1948 by Shannon.

• The theory came into existence in connection with the problem of transmission of information along communication channels.

• "Information" in itself is a very general, qualitative, subjective and not very precise concept.

• However, information theory has developed into a quantitative, precise, objective and very useful theory.

3 / 14
Shannon’s Information

• Let the PMF of the rv X be given.

• The question posed by Shannon is the following: "Can we find a measure of how uncertain we are of the outcome?"

• Shannon then assumed that if such a function, denoted H(p_1, ..., p_n), exists, it is reasonable to expect that it will have the following properties.
  a. H should be continuous in all the p_i.
  b. If all the p_i are equal, i.e., p_i = 1/n, then H should have a maximum value, and this maximum value should be a monotonically increasing function of n.
  c. If a choice is broken down into successive choices, the quantity H should be the weighted sum of the individual values of H.

• The entropy function is defined by H(p_1, p_2, ..., p_n) = -\sum_{i=1}^{n} p_i \log(p_i).

4 / 14
Measure of Impurity

Entropy
Entropy for a set S is given by

  H(S) = -\sum_{c \in C} p(c) \log_2 p(c),

where C is the set of classes in S and p(c) are the proportions of the different classes.

Gini
Gini impurity for a set S, where the target variable takes N different labels, is given by

  Gini(S) = \sum_{i \neq j} p(i) p(j) = 1 - \sum_{i=1}^{N} p(i)^2,

where p(i) are the proportions of the different labels in the set S.
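As a quick illustration of these two impurity measures (not part of the original slides; the function names and example set are my own), a minimal Python sketch that computes entropy and Gini impurity from the class proportions of a list of labels:

```python
from collections import Counter
from math import log2

def entropy(labels):
    """H(S) = -sum over classes c of p(c) * log2 p(c)."""
    n = len(labels)
    return -sum((cnt / n) * log2(cnt / n) for cnt in Counter(labels).values())

def gini(labels):
    """Gini(S) = 1 - sum over labels i of p(i)^2."""
    n = len(labels)
    return 1 - sum((cnt / n) ** 2 for cnt in Counter(labels).values())

# Example: a set with 9 positive and 5 negative instances.
S = ["yes"] * 9 + ["no"] * 5
print(entropy(S))  # ~0.940
print(gini(S))     # ~0.459
```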

5 / 14
Decision Tree Introduction

• The Decision Tree algorithm belongs to the family of supervised learning algorithms.
• A decision tree can be used for solving classification as well as regression problems.
• Decision trees classify instances by sorting them down the tree from the root node to some leaf node. The leaf node assigns the class of the instance.
• Each node in the tree specifies a test of some attribute of the instance, and each branch emanating from that node corresponds to one of the possible values of that attribute; a small sketch of this sorting procedure is given below.
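To make the sorting procedure concrete, here is a small sketch (my own illustration, not from the slides) in which a tree is stored as nested (attribute, branches) pairs and an instance is classified by walking from the root to a leaf:

```python
# A hypothetical weather-style tree: internal nodes are (attribute, {value: subtree})
# pairs and leaves are plain class labels.
tree = ("Outlook", {
    "Sunny":    ("Humidity", {"High": "No", "Normal": "Yes"}),
    "Overcast": "Yes",
    "Rain":     ("Wind", {"Strong": "No", "Weak": "Yes"}),
})

def classify(node, instance):
    """Sort the instance down the tree until a leaf (a class label) is reached."""
    while isinstance(node, tuple):                # still at an internal node
        attribute, branches = node
        node = branches[instance[attribute]]      # follow the branch for this attribute value
    return node

print(classify(tree, {"Outlook": "Sunny", "Humidity": "Normal", "Wind": "Weak"}))  # "Yes"
```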

6 / 14
Sample Decision Tree

7 / 14
Algorithms to build decision trees

• ID3 (Iterative Dichotomiser 3): uses entropy and information gain as metrics.
• CART (Classification and Regression Trees): uses the Gini index as metric.
• Decision tree regression: builds the tree by recursive binary splitting, with the RSS as splitting criterion.
• Others

8 / 14
ID3 Algorithm based on Weather Data

¹ Based on "Machine Learning", by T. Mitchell, Ch. 3
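The weather table on this slide appears only as an image; as a hedged sketch of the ID3 selection step, the snippet below computes the information gain of each attribute on what I assume is the standard play-tennis data from Mitchell, Ch. 3, and picks the attribute with the highest gain as the root split:

```python
from collections import Counter
from math import log2

# Play-tennis training examples, assumed from Mitchell, "Machine Learning", Table 3.2:
# (Outlook, Temperature, Humidity, Wind) -> PlayTennis
data = [
    (("Sunny", "Hot", "High", "Weak"), "No"),          (("Sunny", "Hot", "High", "Strong"), "No"),
    (("Overcast", "Hot", "High", "Weak"), "Yes"),      (("Rain", "Mild", "High", "Weak"), "Yes"),
    (("Rain", "Cool", "Normal", "Weak"), "Yes"),       (("Rain", "Cool", "Normal", "Strong"), "No"),
    (("Overcast", "Cool", "Normal", "Strong"), "Yes"), (("Sunny", "Mild", "High", "Weak"), "No"),
    (("Sunny", "Cool", "Normal", "Weak"), "Yes"),      (("Rain", "Mild", "Normal", "Weak"), "Yes"),
    (("Sunny", "Mild", "Normal", "Strong"), "Yes"),    (("Overcast", "Mild", "High", "Strong"), "Yes"),
    (("Overcast", "Hot", "Normal", "Weak"), "Yes"),    (("Rain", "Mild", "High", "Strong"), "No"),
]
attributes = ["Outlook", "Temperature", "Humidity", "Wind"]

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(data, attr_index):
    """Gain(S, A) = H(S) - sum over values v of (|S_v| / |S|) * H(S_v)."""
    labels = [y for _, y in data]
    gain = entropy(labels)
    for value in {x[attr_index] for x, _ in data}:
        subset = [y for x, y in data if x[attr_index] == value]
        gain -= (len(subset) / len(labels)) * entropy(subset)
    return gain

# ID3 chooses the attribute with the highest information gain as the root split.
for i, name in enumerate(attributes):
    print(name, round(information_gain(data, i), 3))
# Outlook has the largest gain (about 0.247), so it becomes the root node.
```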
9 / 14
Final Decision Tree

10 / 14
Recursive Binary Tree Splitting Algorithm

• Suppose there are p predictors.
• We find the predictor X_j and the cutpoint s such that splitting the predictor space into the regions {X | X_j < s} and {X | X_j ≥ s} leads to the greatest reduction in the Residual Sum of Squares (RSS).
• In detail, for any j and s, define the pair of half-planes R_1(j, s) = {X | X_j < s} and R_2(j, s) = {X | X_j ≥ s}, and search for the pair (j, s) that minimizes the RSS

  \sum_{i:\, x_i \in R_1(j,s)} (y_i - \hat{y}_{R_1})^2 + \sum_{i:\, x_i \in R_2(j,s)} (y_i - \hat{y}_{R_2})^2,

where \hat{y}_{R_1} is the mean response for the training observations in R_1(j, s), and \hat{y}_{R_2} is the mean response for the training observations in R_2(j, s).²
² Based on "An Introduction to Statistical Learning with Applications in R", Chapter 8, Page 306
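As a sketch of this search (my own illustration; the function and variable names are not from the slides), the best pair (j, s) can be found by brute force over each predictor and each observed cutpoint value:

```python
import numpy as np

def best_split(X, y):
    """Find (j, s) minimizing the RSS of the split {X_j < s} vs. {X_j >= s}.

    X: (n, p) array of predictors; y: (n,) array of responses.
    """
    best_j, best_s, best_rss = None, None, np.inf
    for j in range(X.shape[1]):
        for s in np.unique(X[:, j]):
            left, right = y[X[:, j] < s], y[X[:, j] >= s]
            if len(left) == 0 or len(right) == 0:
                continue  # skip degenerate splits that leave one region empty
            rss = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
            if rss < best_rss:
                best_j, best_s, best_rss = j, s, rss
    return best_j, best_s, best_rss

# A regression tree is then grown by applying best_split recursively to each resulting region.
```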
11 / 14
Data

12 / 14
Final Decision Tree Based On Python Sklearn
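The data and the fitted tree on these slides appear only as images; below is a hedged sketch of the typical scikit-learn calls that produce such a tree plot (X and y are placeholders, not the actual data from the "Data" slide):

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.tree import DecisionTreeRegressor, plot_tree

# Placeholder data; the real predictors and responses from the "Data" slide would go here.
X = np.arange(1, 11, dtype=float).reshape(-1, 1)
y = np.array([2.0, 3.0, 4.0, 10.0, 11.0, 12.0, 30.0, 31.0, 60.0, 62.0])

# Splits are chosen greedily to minimize the squared error (RSS), as described above.
reg = DecisionTreeRegressor(max_depth=2, random_state=0)
reg.fit(X, y)

plot_tree(reg, feature_names=["x"], filled=True)
plt.show()
```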

13 / 14
References

• Beazley, D. M. (2009). Python: Essential Reference (4th ed.). Pearson Education, Inc.

• James, G., Witten, D., Hastie, T. and Tibshirani, R. (2013). An Introduction to Statistical Learning with Applications in R. Springer New York.

• Mitchell, T. M. (2017). Machine Learning. McGraw Hill Education.

• https://www.superdatascience.com/

14 / 14
