Decision trees are a type of supervised machine learning algorithm that uses a tree-like structure to model decisions and their consequences. They work by recursively splitting the data into subsets based on feature values, and can handle both categorical and numerical data. Decision trees are easy to interpret, can capture complex interactions, and are robust to noise.
They are used for both classification and regression, and their tree-like structure is built from nodes, branches, and leaves. Decision trees are non-parametric, which means they make no assumptions about the distribution of the data.

Key Terms
Root Node: Represents the entire population or sample; it is further divided into two or more homogeneous sets.
Leaf / Terminal Node: A node that cannot be split further.
Decision Node: A sub-node that splits into further sub-nodes.
Branch / Sub-Tree: A subsection of the entire tree is called a branch or sub-tree.
Parent and Child Node: A node that is divided into sub-nodes is called the parent node; the sub-nodes are its children.
Splitting: The process of dividing a node into two or more sub-nodes.
Pruning: The process of selectively removing branches from a fitted tree, typically those that contribute little predictive power. The goal is to reduce the tree's complexity and prevent overfitting.
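For example, scikit-learn exposes pruning through cost-complexity pruning; a minimal sketch (the ccp_alpha value is illustrative):

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_iris(return_X_y=True)

    # Without pruning, the tree grows until every leaf is pure.
    unpruned = DecisionTreeClassifier(random_state=0).fit(X, y)

    # A positive ccp_alpha removes branches whose complexity outweighs
    # their contribution to training accuracy.
    pruned = DecisionTreeClassifier(ccp_alpha=0.02, random_state=0).fit(X, y)

    print(unpruned.tree_.node_count, pruned.tree_.node_count)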
Impurity: A measure of the homogeneity of the labels at a node. Common implementations provide two impurity measures for classification (Gini impurity and entropy) and one impurity measure for regression (variance). The algorithm chooses the partition that maximizes the purity of the split (i.e., minimizes the impurity).
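As a sketch, these impurity measures can be computed directly from a node's labels (the function names are illustrative):

    from collections import Counter
    from math import log2

    def gini(labels):
        # Gini impurity: 1 minus the sum of squared class proportions.
        n = len(labels)
        return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

    def entropy(labels):
        # Entropy: -sum of p * log2(p) over class proportions.
        n = len(labels)
        return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

    def variance(values):
        # Variance of the target values, used for regression trees.
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values) / len(values)

    print(gini([0, 0, 1, 1]))  # 0.5, maximally impure for two classes
    print(gini([0, 0, 0, 0]))  # 0.0, a pure node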
How decision trees work?

The decision tree algorithm works by recursively partitioning the data into subsets based on the values of different features. At each node of the tree, the algorithm selects the feature that best separates the data into subsets that are most homogeneous with respect to the target variable. This process is repeated until a stopping criterion is met, such as a maximum tree depth or a minimum number of data points in a leaf node.
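End to end, this is what the process looks like with scikit-learn (a minimal sketch on the bundled Iris dataset; the hyperparameter values are illustrative):

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # max_depth and min_samples_leaf are the stopping criteria mentioned above.
    clf = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5, random_state=0)
    clf.fit(X_train, y_train)
    print(clf.score(X_test, y_test))

The steps below unpack what happens inside fit.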
Step 1. Select the best feature to split the
data: The algorithm looks at all the input features and selects the one that provides the best split in terms of maximizing the information gain or minimizing the impurity of the resulting subsets.
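As a sketch, information gain is the parent node's impurity minus the weighted impurity of the children; this builds on the gini function from the impurity example above:

    def information_gain(parent_labels, left_labels, right_labels, impurity=gini):
        # Impurity reduction achieved by splitting parent into left/right.
        n = len(parent_labels)
        weighted = (len(left_labels) / n) * impurity(left_labels) \
                 + (len(right_labels) / n) * impurity(right_labels)
        return impurity(parent_labels) - weighted

    # A perfect split of a 50/50 node recovers the full 0.5 Gini impurity.
    print(information_gain([0, 0, 1, 1], [0, 0], [1, 1]))  # 0.5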
Step 2. Split the data: Once the best feature is selected, the algorithm splits the data into subsets based on the feature's values. Each subset represents a branch or child node of the tree.
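Continuing the sketch, a numerical feature is split on a threshold; candidate thresholds are scored with the information_gain function above (split_on_threshold and best_split are illustrative names):

    def split_on_threshold(rows, labels, feature, threshold):
        # Partition (row, label) pairs on row[feature] <= threshold.
        left, right = [], []
        for row, label in zip(rows, labels):
            (left if row[feature] <= threshold else right).append((row, label))
        return left, right

    def best_split(rows, labels, feature):
        # Try every observed value of the feature as a candidate threshold.
        best_gain, best_threshold = 0.0, None
        for threshold in sorted({row[feature] for row in rows}):
            left, right = split_on_threshold(rows, labels, feature, threshold)
            if not left or not right:
                continue
            gain = information_gain(labels,
                                    [lab for _, lab in left],
                                    [lab for _, lab in right])
            if gain > best_gain:
                best_gain, best_threshold = gain, threshold
        return best_threshold, best_gain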
Step 3. Recurse on the child nodes: The algorithm recursively applies the above steps on each child node until a stopping criterion is met, such as reaching a maximum depth, having a minimum number of data points in a leaf node, or achieving a certain level of accuracy.
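The recursion can then be sketched on top of the helpers above; make_leaf, which terminates the recursion, is shown under Step 4 below:

    def build_tree(rows, labels, features, depth=0, max_depth=5, min_samples=2):
        # Stop when the node is deep, small, or already pure.
        if depth >= max_depth or len(rows) < min_samples or len(set(labels)) == 1:
            return make_leaf(labels)  # defined under Step 4
        # Step 1: pick the feature/threshold pair with the highest gain.
        feature, threshold, gain = max(
            ((f,) + best_split(rows, labels, f) for f in features),
            key=lambda t: t[2])
        if threshold is None or gain == 0.0:
            return make_leaf(labels)
        # Step 2: split the data, then recurse on each child node.
        left, right = split_on_threshold(rows, labels, feature, threshold)
        return {"feature": feature, "threshold": threshold,
                "left": build_tree([r for r, _ in left], [lab for _, lab in left],
                                   features, depth + 1, max_depth, min_samples),
                "right": build_tree([r for r, _ in right], [lab for _, lab in right],
                                    features, depth + 1, max_depth, min_samples)}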
Step 4. Assign a class or regression value to each leaf node: Once the tree has been constructed, the algorithm assigns a class label or regression value to each leaf node based on the majority class of the data points or the average value of the target variable in that node.
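The leaf constructor referenced in the recursion sketch assigns the majority class, or the mean target value for a regression tree:

    from collections import Counter

    def make_leaf(labels, task="classification"):
        # Majority class for classification, mean target value for regression.
        if task == "classification":
            return {"leaf": Counter(labels).most_common(1)[0][0]}
        return {"leaf": sum(labels) / len(labels)}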
Step 5. Make predictions: To make predictions on new data, the algorithm traverses the tree based on the feature values of the data point until it reaches a leaf node, and assigns the corresponding class label or regression value.
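Prediction is a walk from the root to a leaf, following the split that matches each feature value (matching the dictionary nodes built in the sketches above):

    def predict(node, row):
        # Descend until a leaf node is reached, then return its value.
        while "leaf" not in node:
            branch = "left" if row[node["feature"]] <= node["threshold"] else "right"
            node = node[branch]
        return node["leaf"]

    tree = build_tree([[2.0], [3.0], [10.0], [11.0]], [0, 0, 1, 1], features=[0])
    print(predict(tree, [2.5]))   # 0
    print(predict(tree, [12.0]))  # 1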
Advantages of Decision Trees

1. Easy to interpret: Decision trees provide a simple and intuitive representation of the decision-making process, making them easy to understand and interpret. This is especially useful for non-technical stakeholders who need to make decisions based on the model's predictions.

2. Can handle both categorical and numerical data: Decision trees can handle a mix of categorical and numerical data, making them suitable for a wide range of applications.
3. Able to capture complex interactions between variables: Decision trees can capture complex interactions between variables, including non-linear relationships and dependencies, making them a good choice for data with many features.

4. Robust to noise: Decision trees are relatively robust to noisy data, and some implementations can handle missing values without the need for imputation.
5. Fast and efficient: Decision trees are relatively
fast and efficient to train and can handle large datasets with millions of observations.
6. Can be combined with other algorithms: Decision trees can be used as building blocks in ensemble methods such as random forests and gradient boosting, which can further improve their performance.
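For instance, scikit-learn builds both kinds of ensembles from decision trees (a minimal sketch; n_estimators is illustrative):

    from sklearn.datasets import load_iris
    from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier

    X, y = load_iris(return_X_y=True)

    # A random forest bags many randomized trees...
    forest = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
    # ...while gradient boosting fits shallow trees sequentially to residual errors.
    boosted = GradientBoostingClassifier(n_estimators=100, random_state=0).fit(X, y)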
Disadvantages of Decision Trees

1. Prone to overfitting: Decision trees can easily overfit the training data if not properly pruned or regularized, leading to poor generalization performance on new data.

2. Sensitive to small variations in the data: Decision trees are sensitive to small variations in the data, which can lead to very different trees being generated from slightly different training sets. This can make the model less robust and more difficult to interpret.
3. Can be biased towards features with many levels: Decision trees tend to be biased towards features with many distinct levels, which can lead to overemphasis on these features at the expense of other important features.

4. May not handle continuous variables well: Splits on continuous variables are simple axis-aligned thresholds, so decision trees approximate smooth relationships with step functions and may need many splits, or careful discretization, to model continuous variables well.
5. May not be the most accurate algorithm for certain problems: While decision trees are a powerful and flexible algorithm, they may not always provide the best predictive accuracy compared to other algorithms like random forests or neural networks, especially for high-dimensional or complex data.

Follow #DataRanch on LinkedIn for more... info@dataranch.org