Decision Tree & Random Forest

Clustering

Clustering
• Clustering is the process of grouping a set of objects into classes of similar objects.
• It groups data points that are close (or similar) to each other.
• Clustering is unsupervised machine learning: there are no predefined classes.
• A good clustering method will produce clusters with:
  o High intra-class similarity
  o Low inter-class similarity
Types of Clustering
Centroid-based clustering
o The centroid of a cluster is the arithmetic mean of all the points in the cluster.
o Centroid-based clustering organizes the data into non-hierarchical clusters.
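A minimal sketch of centroid-based clustering using scikit-learn's KMeans (the availability of scikit-learn is an assumption here, and the toy points are made up for illustration):

```python
import numpy as np
from sklearn.cluster import KMeans

# Toy 2-D points forming two loose groups (illustrative data).
X = np.array([[1, 2], [1, 4], [2, 3],
              [8, 8], [9, 9], [8, 10]])

# Fit 2 centroids; each centroid is the arithmetic mean of its cluster's points.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)           # cluster index assigned to each point
print(kmeans.cluster_centers_)  # the two centroids
```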
Clustering
Density-based clustering
o Density-based clustering connects contiguous areas of high example density into clusters.
o This allows for the discovery of any number of clusters of any shape.
o Outliers are not assigned to clusters.
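A minimal density-based sketch with scikit-learn's DBSCAN; the eps and min_samples values below are arbitrary choices for this toy data, not recommendations:

```python
import numpy as np
from sklearn.cluster import DBSCAN

# Two dense groups plus one far-away point that should become an outlier.
X = np.array([[1, 1], [1, 2], [2, 1],
              [8, 8], [8, 9], [9, 8],
              [25, 25]])

# Points within eps of each other (with at least min_samples neighbours)
# are connected into one cluster; everything else is labelled -1 (noise).
db = DBSCAN(eps=2.0, min_samples=2).fit(X)
print(db.labels_)  # e.g. [0 0 0 1 1 1 -1]: two clusters and one outlier
```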
Clustering
Distribution-based clustering
o This approach assumes the data is composed of probabilistic distributions, such as Gaussian distributions.
o A distribution-based algorithm might, for example, cluster the data into three Gaussian distributions.
o As distance from a distribution's center increases, the probability that a point belongs to that distribution decreases.
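A minimal distribution-based sketch using a Gaussian mixture model from scikit-learn; the three components mirror the three-Gaussian example above, and the sample data is synthetic:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Synthetic 1-D data drawn from three Gaussians with different means.
X = np.concatenate([rng.normal(0, 1, 100),
                    rng.normal(5, 1, 100),
                    rng.normal(10, 1, 100)]).reshape(-1, 1)

gmm = GaussianMixture(n_components=3, random_state=0).fit(X)
# predict_proba gives the probability that a point belongs to each
# distribution; it falls off with distance from the component's center.
print(gmm.means_.ravel())          # roughly 0, 5, 10 (in some order)
print(gmm.predict_proba([[4.0]]))  # soft membership for a new point
```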
Clustering
Hierarchical clustering
o Hierarchical clustering creates a tree of clusters.
o It is well suited to hierarchical data.
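A minimal hierarchical sketch with scikit-learn's AgglomerativeClustering, which builds the cluster tree bottom-up by repeatedly merging the closest clusters (toy data and illustrative parameters):

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering

X = np.array([[1, 1], [1, 2], [2, 1],
              [8, 8], [8, 9], [9, 8]])

# Agglomerative clustering: start with every point as its own cluster,
# then merge the two closest clusters until only 2 remain.
hc = AgglomerativeClustering(n_clusters=2, linkage="ward").fit(X)
print(hc.labels_)  # e.g. [0 0 0 1 1 1]
```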
Decision Tree
Decision Tree
• A decision tree is a supervised learning algorithm used for classification and regression modeling.
• It is mostly preferred for solving classification problems.
• A decision tree is a non-parametric supervised learning algorithm.
• It is a tree-structured classifier:
• Internal nodes represent the features of a dataset.
• Branches represent the decision rules.
• Each leaf node represents the final decision/outcome.
Decision Tree
• A decision tree starts with a root node, which does not have any incoming branches.
• The outgoing branches from the root node then feed into the internal nodes (decision nodes).
• Decision nodes are used to make decisions and have multiple branches.
• Leaf nodes are the outputs of those decisions and do not contain any further branches.
Decision Tree
• A decision tree is a graphical representation for obtaining all the possible solutions to a problem/decision based on given conditions.
• To build a tree, the CART (Classification and Regression Trees) algorithm can be used.
• A decision tree simply asks a question and, based on the answer (Yes/No), further splits the tree into subtrees.
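scikit-learn's DecisionTreeClassifier implements an optimized version of CART; here is a minimal sketch on a toy yes/no dataset (the features and values below are made up for illustration):

```python
from sklearn.tree import DecisionTreeClassifier, export_text

# Toy features: [salary_in_lakhs, distance_from_office_km]; label: accept offer?
X = [[60, 5], [45, 30], [80, 10], [30, 2], [70, 25], [50, 8]]
y = ["yes", "no", "yes", "no", "no", "yes"]

# CART splits each node on the feature/threshold that best reduces
# Gini impurity (criterion="gini" is the default).
tree = DecisionTreeClassifier(criterion="gini", random_state=0).fit(X, y)
print(export_text(tree, feature_names=["salary", "distance"]))
print(tree.predict([[65, 7]]))  # ask the questions, follow the branches
```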
Decision Tree Example
Decision Tree Terminologies
• Root node: The root node is where the decision tree starts. It represents the entire dataset, which further gets divided into two or more homogeneous sets.
• Decision (or internal) node: Decision nodes are used to make decisions and have multiple branches.
• Leaf (external or terminal) node: Leaf nodes are the final output nodes; the tree cannot be segregated further after reaching a leaf node.
Decision Tree
• Splitting: Splitting is the process of dividing the decision node/root node into sub-nodes according to the given conditions.
• Branch/sub-tree: A tree formed by splitting the tree.
• Pruning: Pruning is the process of removing unwanted branches from the tree.
• Parent/child node: A node that is divided into sub-nodes is called the parent node, and the sub-nodes are called the child nodes.
Decision Tree – Solving Example 1
Decision Tree – Solving Example 2
• Suppose there is a candidate who has a job offer and wants to decide whether to accept the offer or not.
Decision Tree - Algorithm
• Step 1: Begin the tree with the root node, say S, which contains the complete dataset.
• Step 2: Find the best attribute in the dataset using an Attribute Selection Measure (ASM), such as:
  o Information Gain
  o Gini Index
Decision Tree - Algorithm
• Step 3: Divide S into subsets that contain the possible values for the best attribute.
• Step 4: Generate the decision tree node that contains the best attribute.
• Step 5: Recursively make new decision trees using the subsets of the dataset created in Step 3. Continue this process until a stage is reached where the nodes cannot be classified further; such a final node is called a leaf node.
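As a sketch of how the attribute selection measures in Step 2 might be computed, here is a self-contained illustration; the function names and toy labels are hypothetical:

```python
from collections import Counter
import math

def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def entropy(labels):
    """Shannon entropy in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(parent, children):
    """Entropy of the parent minus the size-weighted entropy of the splits."""
    n = len(parent)
    weighted = sum(len(ch) / n * entropy(ch) for ch in children)
    return entropy(parent) - weighted

# Toy split: 10 labels at the root, divided by some attribute into two branches.
parent = ["yes"] * 6 + ["no"] * 4
left, right = ["yes"] * 5 + ["no"] * 1, ["yes"] * 1 + ["no"] * 3
print(f"Gini(parent) = {gini(parent):.3f}")  # 1 - (0.6^2 + 0.4^2) = 0.480
print(f"Information gain = {information_gain(parent, [left, right]):.3f}")
```

The attribute whose split yields the highest information gain (or the lowest weighted Gini impurity) is chosen as the best attribute for that node.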
Decision Tree
Advantages
• It is simple to understand, as it follows the same process a human follows when making a decision in real life.
• It can be very useful for solving decision-related problems.
• It helps to think through all the possible outcomes for a problem.
• It requires less data cleaning compared to other algorithms.
Decision Tree
Disadvantages
• A decision tree can contain many layers, which makes it complex.
• It may have an overfitting issue, which can be addressed using the Random Forest algorithm.
• With more class labels, the computational complexity of the decision tree may increase.
Random Forest
Random Forest
• Random Forest is a popular machine learning algorithm that belongs to the supervised learning technique.
• It can be used for both classification and regression problems in ML.
• It is based on the concept of ensemble learning: combining multiple classifiers to solve a complex problem and to improve the performance of the model.
Random Forest
• Random Forest is a classifier that contains a number of decision trees built on various subsets of the given dataset and aggregates their predictions to improve predictive accuracy.
• A greater number of trees in the forest generally leads to higher accuracy and reduces the risk of overfitting.
Random Forest - Algorithm
• Step 1: Select K random data points from the training set.
• Step 2: Build a decision tree associated with the selected data points (subset).
• Step 3: Choose the number N of decision trees that you want to build.
• Step 4: Repeat Steps 1 & 2 until N trees are built.
• Step 5: For a new data point, find the prediction of each decision tree, and assign the new data point to the category that wins the majority vote.
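A minimal sketch of this build-many-trees-and-vote idea with scikit-learn's RandomForestClassifier; n_estimators corresponds to N, and the synthetic dataset is illustrative only:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic classification data (illustrative only).
X, y = make_classification(n_samples=300, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# N = 100 trees, each fit on a random bootstrap sample of the training set;
# class predictions are combined by majority vote across the trees.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
print(f"Test accuracy: {forest.score(X_test, y_test):.3f}")
```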
Random Forest
Advantages
• Random Forest is capable of performing both classification and regression tasks.
• It is capable of handling large datasets with high dimensionality.
• It enhances the accuracy of the model and reduces the overfitting issue.
Disadvantages
• Although random forest can be used for both classification and regression tasks, it is less suitable for regression tasks.
Decision Tree vs Random Forest
