0% found this document useful (0 votes)

165 views

02 - Decision Tree Classification On Iris Dataset

The document discusses building a decision tree classification model to predict iris flower species (Iris-setosa, Iris-versicolor, Iris-virginica) based on sepal and petal attributes. It loads the iris dataset, splits it into training and test sets, trains a decision tree classifier, evaluates its accuracy at 97.8%, and visually plots the decision tree to show how it makes predictions based on attribute thresholds. It also demonstrates predicting the species for new data points using the trained decision tree model.

Uploaded by

John Wick

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

165 views

02 - Decision Tree Classification On Iris Dataset

Uploaded by

John Wick

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Practical - 2

AIM :- Decision Tree Classification on iris

Dataset

Import Libraries

In [1]:

1 import numpy as np
2 import pandas as pd
3 from sklearn.tree import DecisionTreeClassifier

Loading iris.csv Dataset in Pandas Dataframe

In [2]:

1 data = pd.read_csv("Iris.csv")
2 data.head(3)

Out[2]:

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

Getting Information about data

In [3]:

1 data.info()

RangeIndex: 150 entries, 0 to 149

Data columns (total 6 columns):

# Column Non-Null Count Dtype

--- ------ -------------- -----

0 Id 150 non-null int64

1 SepalLengthCm 150 non-null float64

2 SepalWidthCm 150 non-null float64

3 PetalLengthCm 150 non-null float64

4 PetalWidthCm 150 non-null float64

5 Species 150 non-null object

dtypes: float64(4), int64(1), object(1)

memory usage: 7.2+ KB


X is data and Y is target data i.e species

In [4]:

1 X = data[['SepalLengthCm','SepalWidthCm','PetalLengthCm','PetalWidthCm']].values
2 X[:5]

Out[4]:

array([[5.1, 3.5, 1.4, 0.2],

[4.9, 3. , 1.4, 0.2],

[4.7, 3.2, 1.3, 0.2],

[4.6, 3.1, 1.5, 0.2],

[5. , 3.6, 1.4, 0.2]])

In [5]:

1 Y = data['Species']
2 Y[:5]

Out[5]:

0 Iris-setosa

1 Iris-setosa

2 Iris-setosa

3 Iris-setosa

4 Iris-setosa

Name: Species, dtype: object

Training Model

In [6]:

1 from sklearn.model_selection import train_test_split

2
3 X_trainset, X_testset, Y_trainset, Y_testset = train_test_splittrain_X, test_X, train_
4 X, Y, test_size=0.3, random_state=0)

In [7]:

1 SpeciesTree = DecisionTreeClassifier(criterion = 'entropy', max_depth = 4)

2 SpeciesTree

Out[7]:

DecisionTreeClassifier(criterion='entropy', max_depth=4)

In [8]:

1 SpeciesTree.fit(X_trainset, Y_trainset)

Out[8]:

DecisionTreeClassifier(criterion='entropy', max_depth=4)

Prediction
In [9]:

1 predTree = SpeciesTree.predict(X_testset)
2 predTree [0:5]

Out[9]:

array(['Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa'], dtype=object)

In [10]:

1 Y_testset[0:5]

Out[10]:

114 Iris-virginica

62 Iris-versicolor

33 Iris-setosa

107 Iris-virginica

7 Iris-setosa

Name: Species, dtype: object

In [11]:

1 from sklearn import metrics

2 import matplotlib.pyplot as plt
3 print("DecisionTrees's Accuracy: ",metrics.accuracy_score(Y_testset, predTree))

DecisionTrees's Accuracy: 0.9777777777777777

Visualizing the Decision Tree

In [12]:

1 import matplotlib.pyplot as plt

2 from sklearn.tree import DecisionTreeClassifier
3 from sklearn import tree
4
5 fn = data.columns[1:5]
6 cn = data["Species"].unique().tolist()
7 SpeciesTree.fit(X, Y)
8 fig, axes = plt.subplots(nrows=1, ncols=1, figsize=(10, 10), dpi=300)
9
10 tree.plot_tree(SpeciesTree, feature_names=fn, class_names=cn, filled=True)

Out[12]:

[Text(0.5, 0.9, 'PetalLengthCm <= 2.45\nentropy = 1.585\nsamples = 150\nvalu

e = [50, 50, 50]\nclass = Iris-setosa'),

Text(0.4230769230769231, 0.7, 'entropy = 0.0\nsamples = 50\nvalue = [50, 0,

0]\nclass = Iris-setosa'),

Text(0.5769230769230769, 0.7, 'PetalWidthCm <= 1.75\nentropy = 1.0\nsamples

= 100\nvalue = [0, 50, 50]\nclass = Iris-versicolor'),

Text(0.3076923076923077, 0.5, 'PetalLengthCm <= 4.95\nentropy = 0.445\nsamp

les = 54\nvalue = [0, 49, 5]\nclass = Iris-versicolor'),

Text(0.15384615384615385, 0.3, 'PetalWidthCm <= 1.65\nentropy = 0.146\nsamp

les = 48\nvalue = [0, 47, 1]\nclass = Iris-versicolor'),

Text(0.07692307692307693, 0.1, 'entropy = 0.0\nsamples = 47\nvalue = [0, 4

7, 0]\nclass = Iris-versicolor'),

Text(0.23076923076923078, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 0,

1]\nclass = Iris-virginica'),

Text(0.46153846153846156, 0.3, 'PetalWidthCm <= 1.55\nentropy = 0.918\nsamp

les = 6\nvalue = [0, 2, 4]\nclass = Iris-virginica'),

Text(0.38461538461538464, 0.1, 'entropy = 0.0\nsamples = 3\nvalue = [0, 0,

3]\nclass = Iris-virginica'),

Text(0.5384615384615384, 0.1, 'entropy = 0.918\nsamples = 3\nvalue = [0, 2,

1]\nclass = Iris-versicolor'),

Text(0.8461538461538461, 0.5, 'PetalLengthCm <= 4.85\nentropy = 0.151\nsamp

les = 46\nvalue = [0, 1, 45]\nclass = Iris-virginica'),

Text(0.7692307692307693, 0.3, 'SepalLengthCm <= 5.95\nentropy = 0.918\nsamp

les = 3\nvalue = [0, 1, 2]\nclass = Iris-virginica'),

Text(0.6923076923076923, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1,

0]\nclass = Iris-versicolor'),

Text(0.8461538461538461, 0.1, 'entropy = 0.0\nsamples = 2\nvalue = [0, 0,

2]\nclass = Iris-virginica'),

Text(0.9230769230769231, 0.3, 'entropy = 0.0\nsamples = 43\nvalue = [0, 0,

43]\nclass = Iris-virginica')]
Predicting Species for Set of Values

Prediction-1
In [13]:

1 X_new = [[6.3,3.0,1.3,0.2]]
2 predTree = SpeciesTree.predict(X_new)
3 predTree

Out[13]:

array(['Iris-setosa'], dtype=object)

Prediction-2

In [14]:

1 X_new = [[5.4,2.8,2.9,1.5]]
2 predTree = SpeciesTree.predict(X_new)
3 predTree

Out[14]:

array(['Iris-versicolor'], dtype=object)

Prediction-3

In [15]:

1 X_new = [[5.4,2.8,2.9,0.5]]
2 predTree = SpeciesTree.predict(X_new)
3 predTree

Out[15]:

array(['Iris-versicolor'], dtype=object)

ATO PTO Basics in Oracle Applications
No ratings yet
ATO PTO Basics in Oracle Applications
153 pages
MAchine Learning
No ratings yet
MAchine Learning
120 pages
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
Python Data Analysis Visualization
No ratings yet
Python Data Analysis Visualization
34 pages
Pattern Classification
100% (1)
Pattern Classification
42 pages
Churn Modeling
100% (1)
Churn Modeling
11 pages
Poly
100% (1)
Poly
108 pages
PCA Using Python
No ratings yet
PCA Using Python
18 pages
Statistics in Details
100% (2)
Statistics in Details
283 pages
ML Interview Questions and Answers
100% (1)
ML Interview Questions and Answers
25 pages
HW1
100% (1)
HW1
8 pages
Machine Learning With Real Life Project: by - Rishabh Gaur
100% (2)
Machine Learning With Real Life Project: by - Rishabh Gaur
26 pages
Churn For Bank Customers
No ratings yet
Churn For Bank Customers
28 pages
Bank Customer Churn Analysis - Jupyter Notebook
No ratings yet
Bank Customer Churn Analysis - Jupyter Notebook
11 pages
Parallelism of Statistics and Machine Learning & Logistic Regression Versus Random Forest
100% (1)
Parallelism of Statistics and Machine Learning & Logistic Regression Versus Random Forest
72 pages
Answers To Problems For Data Mining and Predictive Analytics (2nd Edition) by Larose
No ratings yet
Answers To Problems For Data Mining and Predictive Analytics (2nd Edition) by Larose
12 pages
Unit V - Classification and Prediction 2020-21
100% (1)
Unit V - Classification and Prediction 2020-21
68 pages
Machine Learning Project Report
100% (1)
Machine Learning Project Report
4 pages
Face Detection & Emotion Recognition
No ratings yet
Face Detection & Emotion Recognition
26 pages
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
100% (1)
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
73 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Decision Tree
No ratings yet
Decision Tree
12 pages
K Means
100% (2)
K Means
329 pages
Simple - Linear - Regression - Ipynb - Colaboratory
No ratings yet
Simple - Linear - Regression - Ipynb - Colaboratory
2 pages
Ensemble Classifiers
100% (1)
Ensemble Classifiers
37 pages
Pandas
100% (1)
Pandas
1,131 pages
L2 - Machine Learning Process
No ratings yet
L2 - Machine Learning Process
17 pages
7 Classification
100% (3)
7 Classification
63 pages
Bagging and Random Forest Presentation1
100% (2)
Bagging and Random Forest Presentation1
23 pages
Classification and Regression Trees
100% (1)
Classification and Regression Trees
60 pages
ML UNIT-2 Notes
No ratings yet
ML UNIT-2 Notes
15 pages
DEA-7TT2 Associate-Data Science and Big Data Analytics v2 Exam
0% (1)
DEA-7TT2 Associate-Data Science and Big Data Analytics v2 Exam
4 pages
Bagging+Boosting+Gradient Boosting
100% (1)
Bagging+Boosting+Gradient Boosting
48 pages
13 PracticalMachineLearning
100% (1)
13 PracticalMachineLearning
84 pages
Machine Learning
100% (1)
Machine Learning
21 pages
270+ Machine Learning: Projects
100% (1)
270+ Machine Learning: Projects
15 pages
Loading The Dataset: 'Churn - Modelling - CSV'
No ratings yet
Loading The Dataset: 'Churn - Modelling - CSV'
6 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
Machine Learning LAB MANUAL
No ratings yet
Machine Learning LAB MANUAL
56 pages
Machine Learning Cheat Sheet
No ratings yet
Machine Learning Cheat Sheet
1 page
Matplotlib PDF
No ratings yet
Matplotlib PDF
16 pages
Scikit - Notes ML
100% (2)
Scikit - Notes ML
12 pages
Simple Linear Regression - Assignn5
No ratings yet
Simple Linear Regression - Assignn5
8 pages
Machine Learning Mini-Project Report
No ratings yet
Machine Learning Mini-Project Report
26 pages
Machine Learning
100% (5)
Machine Learning
56 pages
Logistic Regression: Jia Li
No ratings yet
Logistic Regression: Jia Li
44 pages
What Is Naive Bayes Algorithm?
No ratings yet
What Is Naive Bayes Algorithm?
18 pages
Missing Value Treatment
No ratings yet
Missing Value Treatment
22 pages
Understanding DBSCAN Algorithm and Implementation From Scratch - by Andrewngai - Towards Data Science
No ratings yet
Understanding DBSCAN Algorithm and Implementation From Scratch - by Andrewngai - Towards Data Science
10 pages
Import As
100% (1)
Import As
27 pages
Vinee
100% (1)
Vinee
28 pages
C2M2 - Assignment: 1 Risk Models Using Tree-Based Models
100% (1)
C2M2 - Assignment: 1 Risk Models Using Tree-Based Models
38 pages
### Data Exploration: 'Yes' 'No' 'Agency' 'Direct' 'Employee Referral' 'Yes' 'No'
100% (1)
### Data Exploration: 'Yes' 'No' 'Agency' 'Direct' 'Employee Referral' 'Yes' 'No'
6 pages
Data Visualization and Matplot
No ratings yet
Data Visualization and Matplot
11 pages
Prediction of Company Bankruptcy: Amlan Nag
100% (2)
Prediction of Company Bankruptcy: Amlan Nag
16 pages
Supervised Learning 1 PDF
100% (1)
Supervised Learning 1 PDF
162 pages
Data Pre-Processing (Pandas)
No ratings yet
Data Pre-Processing (Pandas)
19 pages
Ensemble Learning Methods
100% (1)
Ensemble Learning Methods
24 pages
Learn R By Coding
From Everand
Learn R By Coding
Thomas Kurnicki
No ratings yet
Effective Amazon Machine Learning
From Everand
Effective Amazon Machine Learning
Alexis Perrier
No ratings yet
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
From Everand
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
Abhishek Vijayvargia
No ratings yet
Nda Lab2
No ratings yet
Nda Lab2
2 pages
05 - I and F Pattern Classification Using Perceptron
No ratings yet
05 - I and F Pattern Classification Using Perceptron
3 pages
04 - Burglary Alarm Example Using Bayesian Network
100% (1)
04 - Burglary Alarm Example Using Bayesian Network
5 pages
03 - K Means Clustering On Iris Datasets
No ratings yet
03 - K Means Clustering On Iris Datasets
4 pages
85010-0055 - EST3 Control Display Modules
100% (1)
85010-0055 - EST3 Control Display Modules
4 pages
T24Ho-01 Alert Log
No ratings yet
T24Ho-01 Alert Log
8 pages
AWS Well-Architected Framework
100% (1)
AWS Well-Architected Framework
56 pages
ICTNWK511 Assessment 4 Case Study Project v2
No ratings yet
ICTNWK511 Assessment 4 Case Study Project v2
38 pages
ICT Skills Worksheet IIprintouts
No ratings yet
ICT Skills Worksheet IIprintouts
9 pages
xps-desktop-8960-spec-sheet
No ratings yet
xps-desktop-8960-spec-sheet
2 pages
Foot Printing:: Footprinting Means Gathering Information About A Target System Which Can
No ratings yet
Foot Printing:: Footprinting Means Gathering Information About A Target System Which Can
18 pages
A Walk With Shannon
No ratings yet
A Walk With Shannon
58 pages
Adventures in Automotive Networks A
No ratings yet
Adventures in Automotive Networks A
206 pages
z10 Installation Manual (GC28-6864-08b)
No ratings yet
z10 Installation Manual (GC28-6864-08b)
291 pages
Yokogawa Giza PDF
No ratings yet
Yokogawa Giza PDF
187 pages
HyperMesh 11.0 Core Tutorials
No ratings yet
HyperMesh 11.0 Core Tutorials
497 pages
CFP Icece 2020
No ratings yet
CFP Icece 2020
1 page
Dss Cheetshet
No ratings yet
Dss Cheetshet
3 pages
4048 Intake Map01
No ratings yet
4048 Intake Map01
1 page
Acer Travelmate 3200 Service Manual
No ratings yet
Acer Travelmate 3200 Service Manual
112 pages
SOA Interview Questions
No ratings yet
SOA Interview Questions
49 pages
Selling SaaS Prerelease
No ratings yet
Selling SaaS Prerelease
15 pages
Csacs Ins Acs in Ucs 1
No ratings yet
Csacs Ins Acs in Ucs 1
12 pages
Splendid CRM 2.1 Deployment Guide
No ratings yet
Splendid CRM 2.1 Deployment Guide
23 pages
Menor, Rosalinda A.
No ratings yet
Menor, Rosalinda A.
7 pages
Unit 1 Basics of An Algorithm and Its Properties: Muhammad Ibn Musa Al-Khowarizmi.
No ratings yet
Unit 1 Basics of An Algorithm and Its Properties: Muhammad Ibn Musa Al-Khowarizmi.
86 pages
Image Processing
No ratings yet
Image Processing
45 pages
ECO 4+ User Manual v1.4
No ratings yet
ECO 4+ User Manual v1.4
21 pages
BD PPT Word
No ratings yet
BD PPT Word
3 pages
AI Units
No ratings yet
AI Units
23 pages
Education - Presmat
No ratings yet
Education - Presmat
28 pages
Whitepaper
No ratings yet
Whitepaper
20 pages
Numberingsystem
No ratings yet
Numberingsystem
7 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

02 - Decision Tree Classification On Iris Dataset

Uploaded by

02 - Decision Tree Classification On Iris Dataset

Uploaded by

Practical - 2

AIM :- Decision Tree Classification on iris

Loading iris.csv Dataset in Pandas Dataframe

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

Getting Information about data

RangeIndex: 150 entries, 0 to 149

Data columns (total 6 columns):

# Column Non-Null Count Dtype

--- ------ -------------- -----

0 Id 150 non-null int64

1 SepalLengthCm 150 non-null float64

2 SepalWidthCm 150 non-null float64

3 PetalLengthCm 150 non-null float64

4 PetalWidthCm 150 non-null float64

5 Species 150 non-null object

dtypes: float64(4), int64(1), object(1)

memory usage: 7.2+ KB

array([[5.1, 3.5, 1.4, 0.2],

[4.9, 3. , 1.4, 0.2],

[4.7, 3.2, 1.3, 0.2],

[4.6, 3.1, 1.5, 0.2],

[5. , 3.6, 1.4, 0.2]])

Name: Species, dtype: object

1 from sklearn.model_selection import train_test_split

1 SpeciesTree = DecisionTreeClassifier(criterion = 'entropy', max_depth = 4)

array(['Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa'], dtype=object)

Name: Species, dtype: object

1 from sklearn import metrics

DecisionTrees's Accuracy: 0.9777777777777777

Visualizing the Decision Tree

1 import matplotlib.pyplot as plt

[Text(0.5, 0.9, 'PetalLengthCm <= 2.45\nentropy = 1.585\nsamples = 150\nvalu

Text(0.4230769230769231, 0.7, 'entropy = 0.0\nsamples = 50\nvalue = [50, 0,

Text(0.5769230769230769, 0.7, 'PetalWidthCm <= 1.75\nentropy = 1.0\nsamples

Text(0.3076923076923077, 0.5, 'PetalLengthCm <= 4.95\nentropy = 0.445\nsamp

Text(0.15384615384615385, 0.3, 'PetalWidthCm <= 1.65\nentropy = 0.146\nsamp

Text(0.07692307692307693, 0.1, 'entropy = 0.0\nsamples = 47\nvalue = [0, 4

Text(0.23076923076923078, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 0,

Text(0.46153846153846156, 0.3, 'PetalWidthCm <= 1.55\nentropy = 0.918\nsamp

Text(0.38461538461538464, 0.1, 'entropy = 0.0\nsamples = 3\nvalue = [0, 0,

Text(0.5384615384615384, 0.1, 'entropy = 0.918\nsamples = 3\nvalue = [0, 2,

Text(0.8461538461538461, 0.5, 'PetalLengthCm <= 4.85\nentropy = 0.151\nsamp

Text(0.7692307692307693, 0.3, 'SepalLengthCm <= 5.95\nentropy = 0.918\nsamp

Text(0.6923076923076923, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1,

Text(0.8461538461538461, 0.1, 'entropy = 0.0\nsamples = 2\nvalue = [0, 0,

Text(0.9230769230769231, 0.3, 'entropy = 0.0\nsamples = 43\nvalue = [0, 0,

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.