
MENTORNESS ARTICLE

TASK 1

MIP-ML-08 BATCH

Evaluation Metrics in Machine Learning: Exploring Performance Assessment

BY NIGAR SULTANA

What Are Evaluation Metrics in Machine Learning?

Evaluation metrics in Machine Learning (ML) are quantitative measures of how well ML models work. They capture, in numbers, how effectively a model handles data it has not seen during training, and they help decide which model is best for a task. Common examples include accuracy, precision, and recall. The goal is to verify that a model has genuinely learned from its training data and can make accurate predictions on new inputs. Evaluation metrics also support choosing the best model from many candidates and reveal where a model needs to improve, so ML practitioners can adjust it until it performs reliably in real-world situations.

Why Do We Need Evaluation Metrics in ML?

Evaluation metrics are crucial in ML for several reasons:

• They tell us whether a model's predictions or classifications are effective and accurate.
• They guide model selection by letting us compare the performance of several candidates.
• They inform hyperparameter tuning, showing whether a change in settings actually improves performance.
• They provide a foundation for continuously improving and fine-tuning ML algorithms.
• They allow objective, score-based comparison of different models.
• They support informed decisions about which model to deploy in a given situation.
• They highlight where a model needs improvement or adjustment.
• They verify that a model meets the desired performance standards and delivers reliable results.

In short, evaluation metrics are key tools for assessing, improving, and optimizing ML models across applications.

Types of Evaluation Metrics

There are various types of evaluation metrics used in ML, including:

1. Regression Model Evaluation Metrics: These assess how well regression models predict numerical outcomes. They include:

• Mean Absolute Error (MAE): The average absolute difference between predicted and actual values, giving a direct sense of how far off predictions are on average.
• Root Mean Squared Error (RMSE): Similar to MAE but penalizes larger errors more heavily, because errors are squared before averaging.
• R-squared (R2) Score: The proportion of the variance in the data explained by the model, indicating how well the model fits the data.

2. Classification Model Evaluation Metrics: These evaluate how accurately classification models assign data to categories. They include:

• Accuracy: The overall proportion of correct predictions.
• Precision: How many of the model's positive predictions were actually correct, focusing on the reliability of positive predictions.
• Recall (Sensitivity): How many of the actual positive instances the model correctly identified, emphasizing its ability to capture all positive cases.
• F1 Score: The harmonic mean of precision and recall, offering a single balanced measure that is especially useful under class imbalance.
• Confusion Matrix: A tabular summary of a classification model's performance, showing the counts of true positive, true negative, false positive, and false negative predictions.

Explaining Each Evaluation Metric in Detail

1. Mean Absolute Error (MAE):

• MAE measures the average magnitude of errors between predicted and actual values in regression models.
• It indicates how accurate the model's predictions are on average.
• It is calculated by averaging the absolute differences between predicted and actual values.

Formula for MAE:

MAE = \frac{1}{N} \sum_{j=1}^{N} \left| y_j - \hat{y}_j \right|

Where:

• y_j: ground-truth value
• \hat{y}_j: predicted value from the regression model
• N: number of data points

[Example Graph: Mean Absolute Error]
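To make the calculation concrete, here is a minimal sketch (not part of the original article) that computes MAE by hand with NumPy and cross-checks the result against scikit-learn's mean_absolute_error; the y_true and y_pred arrays are made-up example values.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error

# Made-up ground-truth and predicted values for illustration.
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# MAE by hand: average the absolute differences.
mae_manual = np.mean(np.abs(y_true - y_pred))

# The same metric computed by scikit-learn.
mae_sklearn = mean_absolute_error(y_true, y_pred)

print(mae_manual, mae_sklearn)  # both 0.5
```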

2. Root Mean Squared Error (RMSE):

• RMSE is similar to MAE but gives more weight to larger errors, making it sensitive to outliers.
• By penalizing significant errors, it gives a stricter assessment of model performance than MAE.
• It is calculated by taking the square root of the average of the squared differences between predicted and actual values.

Formula for RMSE:

RMSE = \sqrt{ \frac{1}{N} \sum_{j=1}^{N} \left( y_j - \hat{y}_j \right)^2 }

[Example Graph: Root Mean Squared Error]
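The same illustrative arrays can be reused for RMSE; squaring the errors before averaging is exactly what makes the one-unit error count for more here than it did in MAE. Again, this is a sketch with made-up values, using only standard NumPy and scikit-learn calls.

```python
import numpy as np
from sklearn.metrics import mean_squared_error

# Same made-up values as in the MAE example.
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# RMSE by hand: square the errors, average them, take the square root.
rmse_manual = np.sqrt(np.mean((y_true - y_pred) ** 2))

# Via scikit-learn: square root of the mean squared error.
rmse_sklearn = np.sqrt(mean_squared_error(y_true, y_pred))

print(rmse_manual, rmse_sklearn)  # ~0.612, larger than the MAE of 0.5
```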

3. R-squared (R2) Score:

• The R2 score quantifies the proportion of variance in the dependent variable that is explained by the independent variables in a regression model.
• It indicates the goodness of fit, i.e., how well the model fits the data.
• An R2 of 1 indicates a perfect fit, while an R2 of 0 means the model explains none of the variance (it does no better than always predicting the mean); the score can even be negative for models that fit worse than that baseline.

Formula for R2 Score:

R^2 = 1 - \frac{ \sum_{j=1}^{N} \left( y_j - \hat{y}_j \right)^2 }{ \sum_{j=1}^{N} \left( y_j - \bar{y} \right)^2 }

Where:

• \bar{y} is the mean of the actual values.

[Example Graph: R-Squared Score]
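As a sketch (with the same made-up arrays as above), R2 can be computed directly from this definition and checked against scikit-learn's r2_score.

```python
import numpy as np
from sklearn.metrics import r2_score

y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# R^2 by hand: one minus the ratio of residual to total sum of squares.
ss_res = np.sum((y_true - y_pred) ** 2)          # variance left unexplained
ss_tot = np.sum((y_true - y_true.mean()) ** 2)   # total variance around the mean
r2_manual = 1 - ss_res / ss_tot

print(r2_manual, r2_score(y_true, y_pred))  # ~0.949
```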

4. Accuracy:

• Accuracy is a simple, intuitive metric that measures the percentage of correct predictions made by a model.
• It is suitable for balanced datasets, where the positive and negative classes are similar in number.
• On imbalanced datasets, however, accuracy can be misleading: a model can score well simply by predicting the majority class while neglecting the minority class.
• This can lead to an inaccurate assessment of the model's performance, especially when the minority class is the one that matters most.

Formula for Accuracy:

Accuracy = \frac{TP + TN}{TP + TN + FP + FN}

Where:

• TP = True Positive
• TN = True Negative
• FP = False Positive
• FN = False Negative

[Example Graph: Accuracy]
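For the classification metrics, the sketches below all use the same small set of made-up binary labels (1 = positive, 0 = negative); none of this code comes from the original article. Accuracy first:

```python
from sklearn.metrics import accuracy_score

# Made-up labels for illustration (1 = positive, 0 = negative).
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Accuracy = (TP + TN) / total: 6 of the 8 predictions are correct.
print(accuracy_score(y_true, y_pred))  # 0.75
```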

5. Precision:

• Precision measures the proportion of correctly predicted positive instances out of all instances predicted as positive.
• It is particularly valuable when the cost of a false positive is high.
• For instance, in medical diagnosis, high precision means that patients flagged as having a disease usually do have it, keeping false positive cases low.

Formula for Precision:

Precision = \frac{TP}{TP + FP}

[Example Graph: Precision]
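With the same made-up labels as in the accuracy sketch, precision looks only at the instances the model predicted as positive:

```python
from sklearn.metrics import precision_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Precision = TP / (TP + FP): 3 of the 4 positive predictions are correct.
print(precision_score(y_true, y_pred))  # 0.75
```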

6. Recall:

• Recall (also known as Sensitivity) measures the proportion of correctly predicted positive instances out of all actual positive instances.
• It is crucial in scenarios where capturing every positive case is vital, even at the cost of some false alarms.
• For instance, in healthcare, high recall ensures the model rarely misses a patient who has the disease, even if some healthy individuals are flagged for further evaluation.

Formula for Recall:

Recall = \frac{TP}{TP + FN}

Where:

• TP = True Positive
• FN = False Negative

[Example Graph: Recall]
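Recall, sketched with the same made-up labels, instead asks how many of the actual positives were found:

```python
from sklearn.metrics import recall_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Recall = TP / (TP + FN): 3 of the 4 actual positives are identified.
print(recall_score(y_true, y_pred))  # 0.75
```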

7. F1 Score:

• The F1 score represents the harmonic mean of precision and recall.
• It offers a balanced assessment of a model's performance by taking both precision and recall into account.
• It favors models that keep precision and recall in balance over those that score high on one and low on the other.
• Because the harmonic mean is well suited to averaging ratios, the F1 score is especially valuable when precision and recall diverge, as often happens with imbalanced classes.

Formula for F1 Score:

F1 = 2 \times \frac{Precision \times Recall}{Precision + Recall}

[Example Graph: F1 Score]
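A minimal sketch with the same made-up labels; since precision and recall both happen to be 0.75 here, their harmonic mean is 0.75 as well:

```python
from sklearn.metrics import f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# F1 = 2 * (precision * recall) / (precision + recall).
print(f1_score(y_true, y_pred))  # 0.75
```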

8. Confusion Matrix:

• The confusion matrix is a tabular representation of true versus predicted classes in a classification problem.
• It displays the four possible combinations (true positives, true negatives, false positives, and false negatives), offering insight into the model's performance and where it needs improvement.

Layout of a Confusion Matrix:

                     Predicted Negative    Predicted Positive
Actual Negative      TN                    FP
Actual Positive      FN                    TP
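The full breakdown behind all of the classification scores above can be read off scikit-learn's confusion_matrix, again sketched with the same made-up labels:

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# scikit-learn orders the matrix as:
# [[TN, FP],
#  [FN, TP]]
print(confusion_matrix(y_true, y_pred))
# [[3 1]
#  [1 3]]
```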

Recap:

In conclusion, evaluation metrics are indispensable tools in ML for assessing model performance and guiding decision-making. Using both regression and classification metrics gives a comprehensive picture of a model's capabilities and its areas for improvement. The table below summarizes the key evaluation metrics discussed in this article, along with their descriptions and formulas:

Metric                    Description                                          Formula
------------------------  ---------------------------------------------------  -----------------------------------------------
Mean Absolute Error       Average magnitude of errors between predicted        MAE = (1/N) Σ |y_j − ŷ_j|
(MAE)                     and actual values in regression models.
Root Mean Squared         Like MAE but penalizes large errors more;            RMSE = √((1/N) Σ (y_j − ŷ_j)²)
Error (RMSE)              sensitive to outliers.
R-squared (R2) Score      Proportion of variance in the dependent variable     R² = 1 − Σ(y_j − ŷ_j)² / Σ(y_j − ȳ)²
                          explained by the independent variables.
Accuracy                  Proportion of correct predictions over               (TP + TN) / (TP + TN + FP + FN)
                          total predictions.
Precision                 Proportion of predicted positives that are           TP / (TP + FP)
                          actually positive.
Recall (Sensitivity)      Proportion of actual positives that are              TP / (TP + FN)
                          correctly predicted.
F1 Score                  Harmonic mean of precision and recall;               2 · (Precision · Recall) / (Precision + Recall)
                          a balanced performance measure.
Confusion Matrix          Table of true vs. predicted classes showing          rows: actual class; columns: predicted class
                          the counts of TP, TN, FP, and FN.

By leveraging evaluation metrics effectively, data scientists and ML practitioners can develop robust, accurate ML models that meet the desired performance standards and deliver reliable results in real-world applications.

Thank you.
