Naive Bayes Classifier

The document discusses the Naive Bayes classification algorithm. It begins by defining Naive Bayes as a supervised learning algorithm based on Bayes' theorem and used for classification problems. It then explains that Naive Bayes is a simple and effective classifier that makes predictions based on probability, with spam filtering and sentiment analysis as example uses. Next it explains the name: "Naive" because the algorithm assumes feature independence, and "Bayes" because it relies on Bayes' theorem for conditional probability. A worked example shows how Naive Bayes decides whether to "play" based on weather conditions. The document concludes with the advantages, disadvantages, and applications of Naive Bayes.

Uploaded by

Vijayalakshmi Govindarajalu


Naïve Bayes Classifier Algorithm

Naïve Bayes is a supervised learning algorithm that is based on Bayes' theorem and used for solving classification problems.

It is mainly used in text classification, where the training dataset is typically high-dimensional.

The Naïve Bayes classifier is a simple and effective classification algorithm that helps in building fast machine learning models that can make quick predictions.

It is a probabilistic classifier, which means it predicts on the basis of the probability that an object belongs to each class.

Some popular applications of the Naïve Bayes algorithm are spam filtering, sentiment analysis, and classifying articles.

Why is it called Naïve Bayes?

The name Naïve Bayes is made up of two words, Naïve and Bayes, which can be described as follows:

Naïve: It is called naïve because it assumes that, given the class, the occurrence of a certain feature is independent of the occurrence of the other features. For example, if a fruit is identified on the basis of color, shape, and taste, then a red, spherical, and sweet fruit is recognized as an apple. Each feature contributes individually to identifying the fruit as an apple, without depending on the others.

Bayes: It is called Bayes because it depends on the principle of Bayes' theorem.

Bayes' Theorem:
Bayes' theorem, also known as Bayes' rule or Bayes' law, is used to determine the probability of a hypothesis given prior knowledge. It depends on conditional probability.

The formula for Bayes' theorem is given as:

P(A|B) = P(B|A) * P(A) / P(B)

Where,

P(A|B) is the posterior probability: the probability of hypothesis A given the observed event B.

P(B|A) is the likelihood: the probability of the evidence B given that hypothesis A is true.

P(A) is the prior probability: the probability of the hypothesis before observing the evidence.

P(B) is the marginal probability: the probability of the evidence.
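
To make the four terms concrete, here is a minimal Python sketch of the theorem. The function names and the spam-related numbers are hypothetical and chosen only for illustration; the marginal P(B) is obtained by summing P(B|A_i) * P(A_i) over the competing hypotheses.

```python
def posterior(likelihood, prior, evidence):
    """Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)."""
    if evidence <= 0:
        raise ValueError("P(B) must be positive")
    return likelihood * prior / evidence


def evidence_from_hypotheses(likelihoods, priors):
    """Marginal P(B) as the sum of P(B|A_i) * P(A_i) over all hypotheses A_i."""
    return sum(l * p for l, p in zip(likelihoods, priors))


# Hypothetical numbers: A = "email is spam", B = "email contains the word 'offer'".
p_spam = 0.2                # prior P(A)
p_word_given_spam = 0.5     # likelihood P(B|A)
p_word_given_ham = 0.05     # likelihood under the alternative hypothesis

p_word = evidence_from_hypotheses(
    [p_word_given_spam, p_word_given_ham],
    [p_spam, 1 - p_spam],
)
p_spam_given_word = posterior(p_word_given_spam, p_spam, p_word)
print(round(p_spam_given_word, 3))
```

With these made-up inputs, seeing the word raises the probability of spam from the prior of 0.2 to roughly 0.71.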

Working of Naïve Bayes' Classifier:
The working of the Naïve Bayes classifier can be understood with the help of the following example:

Suppose we have a dataset of weather conditions and a corresponding target variable "Play". Using this dataset, we need to decide whether or not to play on a particular day according to the weather conditions. To solve this problem, we follow the steps below:

1. Convert the given dataset into frequency tables.

2. Generate the likelihood table by finding the probabilities of the given features.

3. Use Bayes' theorem to calculate the posterior probability.

Problem: If the weather is sunny, should the player play or not?

Solution: To solve this, first consider the dataset below:

      Outlook     Play
0     Rainy       Yes
1     Sunny       Yes
2     Overcast    Yes
3     Overcast    Yes
4     Sunny       No
5     Rainy       Yes
6     Sunny       Yes
7     Overcast    Yes
8     Rainy       No
9     Sunny       No
10    Sunny       Yes
11    Rainy       No
12    Overcast    Yes
13    Overcast    Yes

Frequency table for the weather conditions:

Weather     Yes    No
Overcast    5      0
Rainy       2      2
Sunny       3      2
Total       10     4
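
The frequency table can be rebuilt directly from the 14 rows of the dataset. The following is a minimal sketch using only the Python standard library; the variable names are illustrative. Counting the rows gives 10 "Yes" and 4 "No" in total.

```python
from collections import Counter

# The 14 (Outlook, Play) rows of the weather dataset, in order.
data = [
    ("Rainy", "Yes"), ("Sunny", "Yes"), ("Overcast", "Yes"), ("Overcast", "Yes"),
    ("Sunny", "No"), ("Rainy", "Yes"), ("Sunny", "Yes"), ("Overcast", "Yes"),
    ("Rainy", "No"), ("Sunny", "No"), ("Sunny", "Yes"), ("Rainy", "No"),
    ("Overcast", "Yes"), ("Overcast", "Yes"),
]

pair_counts = Counter(data)                         # counts per (outlook, play) pair
class_counts = Counter(play for _, play in data)    # counts per class

freq_table = {
    outlook: {"Yes": pair_counts[(outlook, "Yes")], "No": pair_counts[(outlook, "No")]}
    for outlook in ("Overcast", "Rainy", "Sunny")
}

for outlook, row in freq_table.items():
    print(f"{outlook:<9}{row['Yes']:>4}{row['No']:>4}")
print(f"{'Total':<9}{class_counts['Yes']:>4}{class_counts['No']:>4}")
```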

Likelihood table for the weather conditions:

Weather     No             Yes             P(Weather)
Overcast    0              5               5/14 ≈ 0.36
Rainy       2              2               4/14 ≈ 0.29
Sunny       2              3               5/14 ≈ 0.36
All         4/14 ≈ 0.29    10/14 ≈ 0.71

Applying Bayes' theorem:

P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny)

P(Sunny|Yes) = 3/10 = 0.3

P(Sunny) = 5/14 ≈ 0.36

P(Yes) = 10/14 ≈ 0.71

So P(Yes|Sunny) = (3/10) * (10/14) / (5/14) = 3/5 = 0.60

P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny)

P(Sunny|No) = 2/4 = 0.5

P(No) = 4/14 ≈ 0.29

P(Sunny) = 5/14 ≈ 0.36

So P(No|Sunny) = (2/4) * (4/14) / (5/14) = 2/5 = 0.40

The two posteriors sum to 1, as they should. Working with the exact fractions avoids the small rounding error that the two-decimal values would introduce.

As the calculation shows, P(Yes|Sunny) > P(No|Sunny).

Hence, on a sunny day, the player can play the game.
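
The same calculation can be checked in a few lines of Python. This is a sketch rather than a full classifier: it handles the single Outlook feature only, the helper name posterior is hypothetical, and it uses exact fractions so that no rounding enters the result.

```python
from fractions import Fraction

# The 14 (Outlook, Play) rows of the weather dataset, in order.
data = [
    ("Rainy", "Yes"), ("Sunny", "Yes"), ("Overcast", "Yes"), ("Overcast", "Yes"),
    ("Sunny", "No"), ("Rainy", "Yes"), ("Sunny", "Yes"), ("Overcast", "Yes"),
    ("Rainy", "No"), ("Sunny", "No"), ("Sunny", "Yes"), ("Rainy", "No"),
    ("Overcast", "Yes"), ("Overcast", "Yes"),
]


def posterior(outlook, play, rows):
    """P(play | outlook) = P(outlook | play) * P(play) / P(outlook)."""
    n = len(rows)
    class_count = sum(1 for _, p in rows if p == play)
    outlook_count = sum(1 for o, _ in rows if o == outlook)
    joint_count = sum(1 for o, p in rows if o == outlook and p == play)

    likelihood = Fraction(joint_count, class_count)   # P(outlook | play)
    prior = Fraction(class_count, n)                  # P(play)
    evidence = Fraction(outlook_count, n)             # P(outlook)
    return likelihood * prior / evidence


p_yes = posterior("Sunny", "Yes", data)
p_no = posterior("Sunny", "No", data)
prediction = "Yes" if p_yes > p_no else "No"

print(p_yes, p_no)   # 3/5 2/5
print(prediction)    # Yes
```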

Advantages of Naïve Bayes Classifier:

Naïve Bayes is a fast and easy ML algorithm for predicting the class of a data point.

It can be used for binary as well as multi-class classification.

It often performs well in multi-class prediction compared with other algorithms.

It is a popular choice for text classification problems.

Disadvantages of Naïve Bayes Classifier:

Naïve Bayes assumes that all features are independent or unrelated, so it cannot learn the relationships between features.

Applications of Naïve Bayes Classifier:

It is used for credit scoring.

It is used in medical data classification.

It can be used for real-time prediction because the Naïve Bayes classifier is an eager learner.

It is used in text classification tasks such as spam filtering and sentiment analysis.

Types of Naïve Bayes Model:

There are three types of Naïve Bayes model, which are given below:

Gaussian: The Gaussian model assumes that features follow a normal distribution. This means that if the predictors take continuous values instead of discrete ones, the model assumes that these values are sampled from a Gaussian distribution.

Multinomial: The Multinomial Naïve Bayes classifier is used when the data is multinomially distributed. It is primarily used for document classification problems, that is, deciding which category (such as sports, politics, or education) a particular document belongs to. The classifier uses the frequency of words as the predictors.

Bernoulli: The Bernoulli classifier works similarly to the Multinomial classifier, but the predictor variables are independent Boolean variables, such as whether or not a particular word is present in a document. This model is also widely used for document classification tasks.
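
The difference between the Multinomial and Bernoulli variants is easiest to see in the features they consume. The sketch below turns one document into word counts (the Multinomial view) and into word-presence flags (the Bernoulli view). The vocabulary, the document, and the function names are hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical vocabulary and document, for illustration only.
vocabulary = ["match", "goal", "election", "vote"]
document = "goal goal match vote"


def multinomial_features(text, vocab):
    """Word frequencies: how many times each vocabulary word occurs."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocab]


def bernoulli_features(text, vocab):
    """Boolean flags: 1 if the vocabulary word occurs at all, else 0."""
    present = set(text.lower().split())
    return [1 if word in present else 0 for word in vocab]


print(multinomial_features(document, vocabulary))  # [1, 2, 0, 1]
print(bernoulli_features(document, vocabulary))    # [1, 1, 0, 1]
```

The word "goal" appears twice, which the count vector records and the Boolean vector deliberately ignores.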

Python Implementation of the Naïve Bayes algorithm:

We will now implement the Naive Bayes algorithm using Python. For this, we will use the "user_data" dataset, which we have used in our other classification models, so we can easily compare the Naive Bayes model with the other models.

Steps to implement:

Data pre-processing step

Fitting Naive Bayes to the training set

Predicting the test result

Testing the accuracy of the result (creation of the confusion matrix)

Visualizing the test set result

1) Data Pre-processing step:

In this step, we will pre-process/prepare the data so that we can use it efficiently in our code. It is similar to what we did in data pre-processing. The code for this is given below:

# Importing the libraries
import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd

# Importing the dataset
dataset = pd.read_csv('user_data.csv')
x = dataset.iloc[:, [2, 3]].values  # feature columns at positions 2 and 3
y = dataset.iloc[:, 4].values       # target column at position 4

# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size = 0.25, random_state = 0)

# Feature Scaling (fit on the training set only, then apply to the test set)
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
x_train = sc.fit_transform(x_train)
x_test = sc.transform(x_test)

In the above code, we have loaded the dataset into our program using "dataset = pd.read_csv('user_data.csv')". The loaded dataset is divided into training and test sets, and then the feature variables are scaled.
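
To show what the scaling step does, here is a small standard-library sketch of standardization: the mean and the population standard deviation are learned from the training column only and then reused unchanged on the test column, which mirrors the fit_transform / transform split above. The function names and the age values are hypothetical.

```python
from statistics import mean, pstdev


def fit_scaler(column):
    """Learn the mean and population standard deviation from training values."""
    return mean(column), pstdev(column)


def transform(column, mu, sigma):
    """Standardize values with previously learned statistics."""
    return [(value - mu) / sigma for value in column]


# Hypothetical ages, for illustration only.
train_age = [20.0, 30.0, 40.0, 50.0]
test_age = [35.0, 60.0]

mu, sigma = fit_scaler(train_age)                 # statistics come from training data only
train_scaled = transform(train_age, mu, sigma)
test_scaled = transform(test_age, mu, sigma)      # test data reuses the same mu and sigma

print(mu, round(sigma, 3))
print([round(v, 3) for v in train_scaled])
print([round(v, 3) for v in test_scaled])
```

Reusing the training statistics on the test set keeps information about the test data from leaking into the model.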

The output for the dataset is given as:

2) Fitting Naive Bayes to the Training Set:

After the pre-processing step, we will now fit the Naive Bayes model to the training set. Below is the code for it:

# Fitting Naive Bayes to the Training set
from sklearn.naive_bayes import GaussianNB
classifier = GaussianNB()
classifier.fit(x_train, y_train)

In the above code, we have used the GaussianNB classifier and fitted it to the training dataset. Other Naive Bayes classifiers can also be used, depending on the requirement.

Output:

Out[6]: GaussianNB(priors=None, var_smoothing=1e-09)
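
As a rough illustration of what fitting a Gaussian Naive Bayes model involves, the sketch below estimates a prior plus a per-feature mean and variance for each class, then predicts by comparing log prior plus summed log Gaussian likelihoods. It is a simplified, from-scratch sketch and not scikit-learn's implementation: the function names, the toy data, and the fixed eps added to each variance are assumptions (scikit-learn scales its var_smoothing term by the largest feature variance instead).

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, eps=1e-9):
    """Estimate (prior, means, variances) for each class label."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)

    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # eps keeps the variance strictly positive (simplifying assumption).
        variances = [
            sum((v - m) ** 2 for v in col) / n + eps
            for col, m in zip(columns, means)
        ]
        model[label] = (n / len(X), means, variances)
    return model


def predict_one(model, row):
    """Return the class with the highest log posterior (up to a constant)."""
    best_label, best_score = None, -math.inf
    for label, (prior, means, variances) in model.items():
        score = math.log(prior)
        for v, m, var in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical scaled (age, salary) rows with labels 0 and 1.
X_toy = [[-1.0, -0.8], [-0.9, -1.1], [-1.2, -0.7], [1.0, 0.9], [0.8, 1.2], [1.1, 1.0]]
y_toy = [0, 0, 0, 1, 1, 1]

model = fit_gaussian_nb(X_toy, y_toy)
print(predict_one(model, [-1.0, -1.0]), predict_one(model, [0.9, 1.1]))
```

Summing log likelihoods over features is where the naive independence assumption enters: each feature contributes its own term, with no interaction between features.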

3) Prediction of the test set result:

Now we will predict the test set result. For this, we will create a new predictor variable y_pred and use the predict function to make the predictions.

# Predicting the Test set results
y_pred = classifier.predict(x_test)

Output:

The above output shows the prediction vector y_pred and the real vector y_test. We can see that some predictions differ from the real values; these are the incorrect predictions.
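
Comparing the two vectors element by element is enough to locate the incorrect predictions. The short sketch below does this on two made-up vectors; the names y_test_toy and y_pred_toy and their values are hypothetical and are not the tutorial's actual output.

```python
# Hypothetical ground-truth and prediction vectors, for illustration only.
y_test_toy = [0, 0, 1, 1, 0, 1, 0, 1]
y_pred_toy = [0, 1, 1, 1, 0, 0, 0, 1]

# Indices where the prediction disagrees with the real value.
mismatches = [
    i for i, (truth, pred) in enumerate(zip(y_test_toy, y_pred_toy))
    if truth != pred
]

print(mismatches)  # [1, 5]
print(len(mismatches), "of", len(y_test_toy), "predictions are incorrect")
```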

4) Creating Confusion Matrix:

Now we will check the accuracy of the Naive Bayes classifier using the confusion matrix. Below is the code for it:

# Making the Confusion Matrix
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_pred)

Output:

As we can see in the above confusion matrix output, there are 7+3 = 10 incorrect predictions and 65+25 = 90 correct predictions, that is, an accuracy of 90/100 = 90% on the test set.
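
The counts of correct and incorrect predictions can be read off a confusion matrix programmatically: the diagonal holds the correct predictions and everything else is an error. The sketch below uses the four counts quoted in the text; which off-diagonal cell holds 7 and which holds 3 is an assumption, since the matrix layout is not reproduced here, but it does not affect the totals.

```python
# Counts from the text: 65 and 25 correct, 7 and 3 incorrect.
# The placement of 7 and 3 in the off-diagonal cells is assumed.
cm = [[65, 7],
      [3, 25]]

correct = sum(cm[i][i] for i in range(len(cm)))   # diagonal entries
total = sum(sum(row) for row in cm)               # all test samples
incorrect = total - correct
accuracy = correct / total

print(correct, incorrect, accuracy)  # 90 10 0.9
```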

5) Visualizing the training set result:

Next, we will visualize the training set result using the Naïve Bayes classifier. Below is the code for it:

# Visualising the Training set results
from matplotlib.colors import ListedColormap
x_set, y_set = x_train, y_train
# Build a dense grid over the scaled feature space.
# The step of the first axis is cut off in the source; 0.01 is assumed to match the second axis.
X1, X2 = nm.meshgrid(nm.arange(start = x_set[:, 0].min() - 1, stop = x_set[:, 0].max() + 1, step = 0.01),
                     nm.arange(start = x_set[:, 1].min() - 1, stop = x_set[:, 1].max() + 1, step = 0.01))
# Colour every grid point by its predicted class to show the decision regions.
mtp.contourf(X1, X2, classifier.predict(nm.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha = 0.75, cmap = ListedColormap(('purple', 'green')))
mtp.xlim(X1.min(), X1.max())
mtp.ylim(X2.min(), X2.max())
# Overlay the actual data points, one colour per class.
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                color = ListedColormap(('purple', 'green'))(i), label = j)
mtp.title('Naive Bayes (Training set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()

Output:

In the above output, we can see that the Naïve Bayes classifier has separated the data points with a smooth, curved boundary. The boundary is curved rather than straight because the GaussianNB classifier used in our code models each feature with a Gaussian distribution per class.

6) Visualizing the Test set result:

# Visualising the Test set results
from matplotlib.colors import ListedColormap
x_set, y_set = x_test, y_test
# Build a dense grid over the scaled feature space.
# The step of the first axis is cut off in the source; 0.01 is assumed to match the second axis.
X1, X2 = nm.meshgrid(nm.arange(start = x_set[:, 0].min() - 1, stop = x_set[:, 0].max() + 1, step = 0.01),
                     nm.arange(start = x_set[:, 1].min() - 1, stop = x_set[:, 1].max() + 1, step = 0.01))
# Colour every grid point by its predicted class to show the decision regions.
mtp.contourf(X1, X2, classifier.predict(nm.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha = 0.75, cmap = ListedColormap(('purple', 'green')))
mtp.xlim(X1.min(), X1.max())
mtp.ylim(X2.min(), X2.max())
# Overlay the actual data points, one colour per class.
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                color = ListedColormap(('purple', 'green'))(i), label = j)
mtp.title('Naive Bayes (test set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()

Output:

The above output is the final output for the test set data. As we can see, the classifier has created a curved boundary to divide the "purchased" and "not purchased" classes. There are some wrong predictions, which we counted in the confusion matrix, but it is still a pretty good classifier.
