Pgm5 With Output
Consider a fictional dataset that describes the weather conditions for playing a game of golf. Given
the weather conditions, each tuple classifies the conditions as fit ("Yes") or unfit ("No") for playing
golf.
SL.NO. | OUTLOOK | TEMPERATURE | HUMIDITY | WINDY | PLAY GOLF
[Table: rows of the weather dataset, omitted in this copy]
The dataset is divided into two parts, namely, the feature matrix and the response vector.
The feature matrix contains all the vectors (rows) of the dataset, in which each vector consists of the
values of the independent features. In the above dataset, the features are 'Outlook', 'Temperature',
'Humidity' and 'Windy'.
The response vector contains the value of the class variable (prediction or output) for each row of the
feature matrix. In the above dataset, the class variable name is 'Play golf'.
Assumption:
The fundamental Naive Bayes assumption is that each feature makes an independent and equal
contribution to the outcome.
We assume that no pair of features is dependent. For example, the temperature being 'Hot'
has nothing to do with the humidity, and the outlook being 'Rainy' has no effect on the wind.
Hence, the features are assumed to be independent.
Secondly, each feature is given the same weight (or importance). For example, knowing the
temperature and humidity alone cannot predict the outcome accurately. None of the attributes is
irrelevant, and each is assumed to contribute equally to the outcome.
Note: The assumptions made by Naive Bayes are not generally correct in real-world situations. In
fact, the independence assumption is never exactly correct, but it often works well in practice.
Now, before moving to the formula for Naive Bayes, it is important to know about Bayes’ theorem.
Bayes’ Theorem
Bayes’ Theorem finds the probability of an event occurring given the probability of another event
that has already occurred. Bayes’ theorem is stated mathematically as the following equation:
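P(A | B) = P(B | A) * P(A) / P(B)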
Basically, we are trying to find the probability of event A, given that event B is true. Event B is
also termed the evidence.
P(A) is the prior probability of A (i.e. the probability of the event before the evidence is
seen). The evidence is an attribute value of an unknown instance (here, it is event B).
P(A | B) is the posterior probability of A, i.e. the probability of the event after the evidence is seen.
P(B | A) is the likelihood, i.e. the probability of the evidence given that event A has occurred.
Now, with regard to our dataset, we can apply Bayes' theorem in the following way:
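P(y | X) = P(X | y) * P(y) / P(X)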
where y is the class variable and X is the feature vector (of size n), with:
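X = (x1, x2, x3, ..., xn)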
Just to be clear, an example of a feature vector and its corresponding class variable is (refer to the 1st
row of the dataset):
X = (Rainy, Hot, High, False) and y = No
So basically, P(y | X) here means the probability of "not playing golf" given that the weather
conditions are "Rainy outlook", "Temperature is hot", "High humidity" and "No wind".
Naive assumption
Now, it is time to put the naive assumption into Bayes' theorem, which is independence among the
features. So now, we split the evidence into its independent parts.
Now, if any two events A and B are independent, then,
P(A,B) = P(A)P(B)
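Hence, applying this to the feature vector, we reach the result:
P(y | x1, ..., xn) = [ P(x1 | y) * P(x2 | y) * ... * P(xn | y) * P(y) ] / [ P(x1) * P(x2) * ... * P(xn) ]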
Now, as the denominator remains constant for a given input, we can remove that term:
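P(y | x1, ..., xn) ∝ P(y) * P(x1 | y) * P(x2 | y) * ... * P(xn | y)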
Now, we need to create a classifier model. For this, we find the probability of the given set of inputs for
all possible values of the class variable y and pick the output with the maximum probability. This
can be expressed mathematically as:
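y = argmax over y of P(y) * P(x1 | y) * P(x2 | y) * ... * P(xn | y)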
So, finally, we are left with the task of calculating P(y) and P(xi | y).
Please note that P(y) is also called class probability and P(xi | y) is called conditional probability.
The different naive Bayes classifiers differ mainly by the assumptions they make regarding the
distribution of P(xi | y).
Let us try to apply the above formula manually on our weather dataset. For this, we need to do some
precomputations on our dataset.
We need to find P(xi | yj) for each xi in X and yj in y. All these calculations have been demonstrated
in the tables below:
So, in the tables above (tables 1-4), we have calculated P(xi | yj) for each xi in X and yj in y manually.
For example, the probability of playing golf given that the temperature is cool, i.e. P(temp = Cool |
play golf = Yes), is 3/9.
Also, we need to find the class probabilities P(y), which have been calculated in table 5. For
example, P(play golf = Yes) = 9/14.
So now, we are done with our pre-computations and the classifier is ready!
Let us test it on a new set of weather conditions (call it 'today'). We need P(Yes | today) and
P(No | today). Since P(today) is common to both probabilities, we can ignore it and find proportional
probabilities as:
P(Yes | today) ∝ P(Outlook | Yes) * P(Temperature | Yes) * P(Humidity | Yes) * P(Windy | Yes) * P(Yes)
and
P(No | today) ∝ P(Outlook | No) * P(Temperature | No) * P(Humidity | No) * P(Windy | No) * P(No)
where each conditional probability is read off tables 1-4 for today's feature values and the class
probabilities come from table 5.
These two numbers can be converted into probabilities by making their sum equal to 1 (normalization):
P(Yes | today) = proportional(Yes) / (proportional(Yes) + proportional(No))
and
P(No | today) = proportional(No) / (proportional(Yes) + proportional(No))
Since we predict the class with the larger probability, the final prediction is whichever of 'Yes' or
'No' ends up with the higher normalized value.
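To make the procedure above concrete, here is a small Python sketch of the manual calculation. The
function name naive_bayes_predict and all table values except P(Temperature = Cool | Yes) = 3/9 and
P(Yes) = 9/14 (quoted above) are illustrative placeholders, not values taken from the dataset's tables.

def naive_bayes_predict(likelihoods, class_probs, instance):
    # proportional probability for each class: P(y) * product of P(xi | y)
    proportional = {}
    for label, prior in class_probs.items():
        p = prior
        for feature, value in instance.items():
            p *= likelihoods[feature][value][label]
        proportional[label] = p
    # normalize so that the probabilities sum to 1
    total = sum(proportional.values())
    return {label: p / total for label, p in proportional.items()}

# placeholder likelihood tables (hypothetical numbers, for illustration only)
likelihoods = {
    'Outlook':     {'Sunny':  {'Yes': 2/9, 'No': 3/5}},
    'Temperature': {'Cool':   {'Yes': 3/9, 'No': 1/5}},
    'Humidity':    {'Normal': {'Yes': 6/9, 'No': 1/5}},
    'Windy':       {False:    {'Yes': 6/9, 'No': 2/5}},
}
class_probs = {'Yes': 9/14, 'No': 5/14}

today = {'Outlook': 'Sunny', 'Temperature': 'Cool', 'Humidity': 'Normal', 'Windy': False}
print(naive_bayes_predict(likelihoods, class_probs, today))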
The method that we discussed above is applicable to discrete data. In the case of continuous data, we
need to make some assumptions regarding the distribution of the values of each feature. The different
naive Bayes classifiers differ mainly by the assumptions they make regarding the distribution of
P(xi | y).
Now, we discuss one such classifier here: Gaussian Naive Bayes.
The likelihood of the features is assumed to be Gaussian; hence, the conditional probability is given by:
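P(xi | y) = (1 / sqrt(2π σ²_y)) * exp( -(xi - μ_y)² / (2 σ²_y) )
where μ_y and σ²_y are the mean and variance of the feature xi computed for class y.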
When dealing with continuous data, a typical assumption is that the continuous values associated
with each class are distributed according to a Gaussian distribution. For example, suppose the
training data contains a continuous attribute, x. We first segment the data by the class, and then
compute the mean and variance of x in each class. Let μ_k be the mean of the values in x associated
with class Ck, and let σ²_k be the variance of the values in x associated with class Ck. Suppose we
have collected some observation value v. Then, the probability distribution of v given a class Ck,
p(x = v | Ck), can be computed by plugging v into the equation for a Normal distribution parameterized
by μ_k and σ²_k. That is:
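p(x = v | Ck) = (1 / sqrt(2π σ²_k)) * exp( -(v - μ_k)² / (2 σ²_k) )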
The above method is adopted in our implementation of the program.
Pima Indian Diabetes Dataset
This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The
objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on
certain diagnostic measurements included in the dataset.
APPLICATION AREAS:
Real-time Prediction: Naive Bayes is an eager learning classifier and it is fast. Thus, it
can be used for making predictions in real time.
Multi-class Prediction: This algorithm is also well known for its multi-class prediction
capability. Here we can predict the probability of multiple classes of the target variable.
Text Classification / Spam Filtering / Sentiment Analysis: Naive Bayes classifiers are mostly
used in text classification (due to their good results in multi-class problems and the independence
assumption) and have a higher success rate compared to other algorithms. As a result, they are widely
used in spam filtering (identifying spam e-mail) and sentiment analysis (in social media analysis,
to identify positive and negative customer sentiments).
Recommendation Systems: A Naive Bayes classifier and collaborative filtering together
build a recommendation system that uses machine learning and data mining techniques to
filter unseen information and predict whether a user would like a given resource or not.
zip:
The zip() function returns a zip object, which is an iterator of tuples where the first
item of each passed iterator is paired together, then the second item of each
passed iterator is paired together, and so on.
If the passed iterators have different lengths, the iterator with the fewest items decides
the length of the new iterator.
Syntax
zip(iterator1, iterator2, iterator3 ...)
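For instance, a call like the following (the variable names a and b are only illustrative) produces the
output shown below:

a = ('John', 'Charles', 'Mike')
b = ('Jenny', 'Christy', 'Monica')
print(tuple(zip(a, b)))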
output:
(('John', 'Jenny'), ('Charles', 'Christy'),
('Mike', 'Monica'))
Gaussian distribution:
import csv
import random
import math
def loadcsv(filename):
    lines = csv.reader(open(filename, "r"))
    dataset = list(lines)
    for i in range(len(dataset)):
        # converting strings into numbers for processing
        dataset[i] = [float(x) for x in dataset[i]]
    return dataset
def separatebyclass(dataset):
    separated = {}  # dictionary of classes 1 and 0
    # creates a dictionary of classes 1 and 0 where the values are
    # the instances belonging to each class
    for i in range(len(dataset)):
        vector = dataset[i]
        if vector[-1] not in separated:
            separated[vector[-1]] = []
        separated[vector[-1]].append(vector)
    return separated
def mean(numbers):
    return sum(numbers) / float(len(numbers))

def stdev(numbers):
    avg = mean(numbers)
    variance = sum([pow(x - avg, 2) for x in numbers]) / float(len(numbers) - 1)
    return math.sqrt(variance)
def summarizebyclass(dataset):
    separated = separatebyclass(dataset)
    # print(separated)
    summaries = {}
    for classvalue, instances in separated.items():
        # for key, value in dic.items()
        # summaries is a dic of tuples (mean, std) for each class value
        summaries[classvalue] = summarize(instances)  # summarize is used to calculate the mean and std
    return summaries
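# The functions below (splitdataset, summarize, calculateprobability,
# calculateclassprobabilities, predict, getpredictions, getaccuracy) are
# referenced above and in main() but are missing from this copy of the listing.
# This is a minimal sketch of them; the names of the helpers that main() does not
# call directly are assumed, following the naming style of the rest of the program.

def splitdataset(dataset, splitratio):
    # randomly split the dataset into a training set and a test set
    trainsize = int(len(dataset) * splitratio)
    trainset = []
    copy = list(dataset)
    while len(trainset) < trainsize:
        index = random.randrange(len(copy))
        trainset.append(copy.pop(index))
    return [trainset, copy]

def summarize(instances):
    # (mean, stdev) of every attribute column; zip(*instances) groups the data column-wise
    summaries = [(mean(attribute), stdev(attribute)) for attribute in zip(*instances)]
    del summaries[-1]  # drop the summary of the class column
    return summaries

def calculateprobability(x, mean_value, stdev_value):
    # Gaussian probability density of x for the given class mean and stdev
    exponent = math.exp(-(math.pow(x - mean_value, 2) / (2 * math.pow(stdev_value, 2))))
    return (1 / (math.sqrt(2 * math.pi) * stdev_value)) * exponent

def calculateclassprobabilities(summaries, inputvector):
    # multiply the Gaussian likelihoods of each attribute for every class
    probabilities = {}
    for classvalue, classsummaries in summaries.items():
        probabilities[classvalue] = 1
        for i in range(len(classsummaries)):
            mean_value, stdev_value = classsummaries[i]
            x = inputvector[i]
            probabilities[classvalue] *= calculateprobability(x, mean_value, stdev_value)
    return probabilities

def predict(summaries, inputvector):
    # pick the class with the highest probability
    probabilities = calculateclassprobabilities(summaries, inputvector)
    bestlabel, bestprob = None, -1
    for classvalue, probability in probabilities.items():
        if bestlabel is None or probability > bestprob:
            bestprob = probability
            bestlabel = classvalue
    return bestlabel

def getpredictions(summaries, testset):
    # predict the class of every instance in the test set
    predictions = []
    for i in range(len(testset)):
        predictions.append(predict(summaries, testset[i]))
    return predictions

def getaccuracy(testset, predictions):
    # percentage of test instances whose class was predicted correctly
    correct = 0
    for i in range(len(testset)):
        if testset[i][-1] == predictions[i]:
            correct += 1
    return (correct / float(len(testset))) * 100.0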
def main():
    filename = 'naivedata.csv'
    splitratio = 0.67
    dataset = loadcsv(filename)
    # split the data into training and test sets
    trainingset, testset = splitdataset(dataset, splitratio)
    print('Split {0} rows into train={1} and test={2} rows'.format(len(dataset), len(trainingset), len(testset)))
    # prepare model
    summaries = summarizebyclass(trainingset)
    # print(summaries)
    # test model
    predictions = getpredictions(summaries, testset)  # predictions for the test data using the trained model
    accuracy = getaccuracy(testset, predictions)
    print('Accuracy of the classifier is : {0}%'.format(accuracy))

main()
output:
1. Split 768 rows into train=514 and test=254 rows
Accuracy of the classifier is : 73.62204724409449%
2. Split 768 rows into train=514 and test=254 rows
Accuracy of the classifier is : 75.19685039370079%