Support Vector Machine
Support Vector Machine, or SVM, is one of the most popular Supervised Learning algorithms; it is used for both Classification and Regression problems. However, it is primarily used for Classification problems in Machine Learning.
The goal of the SVM algorithm is to create the best line or decision
boundary that can segregate n-dimensional space into classes so that we
can easily put the new data point in the correct category in the future.
This best decision boundary is called a hyperplane.
The SVM algorithm can be used for face detection, image classification, text categorization, etc.
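As an illustrative sketch of what SVM classification looks like in practice (assuming scikit-learn and its bundled iris dataset; the parameter choices here are not from the original text):

    from sklearn import datasets
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC

    # Load a small labeled dataset (iris flowers: 3 classes, 4 features).
    X, y = datasets.load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # Fit a linear SVM classifier and score it on held-out data.
    clf = SVC(kernel="linear")
    clf.fit(X_train, y_train)
    print("Test accuracy:", clf.score(X_test, y_test))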
Types Of SVM
SVM can be of two types:
•Linear SVM: Linear SVM is used for linearly separable data. If a dataset can be classified into two classes by using a single straight line, such data is termed linearly separable data, and the classifier used is called a Linear SVM classifier.
•Non-linear SVM: Non-Linear SVM is used for non-linearly separable data. If a dataset cannot be classified by using a straight line, such data is termed non-linear data, and the classifier used is called a Non-linear SVM classifier (both types are compared in the sketch after this list).
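To make the distinction concrete, here is a small sketch assuming scikit-learn, with make_moons standing in for data that no single straight line can separate:

    from sklearn.datasets import make_moons
    from sklearn.svm import SVC

    # Two interleaving half-circles: not separable by one straight line.
    X, y = make_moons(n_samples=200, noise=0.1, random_state=0)

    # A linear SVM struggles here; an RBF-kernel (non-linear) SVM
    # can fit the curved boundary between the two classes.
    for kernel in ("linear", "rbf"):
        clf = SVC(kernel=kernel).fit(X, y)
        print(kernel, "training accuracy:", clf.score(X, y))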
Hyperplane and Support Vectors in the SVM algorithm:
Hyperplane: There can be multiple lines/decision boundaries to segregate the classes in n-dimensional space, but we need to find the best decision boundary that helps to classify the data points. This best boundary is known as the hyperplane of SVM.
The dimension of the hyperplane depends on the number of features present in the dataset: if there are 2 features (as shown in the image), the hyperplane will be a straight line, and if there are 3 features, the hyperplane will be a two-dimensional plane.
We always create the hyperplane that has the maximum margin, i.e. the maximum distance between the hyperplane and the nearest data points of either class.
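In standard SVM notation (added here for clarity, not part of the original text), a hyperplane in an n-dimensional feature space is the set of points x satisfying

    \[ w^{\top} x + b = 0, \]

where w is the weight (normal) vector and b is the bias. This set has dimension n - 1, which is why it is a straight line for 2 features and a plane for 3.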
Support Vectors:
The data points or vectors that are closest to the hyperplane and that affect its position are termed support vectors. Since these vectors support the hyperplane, they are called support vectors.
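In scikit-learn, a fitted classifier exposes these points directly; a quick sketch on hand-made toy data (chosen for illustration only):

    import numpy as np
    from sklearn.svm import SVC

    # Four hand-made 2-D points, two per class.
    X = np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 3.0], [4.0, 4.0]])
    y = np.array([0, 0, 1, 1])

    clf = SVC(kernel="linear").fit(X, y)
    # The points closest to the separating line, which define its position.
    print(clf.support_vectors_)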
How does SVM work?
Linear SVM
The working of the SVM algorithm can be understood by using an example. Suppose we have a dataset that has two tags (green and blue), and the dataset has two features x1 and x2. We want a classifier that can classify the pair (x1, x2) of coordinates in either green or blue. Consider the below image:
As it is a 2-d space, by just using a straight line we can easily separate these two classes. But there can be multiple lines that can separate these classes. Consider the below image:
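A sketch of this two-feature, two-tag setup (assuming scikit-learn; make_blobs stands in for the green/blue data shown in the image):

    from sklearn.datasets import make_blobs
    from sklearn.svm import SVC

    # Two well-separated clusters play the roles of the green and blue
    # tags; each point has the two features x1 and x2.
    X, y = make_blobs(n_samples=100, centers=2, random_state=6)

    clf = SVC(kernel="linear").fit(X, y)
    # coef_ and intercept_ describe the line w1*x1 + w2*x2 + b = 0
    # that the SVM chose among all possible separating lines.
    print("w:", clf.coef_[0], "b:", clf.intercept_[0])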
Optimal Hyperplane
Hence, the SVM algorithm helps to find the best line or decision boundary; this best boundary or region is called a hyperplane. The SVM algorithm finds the points from both classes that are closest to the line; these points are called support vectors. The distance between the vectors and the hyperplane is called the margin, and the goal of SVM is to maximize this margin. The hyperplane with the maximum margin is called the optimal hyperplane.
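For a linear SVM, this margin can be read off the learned weights: with the support vectors lying on w·x + b = ±1, the distance between the two margin lines is 2/||w||. A self-contained sketch (assuming scikit-learn; the toy data is illustrative):

    import numpy as np
    from sklearn.datasets import make_blobs
    from sklearn.svm import SVC

    X, y = make_blobs(n_samples=100, centers=2, random_state=6)
    clf = SVC(kernel="linear").fit(X, y)

    # Maximizing the margin 2/||w|| is equivalent to minimizing ||w||,
    # which is what the SVM optimization does under the hood.
    margin = 2.0 / np.linalg.norm(clf.coef_[0])
    print("maximum margin found by the SVM:", margin)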
Non-linear SVM
If data is linearly arranged, then we can separate it by using a straight line, but for non-linear data we cannot draw a single straight line. Consider the below image:
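A sketch of such non-linear data (assuming scikit-learn; make_circles puts one class inside a ring of the other, which no straight line can split):

    from sklearn.datasets import make_circles
    from sklearn.svm import SVC

    # One class forms an inner circle, the other an outer ring.
    X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

    # A straight-line boundary cannot separate a circle from a ring,
    # so the linear SVM's accuracy stays near chance level.
    clf = SVC(kernel="linear").fit(X, y)
    print("linear kernel training accuracy:", clf.score(X, y))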
Such data can be made linearly separable by adding a new feature, for example z = x1² + x2²; in that lifted space, it is easy to have a linear hyperplane between these two classes in the SVM classifier. But another burning question arises: do we need to add this feature manually to have a hyperplane? No, the SVM algorithm has a technique called the kernel trick.
The SVM kernel is a function that takes a low-dimensional input space and transforms it into a higher-dimensional space, i.e. it converts a non-separable problem into a separable problem.
It is mostly useful in non-linear separation problems.
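Formally (standard kernel notation, added here rather than taken from the original text), a kernel computes the inner product of two points after an implicit mapping φ into the higher-dimensional space,

    \[ K(x, x') = \langle \phi(x), \phi(x') \rangle, \]

so the SVM never has to construct φ(x) explicitly. A widely used example is the RBF (Gaussian) kernel,

    \[ K(x, x') = \exp\left( -\gamma \lVert x - x' \rVert^2 \right). \]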
Simply put, it performs some extremely complex data transformations, then finds the boundary that separates the data based on the labels or outputs you've defined.
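A final sketch tying the pieces together (assuming scikit-learn and the circles data from before): the hand-added feature z = x1² + x2² with a linear SVM, versus letting the RBF kernel do the lifting implicitly.

    import numpy as np
    from sklearn.datasets import make_circles
    from sklearn.svm import SVC

    X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

    # Manual route: append z = x1^2 + x2^2, which makes the data linearly
    # separable in 3-D (inner circle gets small z, outer ring large z).
    z = (X ** 2).sum(axis=1, keepdims=True)
    X3 = np.hstack([X, z])
    print("linear SVM on lifted data:",
          SVC(kernel="linear").fit(X3, y).score(X3, y))

    # Kernel-trick route: the RBF kernel performs an implicit lifting,
    # so no feature has to be added by hand.
    print("RBF SVM on raw data:",
          SVC(kernel="rbf").fit(X, y).score(X, y))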