Anomaly Detection
1. Density Estimation
I would like to give full credit to the respective authors, as these are my personal Python
notebooks taken from deep learning courses by Andrew Ng, Data School and Udemy :)
This is a simple Python notebook hosted generously through GitHub Pages from my
main personal notes repository at https://github.com/ritchieng/ritchieng.github.io. The notes are
meant for my personal review, but I have open-sourced the repository as a
lot of people found it useful.
o Density estimation
Other anomaly detection examples
o If you have too many false positives (flagging examples as anomalous when they are not), decrease ε
o The area under the probability density curve (red shaded area) must always equal 1
Parameter estimation
o Fit μ_j = (1/m) Σ x_j^(i) and σ_j² = (1/m) Σ (x_j^(i) - μ_j)²
o The variance normalizer m might instead be (m - 1) in statistics
In practice, it makes very little difference
In machine learning, most people typically use (1 / m)
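Fitting these parameters is a one-liner per feature in NumPy; a minimal sketch (the toy data below is made up purely for illustration):

```python
import numpy as np

def estimate_gaussian(X):
    """Estimate the per-feature mean and variance of a Gaussian.

    Uses the (1/m) normalization from the notes; np.var defaults
    to ddof=0, which matches.
    """
    mu = X.mean(axis=0)
    sigma2 = X.var(axis=0)  # (1/m) * sum((x - mu)^2)
    return mu, sigma2

# toy data: 5 examples, 2 features (invented for illustration)
X = np.array([[1.0, 10.0],
              [2.0, 11.0],
              [3.0,  9.0],
              [2.0, 10.0],
              [2.0, 10.0]])
mu, sigma2 = estimate_gaussian(X)
print(mu)      # per-feature means -> [2. 10.]
print(sigma2)  # per-feature variances -> [0.4 0.4]
```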
Now we will use the Gaussian distribution to develop an anomaly detection algorithm
1c. Algorithm
Density estimation
Anomaly detection algorithm
o Compute p(x) for new examples and flag those with p(x) < ε, as they're anomalous
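The full algorithm can be sketched as follows: model p(x) as a product of per-feature Gaussians and flag examples whose density falls below ε. The parameter values and threshold here are illustrative, not from the course:

```python
import numpy as np

def gaussian_pdf(x, mu, sigma2):
    # univariate Gaussian density, evaluated element-wise per feature
    return np.exp(-(x - mu) ** 2 / (2 * sigma2)) / np.sqrt(2 * np.pi * sigma2)

def p(x, mu, sigma2):
    # density estimate: product over features of p(x_j; mu_j, sigma_j^2)
    return np.prod(gaussian_pdf(x, mu, sigma2))

mu = np.array([2.0, 10.0])     # fitted means (illustrative values)
sigma2 = np.array([0.4, 0.4])  # fitted variances (illustrative values)
epsilon = 1e-3                 # threshold, normally chosen on a CV set

x_normal = np.array([2.1, 10.2])
x_weird = np.array([5.0, 14.0])
print(p(x_normal, mu, sigma2) < epsilon)  # False -> not flagged
print(p(x_weird, mu, sigma2) < epsilon)   # True  -> flagged as anomalous
```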
o When developing a learning algorithm (choosing features etc.), making
decisions is much easier if we have a way of evaluating our learning
algorithm
o Assume we have some labeled data of anomalous and non-anomalous
examples
y = 0 if normal
y = 1 if anomalous
o Training set: x^(1), x^(2), …, x^(m)
Assume these are normal examples, not anomalous
o Cross validation set: (x_cv^(1), y_cv^(1)), …, (x_cv^(m_cv), y_cv^(m_cv))
o Test set: (x_test^(1), y_test^(1)), …, (x_test^(m_test), y_test^(m_test))
Aircraft engines example
o 10,000 good (normal) engines
o 20 flawed (anomalous) engines
Training set: 6000 good engines
This will be used to fit p(x)
CV: 2000 good engines (y = 0), 10 anomalous (y = 1)
Test: 2000 good engines (y = 0), 10 anomalous (y = 1)
Algorithm Evaluation
o Because y = 0 is far more common, the data set is skewed
Hence, classification accuracy is not an appropriate metric; use
precision/recall or the F1 score instead
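A minimal sketch of evaluating with the F1 score and using it to pick ε on the cross validation set (the densities, labels, and candidate thresholds below are invented for illustration):

```python
import numpy as np

def f1_for_threshold(p_cv, y_cv, epsilon):
    """F1 score when flagging examples with p(x) < epsilon as anomalies.

    Accuracy would be misleading here: predicting y = 0 everywhere
    already scores ~99.5% on 2000 good / 10 anomalous engines, so we
    score precision/recall on the rare y = 1 class instead.
    """
    pred = (p_cv < epsilon).astype(int)
    tp = np.sum((pred == 1) & (y_cv == 1))
    fp = np.sum((pred == 1) & (y_cv == 0))
    fn = np.sum((pred == 0) & (y_cv == 1))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# tiny illustrative CV set: densities p(x) and true labels (made up)
p_cv = np.array([0.40, 0.35, 0.30, 0.0001, 0.0002])
y_cv = np.array([0,    0,    0,    1,      1])

# pick the candidate epsilon with the best F1 on the CV set
candidates = [1e-5, 1e-3, 0.5]
best = max(candidates, key=lambda e: f1_for_threshold(p_cv, y_cv, e))
print(best)  # -> 0.001
```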
| Anomaly Detection | Supervised Learning |
|-------------------|---------------------|
| Very small number of positive examples (y = 1, typically 0-20) | Large number of positive and negative examples |
| Large number of negative examples (y = 0) | |
| Many different types of anomalies. Hard for any algorithm to learn from positive examples what the anomalies look like; future anomalies may look nothing like any of the anomalous examples we have seen so far. | Enough positive examples for the algorithm to get a sense of what positive examples are like; future positive examples are likely to be similar to ones in the training set. |
| Fraud detection | Email spam classification |
| Manufacturing | Weather prediction |
| Monitoring machines in a data center | Cancer classification |
How do we choose features?
o Choose features that might take on unusually large or small values in the
event of an anomaly
o Example: monitoring computers in a data center
Create a new feature, e.g. x5 = CPU load / network traffic
The new feature x5 would take a very large value when there is a
huge CPU load but low network traffic
This way you can catch anomalies
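A tiny sketch of this kind of feature engineering (the array values and the "stuck loop" scenario are invented for illustration):

```python
import numpy as np

# hypothetical monitoring data for 4 machines (values are made up)
cpu_load = np.array([0.5, 0.6, 0.55, 0.9])
network_traffic = np.array([100.0, 120.0, 110.0, 2.0])  # 4th machine: busy CPU, quiet network

# new feature: large when CPU is busy but network traffic is low,
# e.g. a machine stuck in an infinite loop
x5 = cpu_load / network_traffic
print(x5)  # the 4th machine stands out at 0.45; the others are ~0.005
```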
3. Multivariate Gaussian (Normal) Distribution
o Covariance matrix, Σ
The diagonal elements of Σ are the variances of the individual features
If you reduce a diagonal entry, the gaussian becomes sharper (more peaked) along that axis
If you increase it, the gaussian becomes wider
o Mean vector, μ
Varying the μ parameter shifts the center of the distribution
3b. Anomaly Detection using Multivariate Gaussian Distribution
Multivariate gaussian distribution
o The original model is actually a special case of the multivariate gaussian
model, with the off-diagonal entries of Σ constrained to be zero
o Try to get rid of features that are linearly dependent, and remove duplicated
features, before fitting the multivariate gaussian model; otherwise Σ is
non-invertible (you also need m > n for Σ to be invertible)
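A minimal sketch, using NumPy to fit μ and Σ and `scipy.stats.multivariate_normal` to evaluate the density (the data, correlation structure, and ε below are illustrative assumptions):

```python
import numpy as np
from scipy.stats import multivariate_normal

def fit_multivariate_gaussian(X):
    """Fit mu and the full covariance matrix Sigma.

    Sigma must be invertible, so we need m > n and no linearly
    dependent (or duplicated) features, as noted above.
    """
    mu = X.mean(axis=0)
    Sigma = np.cov(X, rowvar=False, bias=True)  # bias=True -> (1/m) normalization
    return mu, Sigma

rng = np.random.default_rng(0)
# strongly correlated 2-D data; the original per-feature model cannot
# capture this correlation, but the multivariate model can
X = rng.multivariate_normal([2.0, 10.0], [[1.0, 0.9], [0.9, 1.0]], size=500)
mu, Sigma = fit_multivariate_gaussian(X)

p = multivariate_normal(mean=mu, cov=Sigma).pdf
epsilon = 1e-4  # illustrative threshold

# a point that looks normal feature-by-feature but violates the correlation
x = np.array([0.5, 11.5])
print(p(x) < epsilon)  # True -> flagged as an anomaly
```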