MAI Lecture 01 Introduction

This document provides an introduction to machine learning for artificial intelligence. It discusses how machine learning programs improve with experience by learning from large amounts of data. It outlines some of the key applications of machine learning, including detecting spam, predicting the weather, and classifying images. The document then reviews some of the major developments in the field, including early work on checkers-playing programs in the 1950s and breakthroughs with neural networks that led to today's renaissance in deep learning. It closes by discussing some of the challenging questions around how machine learning will shape the future and impact society.

Mathematics for AI

LECTURE 1 Introduction

Machine Learning (ML)
Programs that improve with experience.
Revolutionizing Science and Technology
“A breakthrough in machine learning would be worth ten
Microsofts.” (Bill Gates, Microsoft)

“It will be the basis and fundamentals of every successful
huge IPO win in 5 years.” (Eric Schmidt, Google / Alphabet)

“AI and machine learning are going to change the world
and we really have not begun to scratch the surface.”
(Jennifer Chayes, Microsoft / Berkeley)

“ML is transforming sector after sector of the economy, and
the rate of progress only seems to be accelerating.”
(Daphne Koller, Stanford / Coursera / Insitro)

“Machine learning is the next Internet.” (Tony Tether, DARPA)
What is Machine Learning?
Traditional computing: given an Input (2, 1, 0, -1) and a Program (Fun(x): x > 0?),
the Computer produces the Output (Yes, Yes, No, No).

Machine learning: given an Input (2, 1, 0, -1) and the desired Output (Yes, Yes, No, No),
the Computer produces a Program (Fun(x): x > 0.5?).
What is Machine Learning?
Machine learning: from Input (2, 1, 0, -1) and Output (Yes, Yes, No, No),
the Computer learns the Program Fun(x): x > 0.5?.

Traditional computing: running that Program on new Input (1, 0.5, 0, -1)
produces the Output (Yes, No, No, No).
What is Machine Learning?
Tom Mitchell, 1997:
“A computer program is said to learn from experience E with respect to
some class of tasks T and performance measure P, if its performance at
tasks in T, as measured by P, improves with experience E.”

Training: from Input (2, 1, 0, -1) and Output (Yes, Yes, No, No),
the Computer learns the Program Fun(x): x > 0.5?.

Testing: the learned Program is applied to new Input (1, 0.5, 0, -1),
producing Output (Yes, No, No, No).
Learning to Detect Spam
• Use past e-mails and whether or not they were flagged as spam (Spam or Not).
• Learn a program that takes a future e-mail and decides whether it is spam.
• E.g., if the e-mail is from an unknown sender, has a misspelling, and
contains “Million Dollars”, flag it as spam.
(In Mitchell's terms: T = classifying e-mails, P = the fraction classified
correctly, E = the past labeled e-mails.)
Applications of ML
Use past data to …

Detect spam Predict weather Classify images

Other Examples:
Fraud Detection, Flagging inappropriate social media posts,
Natural Language Processing, Document Classification,
Designing Economic Mechanisms, Computational Advertising, …
[Figure: Machine Learning at the center of the fields it draws on and contributes to:
Computer Science, Computational Biology, Information Theory, Ethics, Robotics,
Cognitive Science, Math, Control Theory, Statistics, Economics, ECE, and Neuroscience.]
The Turing Test, 1950

Alan Turing: A machine is intelligent if its answers
are indistinguishable from a human’s.
Checkers Program, 1952

Arthur Samuel created a checkers-playing program
that got better over time.

He also introduced the term “Machine Learning”.
Perceptron, 1957
Predecessor of deep networks.

Frank Rosenblatt (@ Cornell!): separating two classes of objects
using a linear threshold classifier.

Provable learning and convergence guarantees.
1960s: Lots of hope for AI to solve everything!

AI didn’t live up to the hype!


• 1966: Machine Translation failed.
• 1970: Minsky and Papert argued against Perceptron.
• 1971: Speech Understanding failed.
• 1973: The Lighthill report tore AI apart:
“In no part of the field have the discoveries made so far
produced the major impact that was then promised.”
• 1974: The UK and US stopped funding AI research.

The AI Winter, 1974-1980
Rebirth as Machine Learning
Machine Learning:
• Originally, a bit of a name game to get funding.
• Fundamentally a different approach to intelligence:

Machine Learning: data-driven, a bottom-up approach.
Artificial Intelligence: knowledge-based, heavy use of logic, a top-down approach.
Foundations of ML, 1980s-present
Formal notions of learnability from data.
• When is data-driven learning possible?
 ▪ Probably Approximately Correct (PAC) learning by Valiant.
 ▪ How much data is required?
• What is the difference between great and mediocre learners?
 ▪ Improving the performance of a learning algorithm.
 ▪ The boosting algorithm of Freund and Schapire.
• How do we deal with difficult and noisy learning problems?
 ▪ (Soft-margin) Support Vector Machines by Cortes and Vapnik.
• What do we do when the learning task evolves over time?
 ▪ The online learning framework.
TD-Gammon, 1992
Gerald Tesauro at IBM taught a
neural network to play
Backgammon.

The net played 100K+ games
against itself and beat the world
champion.

The algorithm found new techniques
that people had erroneously ruled
out.
Deep Blue, 1997
IBM’s Deep Blue won against
Kasparov in chess.

The crucial winning move was
made due to machine learning
methods developed by Gerald
Tesauro.
Expanding the reach, 2000s
Learning to rank
 Powering search engines: Google, Bing, …

Topic Modeling:
 Detecting and organizing documents by subject matter.
 Making sense of the unstructured data on the web.

Online economy:
 Ad placement and pricing.
 Product recommendation.

Machine learning became profitable!
Return of Neural Networks, 2010s
Neural networks return and excel at image
recognition, speech recognition, …

The 2018 Turing Award was given to Yoshua
Bengio, Geoff Hinton, and Yann LeCun.
Surrounded by Machine Learning

Nika Haghtalab

“With great power, there must also come great responsibility!”
Data Privacy
• Learning models leak training data (Fredrickson et al. ’15).
• Learning algorithms detect sexual orientation better than people (Wang & Kosinski ’17).

[Figure: a training image leaked from a model, next to the real image.]

Formal definitions of data privacy:
• k-anonymity (Latanya Sweeney).
• Differential Privacy (Cynthia Dwork, Frank McSherry, Kobbi Nissim, Adam Smith).
Robust and Secure ML

• Image recognition: misreading traffic signs (Eykholt et al.).
• Speech recognition: hiding commands in noise (Carlini & Wagner).
• Poisoning attacks: Tay (a chat bot) became inflammatory in 16 hours.

How do we create robust and secure machine learning algorithms?
Learning and Society
• Bad dynamics, perpetuating and worsening stereotypes and biases.
• Who carries the burden of bad predictions?
• How do we design good dynamics?
Challenging Questions

Machine learning and Artificial Intelligence will shape the future;
what kind of a future do we want?

What is the role of machine learning?
 ▪ ML for good versus ML for profit.

How do automation and learning change the quality of life?
 ▪ Job loss and displacement, life satisfaction, safety and security?

How do we approach machine learning and (inter-)national security?
 ▪ The weaponization of machine learning and AI?
Levels of Measurement
In statistics, data is divided into two types:
 ▪ Qualitative: qualities or descriptions.
 ▪ Quantitative: quantities or numbers.
Example: a cup of coffee
 ▪ Qualitatively (non-numerical qualities):
   ▪ Brown
   ▪ Strong aroma
   ▪ White cup
   ▪ Hot to the touch
 ▪ Quantitatively:
   ▪ 12 fluid ounces
   ▪ 106 calories
   ▪ 65 degrees Celsius
   ▪ $4.99 cost
Levels of Measurement
According to “On the Theory of Scales of Measurement” (Stevens, 1946), there are four levels:
 ▪ Nominal data
 ▪ Ordinal data
 ▪ Interval data
 ▪ Ratio data
Observations
 ▪ Nominal: non-numeric categories.
 ▪ Ordinal: non-numeric categories with an order.
 ▪ Interval: has an arbitrary zero, whereas
 ▪ Ratio: has an actual, non-arbitrary zero.
Normalization Observation
 ▪ All input and output of machine learning algorithms are
   typically vectors of floating-point numbers,
   whether the data are nominal, ordinal, interval, or ratio.
 ▪ However, nominal and ordinal data are not inherently numeric.
 ▪ Some algorithms expect values in the range -1 to +1 or 0 to +1.
 ▪ Why is normalization necessary?
   ▪ Values can be on very different scales: e.g., a stock’s trading
     volume may be in the millions while the number of stocks is around 10.
   ▪ The large numbers overwhelm the small ones.
   ▪ Solution: use percentages, e.g., 5%, 10%.
Example: the iris dataset
"Sepal Length","Sepal Width","Petal Length","Petal Width","Species"
5.1,3.5,1.4,0.2,"setosa"
4.9,3.0,1.4,0.2,"setosa"
4.7,3.2,1.3,0.2,"setosa"
...
7.0,3.2,4.7,1.4,"versicolor"
6.4,3.2,4.5,1.5,"versicolor"
6.9,3.1,4.9,1.5,"versicolor"
...
6.3,3.3,6.0,2.5,"virginica"
5.8,2.7,5.1,1.9,"virginica"
7.1,3.0,5.9,2.1,"virginica"

Five pieces of information per row:
• Sepal length
• Sepal width
• Petal length
• Petal width
• Species
Normalizing Nominal Observations
 ▪ One-of-n normalization, e.g., for the three iris species:
   5.1,3.5,1.4,0.2,"setosa"
   7.0,3.2,4.7,1.4,"versicolor"
   6.3,3.3,6.0,2.5,"virginica"
 ▪ Range -1, 1:
   ▪ Setosa: 1, -1, -1
   ▪ Versicolor: -1, 1, -1
   ▪ Virginica: -1, -1, 1
 ▪ Range 0, 1? How do we encode? (See the sketch below.)
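A minimal sketch of one-of-n encoding in Python (the function name and class list are our own, not from the slides); moving to the 0, 1 range only changes the "off" value:

def one_of_n(label, classes, off=-1.0, on=1.0):
    # Encode a nominal label as a one-of-n vector: "on" at the
    # label's position, "off" everywhere else.
    return [on if c == label else off for c in classes]

classes = ["setosa", "versicolor", "virginica"]
print(one_of_n("versicolor", classes))           # [-1.0, 1.0, -1.0]
print(one_of_n("versicolor", classes, off=0.0))  # [0.0, 1.0, 0.0]  (range 0, 1)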
Normalizing Ordinal Observations
 ▪ Ordinal data are not necessarily numeric,
   but have an implied ordering.
 ▪ Example: education level.
 ▪ One simple encoding assigns each of the N categories its rank
   i = 0, 1, ..., N-1 and maps the rank into the encoding range:
     f(i) = i (nH - nL) / (N - 1) + nL,
   where nH and nL are the high and low ends of the encoding range.
Normalizing Quantitative Observations
 ▪ Quantitative observations are always numeric,
   so we may not need to normalize.
 ▪ Given dataHigh dH, dataLow dL, normalizedHigh nH,
   and normalizedLow nL, range normalization maps
     f(x) = (x - dL)(nH - nL) / (dH - dL) + nL.

Example: Normalizing Quantitative Observations
 ▪ Normalizing the weight of a car:
   • dataHigh: 4,000
   • dataLow: 100
   • normalizedHigh: 1
   • normalizedLow: -1
 ▪ Given weight = 1,000:
     f(1000) = (1000 - 100)(1 - (-1)) / (4000 - 100) + (-1) = 1800/3900 - 1 ≈ -0.54
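As a check on the arithmetic above, a minimal sketch of range normalization in Python (the function names are ours):

def normalize_range(x, d_low, d_high, n_low=-1.0, n_high=1.0):
    # Map x from the data range [d_low, d_high] into [n_low, n_high].
    return (x - d_low) * (n_high - n_low) / (d_high - d_low) + n_low

def denormalize_range(y, d_low, d_high, n_low=-1.0, n_high=1.0):
    # Invert normalize_range: map y back into [d_low, d_high].
    return (y - n_low) * (d_high - d_low) / (n_high - n_low) + d_low

print(normalize_range(1000, 100, 4000))  # -0.538..., the car-weight example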
Other Ways of Normalization
 ▪ Reciprocal normalization: f(x) = 1/x.
 ▪ Equilateral normalization (next slide).
Equilateral Normalization
Ideal output: -1, -1, 1
Actual output: -1, 1, -1

Listing 2.1: Calculated Class Equilateral Values, 3 Classes

: -0.8660, -0.5000
:  0.8660, -0.5000
:  0.0000,  1.0000

Advantages of Equilateral Encoding
• Requires one fewer output than one-of-n.
• Spreads the “blame” better than one-of-n.
Equilateral Encoding Examples
 ▪ 2 categories  ▪ 3 categories  ▪ 4 categories
[Figure: the encodings plotted as vertices of a segment, an equilateral triangle, and a regular tetrahedron.]

Implementation
• “Practical Neural Network Recipes in C++” by Masters (1993), who
cited an article in PCAI as the actual source (Guiver, 1991).
A sketch of the construction follows below.
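A sketch of that construction in Python, as we understand it from Masters (1993); the code is our own transcription, not from the slides. It places the n classes at the vertices of a regular simplex in n-1 dimensions, each vertex a unit vector:

import numpy as np

def equilateral(n):
    # One row per class; row k is the (n-1)-dimensional encoding of class k.
    m = np.zeros((n, n - 1))
    m[0, 0], m[1, 0] = -1.0, 1.0
    for k in range(2, n):
        # Shrink the existing k vertices to make room on a new axis...
        f = np.sqrt(k * k - 1.0) / k
        m[:k, : k - 1] *= f
        # ...push them down on that axis, and put class k at its top.
        m[:k, k - 1] = -1.0 / k
        m[k, k - 1] = 1.0
    return m

print(np.round(equilateral(3), 4))
# [[-0.866 -0.5  ]
#  [ 0.866 -0.5  ]
#  [ 0.     1.   ]]   -- matches Listing 2.1

Decoding picks the class whose vertex is closest to the network's output, which is how the "blame" for an error gets spread across all n-1 outputs.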
Additional Normalizations
 ▪ Z (Gaussian) normalization
 ▪ Min-max normalization
 ▪ Unit vector normalization
 ▪ Mean normalization
(Sketches of each follow below.)
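Brief sketches of the four in Python, using the standard textbook definitions (an assumption, since the slide gives only the names):

import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0])

z_norm    = (x - x.mean()) / x.std()              # Z (Gaussian): zero mean, unit variance
min_max   = (x - x.min()) / (x.max() - x.min())   # min-max: rescale into [0, 1]
unit_vec  = x / np.linalg.norm(x)                 # unit vector: scale to length 1
mean_norm = (x - x.mean()) / (x.max() - x.min())  # mean: center, then scale by the range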
Machine Learning Models

 Data classification
 Regression analysis
 Clustering
 Time Series

Classification

 ▪ Example: credit scoring
 ▪ Differentiating between low-risk and high-risk customers
   from their income and savings.

[Figure: customers plotted by income and savings; the two thresholds
split the plane into low-risk and high-risk regions.]

Discriminant (the model): IF income > θ1 AND savings > θ2
                          THEN low-risk ELSE high-risk
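A minimal sketch of this discriminant as code; the threshold values θ1 and θ2 below are hypothetical, since in practice they are learned from the data:

THETA1 = 30_000  # hypothetical income threshold (θ1)
THETA2 = 10_000  # hypothetical savings threshold (θ2)

def credit_risk(income, savings):
    # IF income > θ1 AND savings > θ2 THEN low-risk ELSE high-risk.
    return "low-risk" if income > THETA1 and savings > THETA2 else "high-risk"

print(credit_risk(45_000, 15_000))  # low-risk
print(credit_risk(45_000, 5_000))   # high-risk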
Classification: Applications

 ▪ Also known as pattern recognition.

 ▪ Face recognition: pose, lighting, occlusion (glasses, beard),
   make-up, hair style.
 ▪ Character recognition: different handwriting styles.
 ▪ Speech recognition: temporal dependency.
   ▪ Use of a dictionary or the syntax of the language.
 ▪ Sensor fusion: combining multiple modalities, e.g., visual (lip image)
   and acoustic, for speech.
 ▪ Medical diagnosis: from symptoms to illnesses.
 ▪ Web advertising: predict whether a user clicks on an ad on the Internet.
Face Recognition

[Figure: training examples of a person, and test images.]

AT&T Laboratories, Cambridge UK
http://www.uk.research.att.com/facedatabase.html
Regression

 ▪ Example: price of a used car.
 ▪ x: car attributes
   y: price
 ▪ Linear model: y = w x + w0
 ▪ General model: y = g(x | θ),
   where g(·) is the model and θ its parameters.
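A minimal sketch of fitting the linear model y = w x + w0 by least squares in Python; the car data below are invented for illustration:

import numpy as np

# Hypothetical used cars: x = mileage (thousand km), y = price (thousand $).
x = np.array([20.0, 50.0, 80.0, 110.0, 140.0])
y = np.array([18.5, 14.0, 10.5, 7.0, 4.5])

# Least squares on the columns [x, 1] yields the parameters (w, w0).
A = np.stack([x, np.ones_like(x)], axis=1)
(w, w0), *_ = np.linalg.lstsq(A, y, rcond=None)

print(f"y = {w:.4f} * x + {w0:.4f}")
print(w * 100.0 + w0)  # predicted price of a car with 100,000 km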
Regression Applications

 ▪ Navigating a car: angle of the steering wheel
   (CMU NavLab).
 ▪ Kinematics of a robot arm: given a target position (x, y),
   find the joint angles α1 = g1(x, y) and α2 = g2(x, y).

[Figure: a two-joint robot arm reaching the point (x, y),
with joint angles α1 and α2.]
Time Series
 ▪ Example: financial analysis.
 ▪ Encode the data.
 ▪ Normalize over a sliding window (see the sketch below).
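A minimal sketch of sliding-window normalization in Python (the window width and the per-window min-max scaling are our assumptions about what the slide intends):

import numpy as np

def sliding_windows(series, width):
    # Cut the series into overlapping windows and min-max normalize each
    # window to [0, 1], so only the shape inside the window remains.
    out = []
    for i in range(len(series) - width + 1):
        w = np.asarray(series[i : i + width], dtype=float)
        lo, hi = w.min(), w.max()
        out.append((w - lo) / (hi - lo) if hi > lo else np.zeros(width))
    return np.array(out)

prices = [10.0, 10.5, 11.0, 10.8, 11.5, 12.0]
print(sliding_windows(prices, 4))  # three windows of width 4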
Resources: Datasets

 UCI Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html


 UCI KDD Archive: http://kdd.ics.uci.edu/summary.data.application.html
 Statlib: http://lib.stat.cmu.edu/
 Delve: http://www.cs.utoronto.ca/~delve/

Textbook and Course Material
• Textbooks
  • Mathematics for Machine Learning, Marc Peter Deisenroth,
    A. Aldo Faisal, and Cheng Soon Ong, Cambridge University Press, 2020.
  • Pattern Recognition and Machine Learning, Christopher Bishop.
  • Machine Learning: A Probabilistic Perspective, Kevin P. Murphy.
• References
  – Machine Learning, Tom Mitchell.
  – The Elements of Statistical Learning, Trevor Hastie,
    Robert Tibshirani, and Jerome Friedman.
• Course Notes
  – Slides available on the course Google Classroom.
