
Algorithmic

Intelligence Laboratory

Introduction to Neural Networks:


DNN / CNN / RNN

EE807: Recent Advances in Deep Learning


Lecture 1

Slides made by Hyungwon Choi and Yunhun Jang
KAIST EE

Algorithmic Intelligence Laboratory


What is Machine/Deep Learning?

• Human Learning

• Machine Learning = Build an algorithm from data


• Deep learning is a special class of algorithms within machine learning

[Figure: learning perceptions, learning interactions]

Algorithmic Intelligence Laboratory 2
Definition of Deep Learning

• An algorithm that learns multiple levels of abstractions in data

• Deep & large networks trained on lots of data learn multi-layer data representations (a feature hierarchy): edges → parts → objects
Algorithmic Intelligence Laboratory 3
Deep Learning = Feature Learning

• Why does deep learning outperform other machine learning (ML) approaches for vision, speech, and language?

• Traditional pipeline: Input → hand-crafted feature extraction (e.g., SIFT) → other ML → Output
• Deep learning: Input → Deep Network → Output (the features are learned)

Algorithmic Intelligence Laboratory 4


Table of Contents

1. Deep Neural Networks (DNN)


• Basics
• Training : Back propagation

2. Convolutional Neural Networks (CNN)


• Basics
• Convolution and pooling
• Some applications

3. Recurrent Neural Networks (RNN)


• Basics
• Character-level language model (example)

4. Question
• Why is it difficult to train a deep neural network?

Algorithmic Intelligence Laboratory 5


Table of Contents

1. Deep Neural Networks (DNN)


• Basics
• Training : Back propagation

2. Convolutional Neural Networks (CNN)


• Basics
• Convolution and pooling
• Some applications

3. Recurrent Neural Networks (RNN)


• Basics
• Character-level language model (example)

4. Question
• Why is it difficult to train a deep neural network?

Algorithmic Intelligence Laboratory 6


DNN: Neurons in the Brain

• The human brain is made up of about 100 billion neurons


• Neurons receive electric signals at the dendrites and send them to the axon
• Dendrites can perform complex non-linear computations
• Synapses are not a single weight but a complex non-linear dynamical system

Algorithmic Intelligence Laboratory *source : https://pt.slideshare.net/hammawan/deep-neural-networks 7


DNN: Artificial Neural Networks

• Artificial neural networks


• A simplified version of biological neural network

[Diagram: artificial neuron with inputs, weights, summation, bias, a nonlinear activation function, and the output (activation) of the neuron, i.e., output = f(Σ_i w_i x_i + b)]

Algorithmic Intelligence Laboratory 8
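As a rough illustration (not from the slides), the computation a single artificial neuron performs can be sketched in a few lines of Python; the weights, bias, and sigmoid choice below are illustrative assumptions.

```python
import numpy as np

def neuron(x, w, b):
    """Single artificial neuron: weighted sum of inputs plus bias,
    passed through a nonlinear activation (sigmoid here)."""
    z = np.dot(w, x) + b             # summation of weighted inputs + bias
    return 1.0 / (1.0 + np.exp(-z))  # nonlinear activation (sigmoid)

x = np.array([1.0, -0.5])            # inputs (illustrative)
w = np.array([0.8, 0.2])             # weights (illustrative)
b = 0.1                              # bias (illustrative)
print(neuron(x, w, b))               # output / activation of the neuron
```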


DNN: The Brain vs. Artificial Neural Networks

• Similarities
• Consists of neurons & connections between neurons
• Learning process = Update of connections
• Massive parallel processing

• Differences
• Computation within a neuron is vastly simplified
• Discrete time steps
• Typically some form of supervised learning with a massive number of stimuli

Algorithmic Intelligence Laboratory *source : http://mt-class.org/jhu/slides/lecture-nn-intro.pdf 9


DNN: Basics

• Deep neural networks


• Neural network with more than 2 layers
• Can model more complex functions

[Diagram: a single neuron (inputs, weights, summation, bias, nonlinear activation function) next to a network with an input layer, one hidden layer, and an output layer: a "2-layer Neural Net", also called a "1-hidden-layer Neural Net"]

Algorithmic Intelligence Laboratory 10


DNN: Notation

• Training dataset {(x^(i), y^(i))}, i = 1, ..., N
  • x: input data
  • y: target data (or label for classification)

• Neural network f_θ parameterized by θ

Next, forward propagation


Algorithmic Intelligence Laboratory 11
DNN: Forward Propagation

• Forward propagation: calculate the output of the neural network layer by layer:

  h^(l) = σ(W^(l) h^(l-1) + b^(l)), l = 1, ..., L, with h^(0) = x and output ŷ = h^(L)

  where σ is the activation function (e.g., the sigmoid function) and L is the number of layers

Algorithmic Intelligence Laboratory 12
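A minimal Python sketch of forward propagation under the formulation above (sigmoid activation, layer-by-layer matrix multiplies); the layer sizes and random weights are illustrative assumptions, not the slides' values.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    """Layer-by-layer forward propagation: h^(l) = sigma(W^(l) h^(l-1) + b^(l))."""
    h = x
    for W, b in zip(weights, biases):
        h = sigmoid(W @ h + b)
    return h

# Illustrative 2-layer network: 2 inputs -> 3 hidden units -> 1 output
rng = np.random.default_rng(0)
weights = [rng.normal(size=(3, 2)), rng.normal(size=(1, 3))]
biases  = [np.zeros(3), np.zeros(1)]
print(forward(np.array([1.0, -0.5]), weights, biases))
```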


DNN: Forward Propagation (Example)

• Input data: two input units with values 1.0 and -0.5
Algorithmic Intelligence Laboratory 14


DNN: Forward Propagation (Example)

• Compute the hidden units from the inputs (1.0, -0.5): hidden activations 0.79, 0.92, 0.16,
  where each hidden unit applies the sigmoid σ(z) = 1/(1 + e^(-z)) to its weighted input sum

Algorithmic Intelligence Laboratory 15


DNN: Forward Propagation (Example)

• Compute the output from the hidden units (0.79, 0.92, 0.16): output value 0.62

Next, training objective


Algorithmic Intelligence Laboratory 16
DNN: Objective

• Objective: find a parameter θ that minimizes the error (or empirical risk):

  θ* = argmin_θ (1/N) Σ_i ℓ(f_θ(x^(i)), y^(i))

  where ℓ is a loss function, e.g., MSE (mean squared error) or cross-entropy

Next, how to optimize θ?


Algorithmic Intelligence Laboratory 17
DNN: Training

• Gradient descent (GD) updates the parameters iteratively in the negative gradient direction:

  θ ← θ - η ∇_θ L(θ)

  where θ are the parameters, L is the loss function, and η is the learning rate

• Backpropagation
  • First adjust the last-layer weights
  • Propagate the error back to each previous layer
  • Adjust the previous layer's weights

Next, backpropagation in detail


Algorithmic Intelligence Laboratory 18
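A minimal sketch of the gradient descent update θ ← θ - η ∇L(θ) on a toy one-parameter loss; the loss function and learning rate here are illustrative, not from the slides.

```python
# Gradient descent on a toy loss L(theta) = (theta - 3)^2,
# whose gradient is dL/dtheta = 2 * (theta - 3).
theta = 0.0          # parameter
lr = 0.1             # learning rate
for step in range(50):
    grad = 2.0 * (theta - 3.0)   # gradient of the loss at the current parameter
    theta = theta - lr * grad    # update in the negative gradient direction
print(theta)  # approaches the minimizer 3.0
```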
DNN: Backpropagation

• Consider a training input x with target y

• Forward propagation to compute the output ŷ
  • l-th layer intermediate output h^(l) (kept for the backward pass)

Algorithmic Intelligence Laboratory 19


DNN: Backpropagation

• Consider a training input x with target y

• Forward propagation to compute the output ŷ
  • l-th layer intermediate output h^(l)
• Compute the error E = ½ (y - ŷ)² (where the loss is the MSE loss)

Algorithmic Intelligence Laboratory 20


DNN: Backpropagation

• Consider a training input x with target y

• Forward propagation to compute the output ŷ
  • l-th layer intermediate output h^(l)
• Compute the error E = ½ (y - ŷ)² (where the loss is the MSE loss)

• Compute the derivative of E with respect to the last-layer weights via the chain rule

Algorithmic Intelligence Laboratory 21


DNN: Backpropagation

• Consider a training input x with target y

• Forward propagation to compute the output ŷ
  • l-th layer intermediate output h^(l)
• Compute the error E = ½ (y - ŷ)² (where the loss is the MSE loss)

• Compute the derivative of E with respect to the last-layer weights via the chain rule

• Parameter update rule: subtract the gradient scaled by the learning rate η

Algorithmic Intelligence Laboratory 23


DNN: Backpropagation

• Consider a training input x with target y

• Forward propagation to compute the output ŷ
  • l-th layer intermediate output h^(l)
• Compute the error E = ½ (y - ŷ)² (where the loss is the MSE loss)

• Similarly, we can compute the gradients with respect to the earlier-layer weights by propagating the error backward

• And update them using the same update rule

Algorithmic Intelligence Laboratory 25


DNN: Backpropagation (Example)

• Compute the error between the network output (0.62) and the target
  [same example network: inputs 1.0, -0.5; hidden units 0.79, 0.92, 0.16; output 0.62]

• Compute the gradient of the error with respect to the output-layer weights

Algorithmic Intelligence Laboratory 26


DNN: Backpropagation (Example)

• Compute the gradient of the error with respect to the output-layer weights
  [same example network: inputs 1.0, -0.5; hidden units 0.79, 0.92, 0.16; output 0.62]

• Update those weights with the gradient descent rule

Algorithmic Intelligence Laboratory 27


DNN: Backpropagation (Example)

• Compute the gradient of the error with respect to the output-layer weights
  [same example network: inputs 1.0, -0.5; hidden units 0.79, 0.92, 0.16; output 0.62]

• Update those weights with the gradient descent rule

• Similarly, we can update the remaining (hidden-layer) weights

Algorithmic Intelligence Laboratory 28
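Putting the steps above together, here is a hedged sketch of training a tiny 2-3-1 network with sigmoid activations and MSE loss, mirroring the structure (not the exact numbers) of the slides' running example; the initial weights, learning rate, and target are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(3, 2)), np.zeros(3)   # hidden-layer weights / bias
W2, b2 = rng.normal(size=(1, 3)), np.zeros(1)   # output-layer weights / bias
x, y = np.array([1.0, -0.5]), np.array([1.0])   # one training example (illustrative target)
lr = 0.5

for step in range(100):
    # forward propagation
    h = sigmoid(W1 @ x + b1)                    # hidden activations
    yhat = sigmoid(W2 @ h + b2)                 # network output
    loss = 0.5 * np.sum((yhat - y) ** 2)        # MSE loss

    # backpropagation (chain rule, output layer first, then the hidden layer)
    delta2 = (yhat - y) * yhat * (1 - yhat)     # dLoss/dz2
    dW2, db2 = np.outer(delta2, h), delta2
    delta1 = (W2.T @ delta2) * h * (1 - h)      # error propagated back: dLoss/dz1
    dW1, db1 = np.outer(delta1, x), delta1

    # gradient descent updates
    W2 -= lr * dW2; b2 -= lr * db2
    W1 -= lr * dW1; b1 -= lr * db1

print(loss)  # decreases toward 0 as the output approaches the target
```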


Table of Contents

1. Deep Neural Networks (DNN)


• Basics
• Training : Back propagation

2. Convolutional Neural Networks (CNN)


• Basics
• Convolution and Pooling
• Some applications

3. Recurrent Neural Networks (RNN)


• Basics
• Character-level language model (example)

4. Question
• Why is it difficult to train a deep neural network?

Algorithmic Intelligence Laboratory 29


CNN: Drawbacks of Fully-Connected DNN

• Previous DNNs use fully-connected layers


• Connect all the neurons between the layers

• Drawbacks
• (-) Large number of parameters
• Prone to over-fitting
• Large memory consumption

• (-) Does not enforce any structure, e.g., local information


• In many applications, local features are important, e.g., images, language, etc.

Algorithmic Intelligence Laboratory 30


CNN: Basics

• Weight sharing and local connectivity (convolution)


• Use multiple filters that convolve over the inputs
• (+) Reduce the number of parameters (less over-fitting)
• (+) Learn local features
• (+) Translation invariance

• Pooling (or subsampling)


• Make the representations smaller
• (+) Reduce number of parameters and computation

Algorithmic Intelligence Laboratory *source : http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.121.1794&rep=rep1&type=pdf 30


CNN: Weight Sharing and Translation Invariance

• Weight sharing
• Apply same weights over the different spatial regions
• One can achieve translation invariance (not perfect though)

Algorithmic Intelligence Laboratory *source : https://www.cc.gatech.edu/~san37/post/dlhc-cnn/ 32


CNN: Weight Sharing and Translation Invariance

• Weight sharing
• Apply same weights over the different spatial regions
• One can achieve translation invariance

• Translation invariance
• When input is changed spatially (translated or shifted), the corresponding output
to recognize the object should not be changed
• CNN can produce the same output even though the input image is shifted due to
weight sharing

Algorithmic Intelligence Laboratory *source : https://www.cc.gatech.edu/~san37/post/dlhc-cnn/ 33


CNN: Convolution
Fully-connected layer
• 32×32×3 image → stretch to a 3072×1 vector
• Input (3072×1), weight matrix (10×3072), activation (10×1)
• Each activation is the result of taking a dot product between a row of the weight matrix and the input

Convolution layer
• 32×32×3 image, 5×5×3 filter (equivalent to 1×75 weights of an FC layer)
• Convolve the filter with the image, i.e., "slide over the image spatially, computing dot products"

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 34


CNN: Convolution
• Each output value of the convolution layer is the result of taking a dot product between the 5×5×3 filter and a small 5×5×3 chunk of the 32×32×3 image (i.e., a 5×5×3 = 75-dimensional dot product + bias)

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 35


CNN: Convolution
• Convolving (sliding) the 5×5×3 filter over all spatial locations of the 32×32×3 image produces a 28×28×1 activation map

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 36


CNN: Convolution
• If there are four 5×5×3 filters, we get 4 separate activation maps: convolving (sliding) each filter over all spatial locations yields a 28×28×4 output volume

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 37
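A naive (unoptimized) sketch of the convolution described above, assuming "valid" convolution with no padding; it reproduces the 32×32×3 input with a 5×5×3 filter → 28×28 activation map arithmetic. The function name and random data are illustrative.

```python
import numpy as np

def conv2d(image, filt, stride=1):
    """Naive valid convolution: slide the filter over the image spatially,
    taking a dot product at each location (no padding)."""
    H, W, C = image.shape
    k = filt.shape[0]
    out_h = (H - k) // stride + 1
    out_w = (W - k) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i*stride:i*stride+k, j*stride:j*stride+k, :]
            out[i, j] = np.sum(patch * filt)   # 5*5*3 = 75-dimensional dot product
    return out

image = np.random.rand(32, 32, 3)   # 32x32x3 input
filt  = np.random.rand(5, 5, 3)     # one 5x5x3 filter
print(conv2d(image, filt).shape)    # (28, 28) activation map
```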


CNN: An Example

• Closer look at spatial dimensions

• 7×7 input (spatially)
• Assume a 3×3 filter, applied with stride 1

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 38


CNN: An Example

• Closer look at spatial dimensions

• 7×7 input (spatially)
• A 3×3 filter applied with stride 1 → 5×5 output

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 42


CNN: An Example

• Closer look at spatial dimensions

• 7×7 input (spatially)
• Assume a 3×3 filter, applied with stride 2

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 43


CNN: An Example

• Closer look at spatial dimensions

• 7×7 input (spatially)
• A 3×3 filter applied with stride 2 → 3×3 output

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 45


CNN: An Example

• Closer look at spatial dimensions

• 7×7 input (spatially)
• A 3×3 filter applied with stride 3? Doesn't fit!
• A 3×3 filter cannot be applied to a 7×7 input with stride 3

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 46


CNN: An Example

• In practice: it is common to zero-pad the border

  • Padding is used to control the output size

• Example: 7×7 input (spatially), zero-padded with a 1-pixel border (9×9 after padding), 3×3 filter applied with stride 3 → 3×3 output

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 47


CNN: An Example (Animation)

No padding, stride 1 Padding 1, stride 1

Padding 1, stride 2 (odd)

No padding, stride 2 Padding 1, stride 2

Algorithmic Intelligence Laboratory *source : https://github.com/vdumoulin/conv_arithmetic 48


CNN: An Example

• Input volume: 32×32×3
• 10 5×5 filters with stride 1, pad 2
• Output volume size = ?

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 49


CNN: An Example

• Input volume: 32×32×3
• 10 5×5 filters with stride 1, pad 2
• Output volume size: (32 + 2×2 - 5)/1 + 1 = 32 spatially ⇒ 32×32×10

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 50


CNN: An Example

• Input volume: 32×32×3
• 10 5×5 filters with stride 1, pad 2
• Number of parameters in this layer?

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 51


CNN: An Example

• Input volume: 32×32×3
• 10 5×5 filters with stride 1, pad 2
• Number of parameters: each filter has 5×5×3 + 1 = 76 params (+1 for the bias) ⇒ 76×10 = 760

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 52
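A small sketch of the output-size and parameter-count arithmetic used in these examples, assuming the standard formula (W + 2P - F)/S + 1; the helper names are made up for illustration.

```python
def conv_output_size(input_size, filter_size, stride, pad):
    # Spatial output size: (W + 2P - F) / S + 1
    return (input_size + 2 * pad - filter_size) // stride + 1

def conv_param_count(filter_size, in_channels, num_filters):
    # Each filter: F*F*C weights + 1 bias
    return (filter_size * filter_size * in_channels + 1) * num_filters

print(conv_output_size(32, 5, 1, 2))   # 32  (so the output volume is 32x32x10)
print(conv_param_count(5, 3, 10))      # 760
```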


CNN: Convolution

• A ConvNet is a sequence of convolutional layers, each followed by a non-linearity
  • e.g., a 32×32×3 image → Conv (four 5×5×3 filters) + ReLU → 28×28×4 → Conv (six 5×5×4 filters) + ReLU → 24×24×6 → Conv (ten 5×5×6 filters) + ReLU → 20×20×10

• Choices of non-linearity
  • Tanh / Sigmoid
  • ReLU [Nair et al., 2010]
  • Leaky ReLU [Maas et al., 2013]

*reference: http://cs231n.stanford.edu/2017/
Algorithmic Intelligence Laboratory *Image source: https://towardsdatascience.com/activation-functions-neural-networks-1cbd9f8d91d6 53
CNN: Pooling

• Pooling layer
• Makes the representations smaller and more manageable
• Operates over each activation map independently
• Enhances translation invariance (invariance to small transformations)
• Larger receptive fields (see more of input)
• Regularization effect

• Example: pooling downsamples a 224×224×64 volume to 112×112×64
Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 54
CNN: Pooling

• Max pooling and average pooling
  • e.g., with 2×2 filters and stride 2

• Other kinds of pooling layers are also used
  • e.g., stochastic pooling, ROI pooling

*source:
https://deepsense.ai/region-of-interest-pooling-explained/
http://mlss.tuebingen.mpg.de/2015/slides/fergus/Fergus_1.pdf
Algorithmic Intelligence Laboratory https://vaaaaaanquish.hatenablog.com/entry/2015/01/26/060622 55
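A minimal sketch of 2×2 max/average pooling over a single activation map; the input values and function name are illustrative.

```python
import numpy as np

def pool2d(x, size=2, stride=2, mode="max"):
    """2x2 max or average pooling over a single activation map."""
    H, W = x.shape
    out_h, out_w = (H - size) // stride + 1, (W - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = x[i*stride:i*stride+size, j*stride:j*stride+size]
            out[i, j] = patch.max() if mode == "max" else patch.mean()
    return out

x = np.array([[1., 1., 2., 4.],
              [5., 6., 7., 8.],
              [3., 2., 1., 0.],
              [1., 2., 3., 4.]])
print(pool2d(x, mode="max"))   # [[6. 8.] [3. 4.]]  (each 2x2 block reduced to its max)
```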
CNN: Visualization

• Visualization of CNN feature representations [Zeiler et al., 2014]


• VGG-16 [Simonyan et al., 2015]

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 56


CNN in Computer Vision: Everywhere

Classification and retrieval [Krizhevsky et al., 2012]

Algorithmic Intelligence Laboratory 57


CNN in Computer Vision: Everywhere

Detection [Ren et al., 2015] Segmentation [Farabet et al., 2013]

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 58


CNN in Computer Vision: Everywhere
Self-driving cars Human pose estimation [Cao et al., 2017]

Image captioning [Vinyals et al., 2015][Karpathy et al., 2015]

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 59


Table of Contents

1. Deep Neural Networks (DNN)


• Basics
• Training : Back propagation

2. Convolutional Neural Networks (CNN)


• Basics
• Convolution and Pooling
• Some applications

3. Recurrent Neural Networks (RNN)


• Basics
• Character-level language model (example)

4. Question
• Why is it difficult to train a deep neural network ?

Algorithmic Intelligence Laboratory 60


RNN: Basics

• A CNN models spatial information (with some translation invariance)

• Recurrent Neural Network (RNN)

  • Models temporal information
  • The hidden state is a function of the current input and the previous time step's information

• Temporal information is important in many applications


• Language
• Speech
• Video

Algorithmic Intelligence Laboratory 61


RNN: Basics

• Process a sequence of vectors x by applying a recurrence formula at every time step t:

  h_t = f_W(h_{t-1}, x_t)

  where h_t is the new state, h_{t-1} is the old state, x_t is the input vector at time step t, and f_W is a function parameterized by W (e.g., a DNN or CNN)

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 62


RNN: Basics

• Process a sequence of vectors x by applying the recurrence formula h_t = f_W(h_{t-1}, x_t) at every time step t

• The same function f_W and the same set of parameters W are used at every time step

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 63


RNN: Vanilla RNN

• Simple RNN
  • The state consists of a single "hidden" vector h_t
  • Vanilla RNN (sometimes called the Elman RNN): h_t = tanh(W_hh h_{t-1} + W_xh x_t), with output y_t = W_hy h_t

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 64
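A minimal sketch of the vanilla RNN recurrence above; the hidden size, input size, and random weights are illustrative assumptions. The point is that the same weights are reused at every time step.

```python
import numpy as np

def rnn_step(h_prev, x, Wxh, Whh, b):
    """Vanilla (Elman) RNN recurrence: h_t = tanh(Whh h_{t-1} + Wxh x_t + b)."""
    return np.tanh(Whh @ h_prev + Wxh @ x + b)

rng = np.random.default_rng(0)
hidden, inp = 3, 4
Wxh = rng.normal(size=(hidden, inp))
Whh = rng.normal(size=(hidden, hidden))
b = np.zeros(hidden)

h = np.zeros(hidden)
for x in rng.normal(size=(5, inp)):   # a sequence of 5 input vectors
    h = rnn_step(h, x, Wxh, Whh, b)   # same weight matrices reused at every time step
print(h)
```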


RNN: Computation Graph

Re-use the same weight matrix at every time step

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 65


RNN: Computation Graph (Many to Many)

e.g., Machine Translation


(Sequence of words → sequence of words)

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 66


RNN: Computation Graph (Many to One)

e.g., Sentiment Classification


(Sequence of words → sentiment, e.g., good paper or not?)

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 67


RNN: Computation Graph (One to Many)

e.g., Image Captioning


(Image → sequence of words)

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 68


RNN: An Example

• Character-level language model

• Vocabulary: [h, e, l, o]

• Example training sequence: "hello"

• The input chars "h", "e", "l", "l" are one-hot encoded at the input layer; the hidden layer produces a hidden state vector at every time step

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 69


RNN: An Example

• Character-level language model

• Vocabulary: [h, e, l, o]

• Example training sequence: "hello"

• At every time step the output layer produces a score for each character in the vocabulary; for the input chars "h", "e", "l", "l" the target chars are "e", "l", "l", "o"

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 70


RNN: An Example

• Character-level language model

• Vocabulary: [h, e, l, o]

• At test time, the output scores are passed through a softmax to get a probability distribution over characters; sample one character at a time and feed it back into the model as the next input (samples here: "e", "l", "l", "o")

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 71
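A hedged sketch of the test-time sampling loop for a character-level model over [h, e, l, o]. The weights here are random and untrained (so the output will not spell "hello"); the point is the one-hot input → hidden state → softmax → sample → feed-back loop.

```python
import numpy as np

vocab = ['h', 'e', 'l', 'o']
rng = np.random.default_rng(0)
hidden = 3
Wxh = rng.normal(size=(hidden, 4))      # input-to-hidden weights (illustrative, untrained)
Whh = rng.normal(size=(hidden, hidden)) # hidden-to-hidden weights
Why = rng.normal(size=(4, hidden))      # hidden-to-output weights

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

h, idx, out = np.zeros(hidden), 0, ['h']   # start from the character "h"
for _ in range(4):
    x = np.eye(4)[idx]                     # one-hot encode the current character
    h = np.tanh(Wxh @ x + Whh @ h)         # hidden layer (vanilla RNN step)
    p = softmax(Why @ h)                   # softmax over output scores
    idx = rng.choice(4, p=p)               # sample the next character
    out.append(vocab[idx])                 # feed the sampled character back as input
print(''.join(out))
```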


RNN: Backpropagation Through Time (BPTT)

• Backpropagation through time (BPTT)


• Forward through the entire sequence to compute the loss, then backward through the
entire sequence to compute the gradients

Algorithmic Intelligence Laboratory *reference : http://cs231n.stanford.edu/2017/ 72


Contents

1. Deep Neural Networks (DNN)


• Basics
• Training : Back propagation

2. Convolutional Neural Networks (CNN)


• Basics
• Convolution and Pooling
• Some applications

3. Recurrent Neural Networks (RNN)


• Basics
• Character-level language model (example)

4. Question
• Why is it difficult to train a deep neural network?

Algorithmic Intelligence Laboratory 73


Question

• Why is it difficult to train a deep neural network?

• Can we simply stack multiple layers and train them all?
  • Unfortunately, it does not work well
  • Even if we had an infinite amount of computational resources

• Vanishing gradient problem:

  • The magnitude of the gradients shrinks exponentially as we backpropagate through many layers
  • This is because typical activation functions such as sigmoid or tanh are bounded, so their derivatives are small (< 1)
  • This phenomenon is called the vanishing gradient problem

Algorithmic Intelligence Laboratory 74


Vanishing Gradient Problem

• Why do gradients vanish?


• Think of a simplified 3-layer neural network

Algorithmic Intelligence Laboratory 75


Vanishing Gradient Problem

• Why do gradients vanish?


• Think of a simplified 3-layer neural network

• First, let’s update


• Calculate the gradient of the loss with respect to

Algorithmic Intelligence Laboratory 76


Vanishing Gradient Problem

• Why do gradients vanish?


• Think of a simplified 3-layer neural network

• How about the weights in the earlier layers? (each factor in the chain has gradient < 1)

  • Calculate the gradient of the loss with respect to them: the chain rule multiplies one factor per layer, and each factor is < 1

  • Repeatedly multiplying values < 1 decreases the gradient magnitude exponentially
Algorithmic Intelligence Laboratory 77
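A tiny numerical illustration (not from the slides) of why the product of per-layer factors shrinks: the sigmoid derivative is at most 0.25, so multiplying many such factors drives the gradient toward zero. The depth and the choice of weight = 1 are illustrative simplifications.

```python
import numpy as np

def dsigmoid(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1 - s)   # at most 0.25 (attained at z = 0)

# Product of per-layer chain-rule factors (derivative * weight, weight taken as 1 here)
grad = 1.0
for layer in range(20):
    grad *= dsigmoid(0.0) * 1.0
print(grad)   # ~0.25**20, essentially zero: the gradient has vanished
```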
Vanishing Gradient Over Time

• This is even more problematic in a vanilla RNN (with tanh/sigmoid activations)

  • When trying to handle long temporal dependencies
  • As in the previous example, the gradient vanishes over time

Algorithmic Intelligence Laboratory *source :https://mediatum.ub.tum.de/doc/673554/file.pdf 78


Quiz

• The vanishing gradient problem is critical in training neural networks

• Q: Can we just use an activation function whose gradient is > 1?

Algorithmic Intelligence Laboratory 79


Answer for Quiz

• Not really. It causes another problem, so-called exploding gradients.

• Consider using an exponential activation function:
  • The magnitude of its gradient is always larger than 1 when the input is > 0
  • If the outputs of the network are positive, the gradients used for the updates will explode

• This makes training very unstable

  • The weights are updated by very large amounts, often resulting in NaN values
  • This is a very critical problem in training neural networks

Algorithmic Intelligence Laboratory 80


How Can We Overcome Vanishing Gradient Problems?

• Possible solutions
• Activation functions
• CNN: Residual networks [He et al., 2016]
• RNN: LSTM (Long Short-Term Memory)

LSTM (Long Short-Term Memory)

*source
https://mediatum.ub.tum.de/doc/673554/file.pdf
Algorithmic Intelligence Laboratory https://medium.com/@shrutijadon10104776/survey-on-activation-functions-for-deep-learning-9689331ba092 81
Solving Vanishing Gradient: Activation Functions

• Use different activation functions that are not bounded:


• Recent works largely use ReLU or its variants
• No saturation (for positive inputs), so they are easier to optimize

*source: https://medium.com/@shrutijadon10104776/survey-on-activation-functions-for-deep-learning-9689331ba092
Algorithmic Intelligence Laboratory 82
Solving Vanishing Gradient: Activation Functions

• Several generalizations of ReLU


• Leaky ReLU [Maas et al., 2013]: introduces a non-zero gradient for "dying ReLUs", f(x) = max(0.01x, x)
• Parametric ReLU (PReLU) [He et al., 2015]: an additional learnable parameter a on the leaky ReLU, f(x) = max(x/a, x)
• Randomized ReLU (RReLU) [Xu et al., 2015]: samples the parameter a from a uniform distribution

• Concatenated ReLU (CReLU) [Shang et al., 2016]: a two-sided ReLU, f(x) = max(-x, x) in concatenated form, motivated by the "opposite pairs" of filters found in CNNs (without it, the network needs to learn twice the information)

Algorithmic Intelligence Laboratory 83
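A small sketch of these ReLU variants as element-wise functions; the PReLU form follows the slide's max(x/a, x) notation (a is learned in PReLU, sampled in RReLU), CReLU is written in its concatenated [ReLU(x), ReLU(-x)] form, and the parameter values are illustrative.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def leaky_relu(x):
    return np.maximum(0.01 * x, x)              # f(x) = max(0.01x, x)

def prelu(x, a):
    return np.maximum(x / a, x)                 # slide's form f(x) = max(x/a, x); a is learnable

def crelu(x):
    return np.concatenate([relu(x), relu(-x)])  # concatenated "two-sided" ReLU, doubles the output size

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(relu(x))
print(leaky_relu(x))
print(prelu(x, a=5.0))   # slope 1/5 for negative inputs
print(crelu(x))
```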


Solving Vanishing Gradient: Residual Networks

• Residual networks (ResNet [He et al., 2016])


• Feed-forward NN with “shortcut connections”
• Can preserve gradient flow throughout the entire depth of the network
• Possible to train more than 100 layers by simply stacking residual blocks

Plain network Residual network

Algorithmic Intelligence Laboratory 84
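A minimal sketch of a residual block in plain numpy: the identity shortcut means the gradient can flow through the addition unchanged. Real residual blocks use convolutions and batch normalization, which are omitted here; the weights and dimensions are illustrative.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def residual_block(x, W1, W2):
    """Residual block: output = F(x) + x, where the shortcut (identity) connection
    preserves gradient flow through the addition."""
    out = relu(W1 @ x)       # first transformation + non-linearity
    out = W2 @ out           # second transformation
    return relu(out + x)     # add the shortcut connection, then the final ReLU

rng = np.random.default_rng(0)
d = 4
x = rng.normal(size=d)
W1, W2 = rng.normal(size=(d, d)), rng.normal(size=(d, d))
print(residual_block(x, W1, W2))
```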


Solving Vanishing Gradient: LSTM and GRU

• LSTM (Long Short-Term Memory) and GRU (Gated Recurrent Units)


• A specially designed RNN cell that can remember information for much longer periods

3 main steps:
• Forget irrelevant parts of previous state
• Selectively update the cell state based on the
new input
• Selectively decide what part of the cell state to
output as the new hidden state

Preservation of gradient information in LSTM


*source :
http://harinisuresh.com/2016/10/09/lstms/
Algorithmic Intelligence Laboratory https://mediatum.ub.tum.de/doc/673554/file.pdf 85
References

• [Nair et al., 2010] "Rectified linear units improve restricted boltzmann machines." ICML 2010.
link : https://dl.acm.org/citation.cfm?id=3104425
• [Krizhevsky et al., 2012] "Imagenet classification with deep convolutional neural networks." NIPS 2012
link : https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
• [Maas et al., 2013] "Rectifier nonlinearities improve neural network acoustic models." ICML 2013.
link : https://ai.stanford.edu/~amaas/papers/relu_hybrid_icml2013_final.pdf
• [Farabet et al., 2013] "Learning hierarchical features for scene labeling." IEEE transactions on PAMI 2013
link : https://www.ncbi.nlm.nih.gov/pubmed/23787344
• [Zeiler et al., 2014] "Visualizing and understanding convolutional networks." ECCV 2014.
link : https://cs.nyu.edu/~fergus/papers/zeilerECCV2014.pdf
• [Simonyan et al., 2015] "Very deep convolutional networks for large-scale image recognition.” ICLR 2015
link : https://arxiv.org/abs/1409.1556
• [Ren et al., 2015] "Faster r-cnn: Towards real-time object detection with region proposal networks." NIPS 2015
link : https://arxiv.org/abs/1506.01497
• [Vinyals et al., 2015] "Show and tell: A neural image caption generator." CVPR 2015.
link : https://arxiv.org/abs/1411.4555
• [Karpathy et al., 2015] "Deep visual-semantic alignments for generating image descriptions." CVPR 2015
link : https://cs.stanford.edu/people/karpathy/cvpr2015.pdf
• [He et al., 2015] "Delving deep into rectifiers: Surpassing human-level performance on imagenet
classification." ICCV 2015.
link : https://arxiv.org/abs/1502.01852

Algorithmic Intelligence Laboratory 86


References

• [Xu et al., 2015] "Empirical evaluation of rectified activations in convolutional network." arXiv preprint, 2015.
link : https://arxiv.org/abs/1505.00853
• [Shang et al., 2016] "Understanding and improving convolutional neural networks via concatenated rectified
linear units." ICML 2016.
link : https://arxiv.org/abs/1603.05201
• [He et al., 2016] "Deep residual learning for image recognition." CVPR 2016
link : https://arxiv.org/abs/1512.03385
• [Cao et al., 2017] "Realtime multi-person 2D pose estimation using part affinity fields." CVPR 2017
link : https://arxiv.org/abs/1611.08050
• [Fei-Fei and Karpathy, 2017] “CS231n: Convolutional Neural Networks for Visual Recognition”, 2017. (Stanford
University)
link : http://cs231n.stanford.edu/2017/

Algorithmic Intelligence Laboratory 87
