Deep Learning Assignment 1 Solution: Name: Vivek Rana Roll No.: 1709113908
A Perceptron is an algorithm for supervised learning of binary classifiers. It enables a neuron to learn by processing the elements of the training set one at a time.
There are two types of Perceptrons: the single-layer Perceptron, which can learn only linearly separable patterns, and the multilayer Perceptron (a feedforward neural network), which contains two or more layers and has greater processing power. A minimal sketch of the single-layer Perceptron learning rule is given below.
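As an illustration, here is a minimal sketch of the Perceptron learning rule in Python/NumPy, assuming a step activation, a learning rate of 0.1, and a small hand-made AND dataset (all illustrative assumptions, not part of the assignment):

```python
import numpy as np

# Illustrative linearly separable data: the AND function.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])  # inputs
y = np.array([0, 0, 0, 1])                      # targets

w = np.zeros(2)   # weights
b = 0.0           # bias
lr = 0.1          # learning rate (an assumed value)

for epoch in range(20):
    for xi, target in zip(X, y):
        pred = 1 if np.dot(w, xi) + b > 0 else 0  # step activation
        # Perceptron rule: nudge weights toward misclassified examples.
        w += lr * (target - pred) * xi
        b += lr * (target - pred)

print(w, b)  # learned weights and bias
```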
Ans: Logistic regression is a statistical model that, in its basic form, uses a logistic function to model a binary dependent variable. In regression analysis, logistic regression (or logit regression) estimates the parameters of a logistic model (a form of binary regression). A sketch of the logistic function is given below.
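A minimal sketch of the logistic (sigmoid) function and the logistic-regression prediction, with illustrative (assumed) weights and input:

```python
import numpy as np

def sigmoid(z):
    """Logistic (sigmoid) function: maps any real number into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# For an example x with weights w and bias b (illustrative values),
# logistic regression models P(y = 1 | x) = sigmoid(w . x + b).
w = np.array([0.5, -0.25])  # assumed weights
b = 0.1                     # assumed bias
x = np.array([2.0, 1.0])    # assumed input
print(sigmoid(np.dot(w, x) + b))  # predicted probability of the positive class
```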
Ans: A neural network is a network or circuit of neurons or, in the modern sense, an artificial neural network composed of artificial neurons or nodes. Artificial neural networks (or connectionist systems) are computing systems vaguely inspired by the biological neural networks that constitute animal brains. Such systems "learn" to perform tasks by considering examples, generally without being programmed with task-specific rules.
i. Data dependencies: The most important difference between deep learning and traditional machine learning is how performance scales with the amount of data. When the data is small, deep learning algorithms do not perform well, because they need a large amount of data to learn the underlying patterns. In that scenario, traditional machine learning algorithms with their handcrafted rules prevail.
ii. Hardware dependencies: Deep learning algorithms depend heavily on high-end machines, whereas traditional machine learning algorithms can work on low-end machines. This is because deep learning algorithms require GPUs, which are an integral part of their working.
iii. Feature engineering: Feature engineering is the process of putting domain knowledge into the creation of feature extractors, in order to reduce the complexity of the data and make patterns more visible to learning algorithms. This process is difficult and expensive in terms of time and expertise.
In machine learning, most of the applied features need to be identified by an expert and then hand-coded according to the domain and data type.
Deep learning algorithms, by contrast, try to learn high-level features from the data themselves. This is a very distinctive part of deep learning and a major step ahead of traditional machine learning: it removes the task of developing a new feature extractor for every problem. A small sketch contrasting the two approaches is given below.
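As a hedged illustration of the contrast: with traditional ML an expert hand-codes features (here, simple summary statistics over a made-up signal), while a deep network is given the raw data and left to learn its own features. The signal and the chosen statistics are illustrative assumptions only.

```python
import numpy as np

# A hypothetical 1-D signal standing in for raw data.
signal = np.sin(np.linspace(0, 4 * np.pi, 100)) + 0.1 * np.random.randn(100)

# Traditional ML: an expert hand-codes summary features from domain knowledge.
handcrafted_features = np.array([
    signal.mean(),                 # average level
    signal.std(),                  # variability
    signal.max() - signal.min(),   # peak-to-peak range
])
print(handcrafted_features)

# Deep learning: the raw signal itself is fed to the network, which is
# expected to learn useful features in its hidden layers.
raw_input_for_network = signal.reshape(1, -1)
print(raw_input_for_network.shape)
```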
iv. Problem-solving approach: When solving a problem with a traditional machine learning algorithm, it is generally recommended to break the problem down into parts, solve them individually, and combine the results. Deep learning, in contrast, advocates solving the problem end-to-end.
v. Interpretability: In deep learning you can mathematically find out which nodes of a deep neural network were activated, but we do not know what those neurons were supposed to model or what the layers of neurons were doing collectively, so we fail to interpret the results. On the other hand, machine learning algorithms like decision trees give us crisp rules as to why they chose what they chose, so it is particularly easy to interpret the reasoning behind them. A small sketch of extracting such rules from a decision tree follows.
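A hedged sketch of why decision trees are easy to interpret, using scikit-learn's export_text to print the learned rules on the built-in iris data (the depth limit and dataset are illustrative choices):

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

# Fit a small decision tree and print its rules as human-readable text.
iris = load_iris()
clf = DecisionTreeClassifier(max_depth=2, random_state=0)
clf.fit(iris.data, iris.target)

# export_text renders the learned decision rules, showing exactly
# which feature thresholds lead to each prediction.
print(export_text(clf, feature_names=list(iris.feature_names)))
```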
i. Hardware dependence: Artificial neural networks require processors with parallel processing power, in accordance with their structure. For this reason, their realization is hardware-dependent.
ii. Unexplained behavior of the network: This is the most important problem of ANNs. When an ANN produces a solution, it does not give a clue as to why or how it arrived at it. This reduces trust in the network.
iii. Determination of proper network structure: There is no specific rule for determining the structure
of artificial neural networks. Appropriate network structure is achieved through experience and
trial and error.
iv. Difficulty of showing the problem to the network: ANNs can work only with numerical information, so problems have to be translated into numerical values before being introduced to the ANN. The representation mechanism chosen here directly influences the performance of the network and depends on the user's ability.
v. Unknown duration of training: Training is considered complete when the network's error on the sample falls to a certain value, but this value does not guarantee that optimum results have been reached.
8. What is the difference between backpropagation and the forward pass?
Ans:
Forward propagation:
In forward propagation we provide an input x; each neuron computes two functions: a linear combination Z = W*X + b and an activation a = relu(Z) (other activation functions may be used). The result is passed forward through every layer until we obtain the predicted output.
Back propagation:
Back propagation is a technique to reduce the loss (the gap between the actual output and the predicted output) by updating the parameters (weights and biases) with an algorithm called gradient descent. So technically the two are different: only in back propagation do we apply gradient descent. Back propagation also performs credit assignment: a wrong output is caused not only by the final layer but also by the previous layers, which is why we calculate the gradients of the loss L with respect to every layer's parameters. A minimal sketch of both passes is given below.
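A minimal sketch of one forward pass and one backward pass for a single linear layer with a ReLU activation and squared-error loss, in Python/NumPy; the shapes, data, and learning rate are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=3)          # input vector
y_true = 1.0                    # target output
W = rng.normal(size=(1, 3))     # weights
b = np.zeros(1)                 # bias
lr = 0.01                       # learning rate

# Forward pass: linear combination, then activation.
z = W @ x + b                   # Z = W*X + b
a = np.maximum(z, 0.0)          # a = relu(Z)
loss = 0.5 * (a[0] - y_true) ** 2

# Backward pass: the chain rule gives the gradients of the loss.
da = a - y_true                 # dL/da
dz = da * (z > 0)               # dL/dZ (ReLU derivative)
dW = np.outer(dz, x)            # dL/dW
db = dz                         # dL/db

# Gradient-descent update of the parameters.
W -= lr * dW
b -= lr * db
print(loss, W, b)
```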
9. Derive the weight updation formula for a neural network with one input layer, one hidden layer and one output layer, having 3, 2 and 1 nodes respectively. [There is no activation layer]
Ans:
Forward Pass:
Inputs: Node 1: x1, Node 2: x2, Node 3: x3
Hidden Layer:
Node 4: Y1 = W1*[ x1 x2 x3 ] + b1
Node 5: Y2 = W2*[ x1 x2 x3 ] + b2
Output Layer:
Node 6: YPred = W3*[ Y1 Y2 ] + b3
Error Function:
Mean Square Error:
E = (1/2) * ( YActual - YPred )^2
Backpropagation (weight updates):
Since there is no activation, the chain rule gives the gradients directly:
dE/dW3 = (YPred - YActual) * [ Y1 Y2 ]
dE/dWi = (YPred - YActual) * W3i * [ x1 x2 x3 ], where W3i is the component of W3 attached to hidden node i.
At Output Layer:
W3 = W3 - α * (YPred - YActual) * [ Y1 Y2 ]
At Hidden Layer (i = 1, 2):
Wi = Wi - α * (YPred - YActual) * W3i * [ x1 x2 x3 ]
where α is the learning rate; the bias updates follow the same pattern with the input term replaced by 1. A numeric sketch of these updates is given below.
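A numeric sketch of the 3-2-1 linear network above, trained with the derived gradient-descent updates; the data, initialization, and learning rate α are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.array([0.5, -1.0, 2.0])    # inputs x1, x2, x3
y_actual = 3.0                    # target output
W1, W2 = rng.normal(size=3), rng.normal(size=3)  # hidden-layer weights
W3 = rng.normal(size=2)           # output-layer weights
b1 = b2 = b3 = 0.0
alpha = 0.01                      # learning rate

for step in range(500):
    # Forward pass (no activation anywhere).
    Y1 = W1 @ x + b1
    Y2 = W2 @ x + b2
    y_pred = W3 @ np.array([Y1, Y2]) + b3
    err = y_pred - y_actual       # dE/dYPred for E = 0.5*(YActual - YPred)^2

    # Gradients from the derivation above (computed before any update).
    gW3, gb3 = err * np.array([Y1, Y2]), err
    gW1, gb1 = err * W3[0] * x, err * W3[0]
    gW2, gb2 = err * W3[1] * x, err * W3[1]

    # Gradient-descent updates: W <- W - alpha * dE/dW.
    W3, b3 = W3 - alpha * gW3, b3 - alpha * gb3
    W1, b1 = W1 - alpha * gW1, b1 - alpha * gb1
    W2, b2 = W2 - alpha * gW2, b2 - alpha * gb2

print(y_pred)  # converges toward y_actual = 3.0
```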
10. Which machine learning algorithm uses a maximum margin technique? Explain it in detail.
Ans: The Support Vector Machine (SVM) uses the maximum-margin technique. In machine learning, a margin classifier is a classifier that can give an associated distance from the decision boundary for each example. For instance, if a linear classifier (e.g. a perceptron or linear discriminant analysis) is used, the distance (typically the Euclidean distance, though others may be used) of an example from the separating hyperplane is the margin of that example.
The notion of margin is important in several machine learning classification algorithms, as it can be used to bound the generalization error of the classifier. These bounds are frequently shown using the VC dimension. Of particular prominence are the generalization error bounds for boosting algorithms and support vector machines; an SVM explicitly chooses the separating hyperplane that maximizes the margin to the nearest training examples (the support vectors). A minimal sketch follows.
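A hedged sketch of a maximum-margin classifier using scikit-learn's linear SVC on synthetic data; the dataset and the value of C are illustrative choices:

```python
from sklearn import svm
from sklearn.datasets import make_blobs

# Small synthetic two-class dataset.
X, y = make_blobs(n_samples=40, centers=2, random_state=0)

# C trades margin width against training errors; a large C
# approximates a hard-margin SVM on separable data.
clf = svm.SVC(kernel="linear", C=1000)
clf.fit(X, y)

# The support vectors are the training points lying on the margin;
# they fully determine the maximum-margin hyperplane.
print(clf.support_vectors_)
print(clf.coef_, clf.intercept_)  # w and b of the hyperplane w.x + b = 0
```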