0% found this document useful (0 votes)

11 views

Pdf&rendition 1

The document discusses various applications of computer vision including facial recognition, face filters, image search, retail, inventory management, self-driving cars, and medical imaging. It also covers computer vision tasks such as image classification, object detection, and instance segmentation. Basic concepts discussed include pixels, resolution, grayscale and RGB images, and image features.

Uploaded by

pragunagarwalx

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

Pdf&rendition 1

Uploaded by

pragunagarwalx

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

4/17/24, 10:52 AM about:blank 4/17/24, 10:52 AM about:blank

Computer Vision
Computer Vision
Introduction

The Computer Vision domain of Artificial Intelligence, enables machines to see through images or visual data, process and
analyse them on the basis of algorithms and methods in order to analyse actual phenomena with images.

Applications of Computer Vision - Facial Recognition

With the advent of smart cities and smart homes, Computer Vision plays a vital role in making the home smarter. Security
being the most important application involves the use of Computer Vision for facial recognition. It can be either guest Classification: The image Classification problem is the task of assigning an input image one label from a fixed set of
recognition or log maintenance of the visitors. It also finds its application in schools for an attendance system based on the categories. This is one of the core problems in CV that, despite its simplicity, has a large variety of practical applications.
facial recognition of students.
Classification + Localisation: This is the task that involves both processes of identifying what object is present in the image
and at the same time identifying at what location that object is present in that image. It is used only for single objects.
Applications of Computer Vision - Face Filters Object Detection: Object detection is the process of finding instances of real-world objects such as faces, bicycles, and
buildings in images or videos. Object detection algorithms typically use extracted features and learning algorithms to
Modern-day apps like Instagram and Snapchat have a lot of features based on the usage of computer vision. The application recognize instances of an object category. It is commonly used in applications such as image retrieval and automated vehicle
of face filters is one among them. Through the camera, the machine or the algorithm is able to identify the facial dynamics of parking systems.
the person and applies the facial filter selected.
Instance Segmentation: Instance Segmentation is the process of detecting instances of the objects, giving them a category
Applications of Computer Vision - Google’s Search by Image and then giving each pixel a label on the basis of that. A segmentation algorithm takes an image as input and outputs a
collection of regions (or segments).
The maximum amount of searching for data on Google’s search engine comes from textual data, but at the same time, it has
an interesting feature of getting search results through an image. This uses Computer Vision as it compares different features Basics of Pixels: The word “pixel” means a picture element. Every photograph, in digital form, is made up of pixels. They are
of the input image to the database of images and gives us the search result while at the same time analysing various features of the smallest unit of information that make up a picture. Usually round or square, they are typically arranged in a 2-
the image. dimensional grid.
Applications of Computer Vision - Retail Resolution: The number of pixels in an image is sometimes called the resolution. When the term is used to describe pixel
count, one convention is to express resolution as the width by the height, for example, a monitor resolution of 1280×1024.
The retail field has been one of the fastest growing fields and at the same time is using Computer Vision for making the user
This means there are 1280 pixels from one side to the other, and 1024 from top to bottom.
experience more fruitful. Retailers can use Computer Vision techniques to track customers’ movements through stores,
analyse navigational routes and detect walking patterns. Another convention is to express the number of pixels as a single number, like a 5 megapixel camera (a megapixel is a million
pixels). This means the pixels along the width multiplied by the pixels along the height of the image taken by the camera
Applications of Computer Vision - Inventory Management
equals 5 million pixels. In the case of our 1280×1024 monitors, it could also be expressed as 1280 x 1024 = 1,310,720, or
Through security camera image analysis, a Computer Vision algorithm can generate a very accurate estimate of the items 1.31 megapixels.
available in the store. Also, it can analyse the use of shelf space to identify suboptimal configurations and suggest better item
Pixel value: Each of the pixels that represent an image stored inside a computer has a pixel value that describes how bright
placement.
that pixel is, and/or what colour it should be. The most common pixel format is the byte image, where this number is stored as
Applications of Computer Vision - Self Driving Cars an 8-bit integer giving a range of possible values from 0 to 255. Typically, zero is to be taken as no colour or black and 255 is
taken to be full colour or white.
Computer Vision is the fundamental technology behind developing autonomous vehicles. Most leading car manufacturers in
the world are reaping the benefits of investing in artificial intelligence for developing on-road versions of hands-free Grayscale Images: Grayscale images are images that have a range of shades of gray without apparent colour. The darkest
technology. This involves the process of identifying the objects, getting navigational routes and also at the same time possible shade is black, which is the total absence of colour or zero value of a pixel. The lightest possible shade is white,
environment monitoring. which is the total presence of colour or 255 value of a pixel. Intermediate shades of gray are represented by equal brightness
levels of the three primary colours. A grayscale has each pixel of size 1 byte having a single plane of 2d array of pixels. The
Applications of Computer Vision - Medical Imaging size of a grayscale image is defined as the Height x Width of that image.

For the last decades, computer-supported medical imaging application has been a trustworthy help for physicians. It doesn’t RGB Images: All the images that we see around are coloured images. These images are made up of three primary colours
only create and analyse images, but also becomes an assistant and helps doctors with their interpretation. The application is Red, Green and Blue. All the colours that are present can be made by combining different intensities of red, green and blue.
used to read and convert 2D scan images into interactive 3D models that enable medical professionals to gain a detailed
understanding of a patient’s health condition. Image Features: In computer vision and image processing, a feature is a piece of information that is relevant for solving the
computational task related to a certain application. Features may be specific structures in the image such as points, edges or
Applications of Computer Vision - Google Translate App objects.

All you need to do to read signs in a foreign language is to point your phone’s camera at the words and let the Google In image processing, we can get a lot of features from the image. It can be either a blob, an edge or a corner. These features
Translate app tell you what it means in your preferred language almost instantly. By using optical character recognition to see help us to perform various tasks and then get the analysis done on the basis of the application. Now the question that arises is
the image and augmented reality to overlay an accurate translation, this is a convenient tool that uses Computer Vision. which of the following are good features to be used? As you saw in the previous activity, the features having corners are easy
to find as they can be found only at a particular location in the image, whereas the edges are spread over a line or an edge look
Computer Vision Tasks: The various applications of Computer Vision are based on a certain number of tasks that are the same all along. This tells us that the corners are always good features to extract from an image followed by the edges.
performed to get certain information from the input image which can be directly used for prediction or forms the base for
further analysis. The tasks used in a computer vision application are : OpenCV or Open Source Computer Vision Library is a tool that helps a computer extract these features from the images. It
is used for all kinds of image and video processing and analysis. It is capable of processing images and videos to identify
about:blank 1/4 about:blank 2/4
4/17/24, 10:52 AM about:blank 4/17/24, 10:52 AM about:blank
objects, faces, or even handwriting. Similar to the Convolutional Layer, the Pooling layer is responsible for reducing the spatial size of the Convolved Feature
while still retaining the important features. There are two types of pooling which can be performed on an image.
Convolution: Different filters applied to an image change the pixel values evenly throughout the image with the help of the
process of convolution and the convolution operator which is commonly used to create these effects. As we change the values i. Max Pooling : Max Pooling returns the maximum value from the portion of the image covered by the Kernel.
of these pixels, the image changes. This process of changing pixel values is the base of image editing. ii. Average Pooling: Max Pooling returns the maximum value from the portion of the image covered by the Kernel.

We all use a lot of image editing software like photoshop and at the same time use apps like Instagram and Snapchat, which The pooling layer is an important layer in the CNN as it performs a series of tasks which are as follows :
apply filters to the image to enhance the quality of that image.
Makes the image smaller and more manageable
Convolution: Convolution is a simple Mathematical operation that is fundamental to many common image-processing Makes the image more resistant to small transformations, distortions and translations in the input image.
operators. Convolution provides a way of `multiplying together' two arrays of numbers, generally of different sizes, but of the
same dimensionality, to produce a third array of numbers of the same dimensionality. An (image) convolution is simply an Fully Connected Layer
element-wise multiplication of image arrays and another array called the kernel followed by a sum.
The final layer in the CNN is the Fully Connected Layer (FCP). The objective of a fully connected layer is to take the results
What is a Kernel? of the convolution/pooling process and use them to classify the image into a label.

A Kernel is a matrix, which is slid across the image and multiplied with the input such that the output is enhanced in a certain
desirable manner. Each kernel has a different value for different kind of effects that we want to apply to an image.

Convolution

i. Convolution is a common tool used for image editing.

ii. It is an element-wise multiplication of an image and a kernel to get the desired output.
iii. In computer vision applications, it is used in Convolutional Neural Network (CNN) to extract image features.

What is a Convolutional Neural Network?

A Convolutional Neural Network (CNN) is a Deep Learning algorithm that can take in an input image, assign importance
(learnable weights and biases) to various aspects/objects in the image and be able to differentiate one from the other.

A convolutional neural network consists of the following layers:

1) Convolution Layer 2) Rectified linear Unit (ReLU) 3) Pooling Layer 4) Fully Connected Layer

Convolution Layer

It is the first layer of a CNN. The objective of the Convolution Operation is to extract the high-level features such as edges,
from the input image. CNN need not be limited to only one Convolutional Layer. Conventionally, the first Convolution Layer
is responsible for capturing the Low-Level features such as edges, colour, gradient orientation, etc. With added layers, the
architecture adapts to the High-Level features as well, giving us a network that has a wholesome understanding of images in
the dataset.

Rectified Linear Unit Function

The next layer in the Convolution Neural Network is the Rectified Linear Unit function or the ReLU layer. After we get the
feature map, it is then passed onto the ReLU layer. This layer simply gets rid of all the negative numbers in the feature map
and lets the positive number stay as it is.

Pooling Layer

about:blank 3/4 about:blank 4/4

Crux of Education Book Complete
No ratings yet
Crux of Education Book Complete
204 pages
Computer Vision Class 10 Notes
100% (5)
Computer Vision Class 10 Notes
7 pages
Artificial Intelligence (Computer Vision) : by Dr. Sehat Ullah Department of Computer Science & IT University of Malakand
No ratings yet
Artificial Intelligence (Computer Vision) : by Dr. Sehat Ullah Department of Computer Science & IT University of Malakand
35 pages
Computer Vision
No ratings yet
Computer Vision
19 pages
Chapter-4 Computer Vision Study material
No ratings yet
Chapter-4 Computer Vision Study material
4 pages
Computer Vision Class X
No ratings yet
Computer Vision Class X
39 pages
Computer Vision
No ratings yet
Computer Vision
29 pages
C10_AI_COMPUTER VISION (1)
No ratings yet
C10_AI_COMPUTER VISION (1)
40 pages
Computer Vision
No ratings yet
Computer Vision
36 pages
CV
No ratings yet
CV
9 pages
52 BDB
No ratings yet
52 BDB
3 pages
Computer Vision Notes
No ratings yet
Computer Vision Notes
4 pages
AI 10th grade pdfs
No ratings yet
AI 10th grade pdfs
30 pages
Computer Vision
No ratings yet
Computer Vision
15 pages
Screenshot 2023-10-23 at 5.51.17 AM
No ratings yet
Screenshot 2023-10-23 at 5.51.17 AM
14 pages
Computer Vision
No ratings yet
Computer Vision
4 pages
AI CV NOTES
No ratings yet
AI CV NOTES
6 pages
4. Computer Vision
No ratings yet
4. Computer Vision
23 pages
Computer Vision Xth (1)
No ratings yet
Computer Vision Xth (1)
9 pages
HW_675075_1Compu
No ratings yet
HW_675075_1Compu
3 pages
Unit-5 Computer Vision
No ratings yet
Unit-5 Computer Vision
3 pages
X AI SS CH5 LM
No ratings yet
X AI SS CH5 LM
54 pages
Ch-Computer Vision
No ratings yet
Ch-Computer Vision
6 pages
Multimedia and Computer Vision unit 5
No ratings yet
Multimedia and Computer Vision unit 5
25 pages
Computer Vision
No ratings yet
Computer Vision
21 pages
cv
No ratings yet
cv
4 pages
ASSIGNMENT 5 - X - AI Handout Computer Vision1
No ratings yet
ASSIGNMENT 5 - X - AI Handout Computer Vision1
3 pages
Class 10 AI 417 Computer Vision
No ratings yet
Class 10 AI 417 Computer Vision
22 pages
AI-Computer Vision
No ratings yet
AI-Computer Vision
16 pages
Computer vision
No ratings yet
Computer vision
13 pages
Computer Vision
No ratings yet
Computer Vision
8 pages
Computer Vision Notes
No ratings yet
Computer Vision Notes
4 pages
COMPUTER VISION notes
No ratings yet
COMPUTER VISION notes
3 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
Computer Vision and Data Science Notes
No ratings yet
Computer Vision and Data Science Notes
11 pages
PartA-Unit5-Ass01
No ratings yet
PartA-Unit5-Ass01
3 pages
e98da8fbc33b80a8a7c6cfc6ddfd7cf5
No ratings yet
e98da8fbc33b80a8a7c6cfc6ddfd7cf5
36 pages
Introduction to Computer Vision
No ratings yet
Introduction to Computer Vision
8 pages
Q-Ans-ComputerVision-Ass01
No ratings yet
Q-Ans-ComputerVision-Ass01
2 pages
Ip Cv Summary Finaaaal-1
No ratings yet
Ip Cv Summary Finaaaal-1
178 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
Question Bank 9 (1)
No ratings yet
Question Bank 9 (1)
6 pages
Computer Vision
No ratings yet
Computer Vision
7 pages
Unit-5 Computer Vision(Ai)
No ratings yet
Unit-5 Computer Vision(Ai)
14 pages
Computer Vision
No ratings yet
Computer Vision
30 pages
Chunk 2
No ratings yet
Chunk 2
31 pages
CV (Unit1&2ans)
No ratings yet
CV (Unit1&2ans)
32 pages
Class X Computer Vision
No ratings yet
Class X Computer Vision
7 pages
CS312 Module 4
No ratings yet
CS312 Module 4
21 pages
Computer Vision
No ratings yet
Computer Vision
3 pages
Computer Visiondk
No ratings yet
Computer Visiondk
12 pages
AD8703 Basic of Computer vision UNIT 1
No ratings yet
AD8703 Basic of Computer vision UNIT 1
65 pages
Computer Vision(7th Sem)
No ratings yet
Computer Vision(7th Sem)
48 pages
Image Manipulation Finall
No ratings yet
Image Manipulation Finall
7 pages
PDF Computer Vision
No ratings yet
PDF Computer Vision
3 pages
Unit 1 to 5 Computer Vision and Image Processing
No ratings yet
Unit 1 to 5 Computer Vision and Image Processing
56 pages
Lect 1 Computervision Student PPT 16-9-2017
No ratings yet
Lect 1 Computervision Student PPT 16-9-2017
143 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
image processing
No ratings yet
image processing
105 pages
Computer Vision Class X
No ratings yet
Computer Vision Class X
17 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Class X Science (SP-07) 18-10-2024
No ratings yet
Class X Science (SP-07) 18-10-2024
5 pages
The Thief's Story RC
No ratings yet
The Thief's Story RC
2 pages
Probability RD VFBHH
No ratings yet
Probability RD VFBHH
24 pages
His CH 1
No ratings yet
His CH 1
24 pages
The Ultimate Roblox Handbook - Pettman, Kevin, Author - 2020 - London - Mortimer Children's - 9781787393684 - Anna's Archive
100% (2)
The Ultimate Roblox Handbook - Pettman, Kevin, Author - 2020 - London - Mortimer Children's - 9781787393684 - Anna's Archive
68 pages
HW 425322 1sampl
No ratings yet
HW 425322 1sampl
8 pages
RNN-Based Radio Resource Management On
No ratings yet
RNN-Based Radio Resource Management On
14 pages
Norton Antivirus 2003
No ratings yet
Norton Antivirus 2003
3 pages
Laboratory Information System
No ratings yet
Laboratory Information System
3 pages
SRS For CAMPUS RECRUITMENT SYSTEM 3
No ratings yet
SRS For CAMPUS RECRUITMENT SYSTEM 3
20 pages
05 Handout 2
No ratings yet
05 Handout 2
5 pages
Proposal 211311034
No ratings yet
Proposal 211311034
5 pages
CS Year 10 Theory June 2022
No ratings yet
CS Year 10 Theory June 2022
14 pages
RELEASE_NOTES
No ratings yet
RELEASE_NOTES
8 pages
The 6 Types of Information Systems and Their Applications
No ratings yet
The 6 Types of Information Systems and Their Applications
46 pages
LanSafev5featuresbenefits
No ratings yet
LanSafev5featuresbenefits
3 pages
Mini Project
No ratings yet
Mini Project
20 pages
Vertopal.com_Day 1 - Environment Creation and Package Installation in Anaconda Prompt, Introduction to Python
No ratings yet
Vertopal.com_Day 1 - Environment Creation and Package Installation in Anaconda Prompt, Introduction to Python
12 pages
Exaquantum
No ratings yet
Exaquantum
12 pages
Call For Paper IEEE Conference On Digital Platform and Societal Harm (SCOPUS Indexed Conference)
No ratings yet
Call For Paper IEEE Conference On Digital Platform and Societal Harm (SCOPUS Indexed Conference)
9 pages
CBFC User Manual
No ratings yet
CBFC User Manual
89 pages
157 1498417701 - 25-06-2017 PDF
No ratings yet
157 1498417701 - 25-06-2017 PDF
4 pages
ThinkPad L570 Spec
No ratings yet
ThinkPad L570 Spec
1 page
Novell Netware 5 - Advanced Admin - Instructor Guide PDF
No ratings yet
Novell Netware 5 - Advanced Admin - Instructor Guide PDF
1,128 pages
How To Import, Process and Export Zeno Mobile Data
No ratings yet
How To Import, Process and Export Zeno Mobile Data
12 pages
How To Add and Customize User Defined Title Blocks in An Isogen Style FAQ PDF
No ratings yet
How To Add and Customize User Defined Title Blocks in An Isogen Style FAQ PDF
16 pages
(Ebook) jQuery Mobile Web Development Essentials, 2nd Edition: Build mobile-optimized websites using the simple, practical, and powerful jQuery-based framework by Raymond Camden, Andy Matthews ISBN 9781782167891, 1782167897 pdf download
100% (1)
(Ebook) jQuery Mobile Web Development Essentials, 2nd Edition: Build mobile-optimized websites using the simple, practical, and powerful jQuery-based framework by Raymond Camden, Andy Matthews ISBN 9781782167891, 1782167897 pdf download
46 pages
AutoPIPE Tutorial
No ratings yet
AutoPIPE Tutorial
161 pages
Lesson 5 CSS Background
No ratings yet
Lesson 5 CSS Background
16 pages
MC3 Manual
No ratings yet
MC3 Manual
16 pages
Home Automation System Based Mobile Application: Poonphon Suesaowaluk
No ratings yet
Home Automation System Based Mobile Application: Poonphon Suesaowaluk
6 pages
React - Js MCQ (Multiple Choice Questions) - Javatpoint
No ratings yet
React - Js MCQ (Multiple Choice Questions) - Javatpoint
17 pages
PeopleLinkInteractive-Display-R98
No ratings yet
PeopleLinkInteractive-Display-R98
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Pdf&rendition 1

Uploaded by

Pdf&rendition 1

Uploaded by

4/17/24, 10:52 AM about:blank 4/17/24, 10:52 AM about:blank

Applications of Computer Vision - Facial Recognition

i. Convolution is a common tool used for image editing.

What is a Convolutional Neural Network?

A convolutional neural network consists of the following layers:

Rectified Linear Unit Function

about:blank 3/4 about:blank 4/4

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.