5 - Grid Computing
5 - Grid Computing
• Evolution
Grid Computing
Grid computing enables the sharing, selection, and aggregation
of geographically distributed heterogeneous resources for solving
large-scale problems in science, engineering, and commerce
Power Grid Analogy
• Electrical/Power Grid
– No worry about where the electricity comes from – simply plug your toaster to get
the electrical power
– Infrastructure that makes this possible is called "the power grid“ e.g. Transmission
lines, power stations, etc
• Grid
– No worry about where the computer power you are using comes from - simply plug
your computer in to the Internet to get the computer power
– Infrastructure that makes this possible is called "the Grid“ i.e. it links PCs, servers,
network elements
– Grid is to be pervasive – accessible through the web services/portal
– Utility: you ask for computer power or storage capacity you get it and pay for
What is Grid Computing?
Super computers
services Personal devices
• Computational grid
– Setting aside resources specifically for computing power
– Most of the machines are high-performance servers
• Scavenging grid
– Used with large numbers of desktop machines
– Machines are scavenged for available CPU cycles and other
resources
– Owners of the desktop machines are usually given control
over when their resources are available to participate in the
grid
Types of Grid
• Data grid
– Housing and providing access to data across multiple
organizations
– Users are not concerned with where this data is located as
long as they have access to the data
What Grid computing can do?
• All of these scenarios have one thing in common: They rely on a concept called grid
computing
What Grid computing can do?
LAN
LAN
What Grid computing can do?
• “Data grid”
– Large capacities than single system
– Such spanning can improve data transfer rates through the use of striping
tech
– Duplicate data to serve as a backup
– Sharing is not limited to files - resources, such as equipment, software,
services, licenses
What Grid computing can do?
• Special equipment
Reliability
• Real-time situations, multiple copies of the important jobs can be run on different
machines throughout the grid
• Any individual file or data base can span several storage devices
and machine
Types of Resources
Communications
• The work involved in managing the grid is distributed to increase the scalability
• Job by central job scheduler submitted to a lower level scheduler that handles the
assignment to the specific machine
Submission software
• Any member machine of a grid can be used to submit jobs and queries to the grid
• When a grid is built using dedicated resources - separate submission software is usually
installed on the user’s PC or workstation
Grid software components
Donor software
Communications
• A grid system may include software to help jobs
communicate with each other
• An application may split itself into a large number of sub jobs
• Sub jobs need to be able to locate other specific sub jobs,
establish a communication and send the appropriate data
Intergrid and Intragrid
Intragrid and intergrid
• A simple grid consists of just a few
machines - homogeneous systems in
one department
• Intra grid - heterogeneous machines
configuration - more types of
resources are available in multiple
departments but within the same
organization
• Inter grid - grid may grow to cross
organization boundaries and may be
used to collaborate on projects of
common interest
• Highest levels of security are usually
required in this configuration
Comparison with P2P Applications
– Grid is about bringing all the resources together necessarily with equal
commitment to sharing, and presenting a virtual single system image
Comparison with Clusters
• Clusters are targeted more towards some specified number of users, and pre-
defined set of application, or might be in most generic form simply load
sharing systems
• Grid, on the other hand, has been designed for dynamic number of users and
applications
Comparison with Clusters
• Clusters surely presents a single system image, wherein the user can visualize the
complete collections of nodes as one Single Computer,
• Whereas Grid is about presenting a virtual image of single large computing device
• Nodes are expected to give there full resources and are fully devoted to the complete
system
• Whereas in Grids, computing nodes which are connected at required to give some or
more (as much available) of its computing power
• Cluster are (generally) homogenous collection of machines tightly coupled over a small
region
• Grids are complete opposite of this, and are heterogeneous collection of nodes, wherein
the heterogeneity is shadowed by middle layer applications
Comparison with Client/Server models
• Client/server models: distributed form of computing nodes stretched
across large geographical region and providing end-user service
(Web Services)
• In Grid System: the user is not required to know anything about the
underlying topology or any individual nodes in particular.
• In Grid interaction is with the system as a whole and not with any
node(s) in particular.
• Web services are used to support the grid computing since web
has emerged as the standards – based approach for accessing
network applications
Conclusion
Grid computing appears to be a promising trend for three
reasons:
• Make more cost-effective use of a given amount of
computer resources
• Provides the way to solve problems that can't be
approached without an enormous amount of computing
power
• Suggests that the resources of many computers can be co-
operative to each other so that they are able to solve a
common problem