1.1 introduction to grid computing itcs 4010 grid computing, 2005, unc-charlotte, b. wilkinson

40
1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson.

Post on 19-Dec-2015

220 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.1

Introduction to Grid Computing

ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson.

Page 2: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.2

Need to harness computers

Original driving force behind grid computing the same as behind the early development of networks that became the Internet:

– Connecting computers at distributed sites for high performance computing.

Just as the Internet has changed, grid computing has changed.

Page 3: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.3

History

• Began in mid 1990’s with experiments using computers at geographically dispersed sites.

• Seminal experiment – “I-way” experiment at 1995 Supercomputing conference (SC’95), using 17 sites across the US running:– 60+ applications.– Existing networks (10 networks).

Page 4: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.4

1995 2000 200519901985

Distributed computing

Remote Procedure calls (RPC)Concept of service registry

Beginnings of service oriented architecture

Object oriented approachesJava Remote Method Invocation (RMI)

CORBA (Common Request Broker Architecture)

Cluster computing

Software Techniques:

Computing platforms:

Parallel computers

Geographically distributed computers (Grid computing in the broadest sense)

Web services

SC’95 experiment

Page 5: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.5

Grid Computing

• Using distributed computers and resources collectively.

• Usually associated with geographically distributed computers and resources on a special high speed network, or the Internet.

• Now become much more that last slide suggests.

Page 6: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.6

Shared Resources

Can share much more than just computers:

• Storage• Sensors for experiments at particular sites• Application Software• Databases• Network capacity, …

Page 7: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.7

Computational Grid Applications

• Biomedical research

• Industrial research

• Engineering research

• Studies in Physics and Chemistry

Page 8: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.8

Sample Grid Computing Projects

Physical Sciences:• Large Hadron Collider project (CERN)• DOE Particle Physics Data grid• DOE Science grid• AstroGrid• Comb-e-Chem project

Natural and Life sciences:• Protein Data grid• Mcell project

Engineering Design:• Distributed Aircraft Maintenance Environment• NASA Information Power grid

Page 9: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.9

Science Today

is a Team Sport

I. Foster

Page 10: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.10

eScience

eScience [n]: Large-scale science carried out through distributed collaborations—often leveraging access to large-scale data & computing

I. Foster

Page 11: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

NSF Network for Earthquake Engineering Simulation (NEES)

Transform our ability to carry out research vital to reducing vulnerability to catastrophic earthquakes

I. Foster

Page 12: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

Global Knowledge Communities: e.g., High Energy Physics

I. Foster

Page 13: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.13www.earthsystemgrid.org

DOE Earth System Grid

Goal: address technical obstacles to the sharing & analysis of high-volume data from advanced earth system models

I. Foster

Page 14: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.14

Earth System Grid I. Foster

Page 15: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.15

TeraGridFunded by NSF in 2002 to link 5 supercomputer sites

with 40 Gb/s links

Page 16: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.16

TeraGrid

Page 17: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.17

Grid networks for collaborative grid computing projects

Grids have been set up at the local level, national level, and international level throughout the world, to promote grid computing

Page 18: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.18

CiscoEPA

North Carolina’s Foundation for Grid: NCREN

4-7 MCNC-owned Clusters distributed throughout the stateLocations still under evaluation

Internet Internet 2

NLR

Internet Internet 2

NLR

InternetInternet

Existing: Blend of owned and leased fiber and circuits moving toward resilient rings powered by Cisco routers

Planned: Strong focus on owned and leased fiber, Lambda, and few circuits, in resilient rings powered by Cisco routers and Wave Division Multiplexers

Close to home:

From “Grid Computing in the Industry” by Wolfgang Gentzsch, presentation to Fall 2004 grid computing course. Full set of slides on course home page.

Page 19: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.19

Grid2003: An Operational National Grid28 sites: Universities + national labs2800 CPUs, 400–1300 jobsRunning since October 2003Applications in HEP, LIGO, SDSS, Genomics

Korea

http://www.ivdgl.org/grid2003From “A Grid of One to a Grid of Many,” Miron Livny, UW-Madison, Keynote presentation, MIDnet conference, 2005.

Page 20: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.20

National GridsMany countries have embraced grid computing and set-up grid computing infrastructure:

• UK e-Science grid• Grid-Ireland• NorduGrid• DutchGrid• POINIER grid (Poland)• ACI grid (France)• Japanese grid• etc, etc., …

Page 21: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.21

UK e-Science Grid

Page 22: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.22

Resource sharing and collaborative computing

• Grid computing is about collaborating and resource sharing as much as it is about high performance computing.

Page 23: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.23

Virtual Organizations

Grid computing offers

potential of virtual organizations:– groups of people, both geographically and

organizationally distributed, working together on a problem, sharing computers AND other resources such as databases and experimental equipment.

• Crosses multiple administrative domains.

Page 24: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.24

Applications

• Originally e-Science applications– Computational intensive

• Not necessarily one big problem but a problem that has to be solved repeatedly with different parameters.

– Data intensive.– Experimental collaborative projects

• Now also e-Business applications to improve business models and practices.

Page 25: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.25(Based on a slide from HP)

Utility ComputingOne of Several Commercial Drivers

shared, traded resources

value

clusters

grid-enabled systems

programmable data center

virtual data center

Open VMS clusters, TruCluster, MC ServiceGuard

Tru64, HP-UX, Linux

switchfabriccompute storage

UDC

computing utility

or

GRID

today

• Utility computing• On-demand• Service-orientation• Virtualization

I. Foster

Page 26: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.26

Grid Computing Software Infrastructure

Page 27: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.27

Globus Project

• Open source software toolkit developed for grid computing.

• Roots in I-way experiment.• Work started in 1996. • Four versions developed to present time.• Reference implementations of grid computing

standards.• Defacto standard for grid computing.

Page 28: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.28

• GSI (Grid Security Infrastructure)– Grid security.

• MDS (Monitoring and discovery Service)– Interface to system and service information.

• GRAM (Grid Resource Allocation Manager) – Remote job submission and control.

• GridFTP– Secure data transfer.

GlobusKey Components

Page 29: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.29

From “Globus Toolkit 4 Tutorial,” MCNC Jan-Feb, 2005, Pawel Plaszczak and Bogdan Lobodzinski, Gridwise Technologies.

2. discover resource

3. submit job

4. transfer data

1. secure environmentGSI

GRAM

MDS

GridFTP

Page 30: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.30

Globus Toolkit: Recent History

• GT2 (2.4 released in 2002)– GRAM, MDS, GridFTP, GSI.

• GT3 (3.2 released mid-2004): redesign– OGSA (Open Grid Service Architecture)/OGSI (Open

Grid Services Infrastructure) based.– Introduced “Grid services” as an extension of web

services.– OGSI now abandoned.

• GT4 (release for April 2005): redesign– WSRF (Web service Resource Framework) based.– Grid standards merged with Web services.

Page 31: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.31

Supercomputing 2003 Demonstration

• We* used Globus version 2.4 in a Supercomputing 2003 demo organized by the University of Melbourne.

• 21 countries involved, numerous sites.

* The Grid group at WCU.

Page 32: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.32

Page 33: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.33

A re-implementation based upon the Open Grid Service Architecture (OGSA) standard.

• We used version 3.2 for the Fall 2004 grid computing course.

• Underlying implementation of version 3.x used OGSI Open Grid Service Infrastructure), which was not embraced by the community.

Version 3

Page 34: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.34

Version 4

• Released April 2005.

• OGSA kept but OGSI abandoned in favor of new implementation standards based around pure web services.(Version 3 used “extended” web services)

• To be used in this course, with other software.

Page 35: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.35

Interconnections and Protocols

Focus now on:

• using standard Internet protocols and technology, i.e. HTTP, SOAP, web services, etc.,

Page 36: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.36

Web Services-Based Grid Computing

• Grid Computing now strongly based upon web services.

• Large number of newly proposed grid computing standards:– WS-Resource Framework (WSRF)– WS-Addressing– etc., etc. …. .

Page 37: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.37

There will be several multiple-choice quizzes in the course (on-line through WebCT).

Quiz

Question: What is a virtual organization?

(a) An imaginary company.(b) A web-based organization.(c) A group of people geographically distributed that come

together from different organizations to work on grid project.

(d) A group of people that come together to work on a virtual reality grid project.

Page 38: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.38

Reading Assignments

Since there is no assigned textbook, there will be a few reading assignments.

Purpose is to consolidate your understanding.

Materials posted on home page from link “reading/on-line materials” at end of section on lecture materials.

Page 39: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.39

First Reading Assignment

"The Anatomy of the Grid: Enabling Scalable Virtual Organizations"

by

I. Foster, C. Kesselman, and S. Tuecke

Int. J. Supercomputer Applications, 2001.

Page 40: 1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson

1.40

AcknowledgementSlides numbers marked with “I. Foster” have been

selected from presentations made by Ian Foster:• Enabling eScience: Grid Technologies Today &

TomorrowAmerican Association for the Advancement of Science Annual Meeting, Washington, DC, February 21 2005.

• Globus: Bridging the GapKeynote Talk, GlobusWORLD, Boston, Mass., February 8, 2005.

• The Grid: Reality, Technologies, ApplicationsDistinguished Lecture, McGill University, Montreal, Canada, January 21 2005.

used for educational purposes only.