hosting cloud, hpc and grid educational activities on futuregrid renato figueiredo – u. of florida...

22
Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Upload: clare-nash

Post on 05-Jan-2016

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Hosting Cloud, HPC and Grid Educational Activities on FutureGrid

Renato Figueiredo – U. of Florida

Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Page 2: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Introduction

Some of the pressing needs faced by educators when teaching Cloud, Grid, HPC:• Availability of suitable resources

• A flexible environment

• Fast turn-around times

• Reuse and sharing of educational materials

FutureGrid sees education as one of its main use cases and seeks to address these needs• Goals of this BOF:

• Highlight key aspects of FG and its use to date

• Benefit from your input and discussions to make it a primary resource used by you and the community for education activities

2

Page 3: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Needs addressed by FutureGrid

Resources:• FutureGrid is an XSEDE resource available to the

community at large which provides unique opportunities for education

• Thousands of cores, multiple sites• Heterogeneous: supporting HPC, cloud, Grid

Flexibility:• Cloud, virtualization, dynamic provisioning

• Environment can adapt to the user, rather than expect user to adapt to the environment

3

Page 4: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Needs addressed by FutureGrid

Fast turn-around times• Reduce barriers to entry and engage new users

• Web 2.0 portal to create class project and manage users; fast turn-around

Reuse and sharing of educational materials• Use of encapsulated “appliances” as a primary

delivery mechanism of education/training modules• Promoting reuse, replication, and sharing

4

Page 5: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Guiding principles

Fidelity: activities should use full-fledged, executable software: education/training modules• Learn using the proper tools

Reproducibility: Creators of content should be able to install, configure, and test their modules once, and be assured of the same functional behavior regardless of where the module is deployed• Incentive to invest effort in developing, testing and

documenting new modules

5

Page 6: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Guiding principles

Deployability: Students and users should be able to deploy modules in a simple manner, and in a variety of resources• Reduce barriers to entry; avoid dependences

upon a particular infrastructure

Community-oriented: Modules should be simple to share, discover, reuse, and expand• Create conditions for “viral” growth

6

Page 7: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

FutureGrid Web Portal

How classes are setup and managed • Create a Web account

• Submit a project proposal: define a class, workshop, short course, tutorial• Education is one of the key use cases of FG; eager

to have more projects!!!

• Needs approval, but not tied to a fixed schedule

• Typically, turn-around of days

• Add users to project• Users create own accounts using the portal

• Project leaders authorize them to gain access

• Students can then interactively use FG resources (e.g. to start VMs)

• Share and reuse – appliances, documents7

Page 8: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Educational appliances

A flexible, extensible platform for hands-on, lab-oriented education on FutureGrid

Support clustering of resources• Virtual private clusters: Virtual machines + virtual

networking to create isolated, repeatable environments• Virtual appliances: self-contained, pre-packaged VMs

• Virtual networks: simple management of virtual clusters by students and educators

8

Page 9: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

9

Virtual appliance example

Virtual appliance + virtual network

copy

instantiate

Hadoop+

VirtualNetwork A Hadoop worker Another Hadoop worker

Repeat…

Virtual machine

Virtual network

Page 10: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Use of FutureGrid in classes

Cloud/distributed computing classes• U.of Florida, U. Central Florida, U. of Puerto

Rico, Univ. of Piemonte Orientale (Italy), Univ. of Mostar (Croatia)

Distributed scientific computing• Louisiana State University

Tutorials, workshops:• Big Data for Science summer school

• A cloudy view on computing

• SC’11 tutorial – Clouds for science

• Science Cloud Summer School (July/Aug’12)

10

Page 11: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Cloud computing classes

Massimo Canonico, U. Piemonte Orientale Difficulties to overcome:

• Hardware issues: find enough free physical machines able to host virtual machines

• Software issues: time to install/configure as many as possible different cloud platforms

• University was not able to provide me the necessary hardware and software support

Students started to play with FutureGrid• After attending few lessons, they were able to

start/stop virtual instance with several Cloud Computing platforms

11

Page 12: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Cloud computing classes

Students used Eucalyptus, OpenStack and Nimbus• Half were not computer scientists.

As FutureGrid freely shares their physical machines and their cloud platforms, decided to freely share all materials of my class.• Hands-out, configuration files and link to

useful documentation are available

• https://portal.futuregrid.org/contrib/cloud-computing-class

12

Page 13: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Cloud computing classes

Graduate-level “Cloud computing for Data-Intensive Sciences” (Judy Qiu, Fall 2010)• Virtualization technologies and tools

• Infrastructure as a service

• Parallel programming (MPI, Hadoop)

• FutureGrid provided a set of software options that made it possible for students to work on different projects along the system stack.

13

Page 14: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Cloud Storage#8 Cloud Storage Survey (Xiaoming, Nixiaogang)

Cloud Storage#8 Cloud Storage Survey (Xiaoming, Nixiaogang)

Iterative MapReduce#3 LDA (Changsi, Yang)

#4 MemCache (Saliya, Yiming ,Jerome)#5 Avro (Yuduo, Yuan, patanachai)#6 PageRank (Shuo-Huan,Parag)

Iterative MapReduce#3 LDA (Changsi, Yang)

#4 MemCache (Saliya, Yiming ,Jerome)#5 Avro (Yuduo, Yuan, patanachai)#6 PageRank (Shuo-Huan,Parag)

Virtualization#9 Hypervisor Performance Analysis Project (James ,

Andrew)

Virtualization#9 Hypervisor Performance Analysis Project (James ,

Andrew)

Cloud Platfor

m

CloudInfrastruc

ture

Cloud Infrastructure #7 Nimbus, Eucalyptus (Stephen, Sonali, Shakeela)

Cloud Infrastructure #7 Nimbus, Eucalyptus (Stephen, Sonali, Shakeela)

Hypervisor/

Virtualization

Dryad/DryadLINQ#1 Matrix Multiplication (Swapnil,Amit,Pradnay)

#2 PhyloD (Ratul,Adrija,Chengming)

Dryad/DryadLINQ#1 Matrix Multiplication (Swapnil,Amit,Pradnay)

#2 PhyloD (Ratul,Adrija,Chengming)

Higher Level

Languages

Term Projects

(Slide courtesy of Judy Qiu)

Page 15: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

University ofArkansas

Indiana University

University ofCalifornia atLos Angeles

Penn State

IowaState

Univ.Illinois at Chicago

University ofMinnesota Michigan

State

NotreDame

University of Texas at El Paso

IBM AlmadenResearch Center

WashingtonUniversity

San DiegoSupercomputer

Center

Universityof Florida

Johns Hopkins

July 26-30, 2010 NCSA Summer School Workshophttp://salsahpc.indiana.edu/tutorial

300+ Students (200 on sites from 10 institutes; 100 online)IU MapReduce and UF Virtual Appliance technologies are supported by FutureGrid.

(Slide courtesy of Judy Qiu)

Big Data for Science

Page 16: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

16

Uploading and sharing images

APIs available to upload images, customize, save, and share images

Community education pages are available• FutureGrid Web portal allows users to publish

their own content

• Tutorials, presentations on Web portal; VMs on image repositories

Page 17: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Discussion starters

At a high-level, does FutureGrid provide capabilities that are aligned with your expectations for an CI useful to hosting educational activities?

What would you need to evaluate whether using FutureGrid is a good idea?

17

Page 18: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Cyberinfrastructure

What cyber-infrastructure features does FutureGrid have that may help you with educational needs?

What cyber-infrastructure features may be missing which would be important?

18

Page 19: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Educational materials

What documentation do educators, students need to use this infrastructure?

What formats are effective? Tutorials, manuals, videos?

19

Page 20: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Topics

IaaS cloud computing? Map/reduce? HPC? Grid? Applications?

20

Page 21: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Outreach, Community-building

What information may be missing, unclear, hindering ability to reach out?

Ways to attract/foster a community that will share and benefit from sharing educational materials?• Within and beyond XSEDE

21

Page 22: Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

22

Questions?

More information:• http://www.futuregrid.org

• http://grid-appliance.org

This document was developed with support from the National Science Foundation (NSF) under Grant No. 0910812 to Indiana University for "FutureGrid: An Experimental, High-Performance Grid Test-bed." Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF