cloud computing with nimbus 020409 - the networking and ...€¦ · 8/25/2009 the nimbus toolkit:...

35
Cloud Computing with Nimbus February 2009 Kate Keahey ([email protected]) University of Chicago Argonne National Laboratory

Upload: others

Post on 03-Jun-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

Cloud Computing with NimbusFebruary 2009

Kate Keahey

([email protected])

University of Chicago

Argonne National Laboratory

Page 2: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 2

Cloud Computing

IaaSInfrastructure-as-a-Service

PaaSPlatform-as-a-Service

SaaSSoftware-as-a-Service

elasticitycomputing on demand

capital expense

operational expense

Page 3: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 3

The Quest Begins

Code complexity

Resource control

Page 4: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 4

“Workspaces”

Dynamically provisioned environments Environment control

Resource control

Implementations Via leasing hardware platforms: reimaging,

configuration management, dynamic accounts…

Via virtualization: VM deployment

Page 5: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 5

A Brief History of Nimbus

Research onagreement-based

services

Xen released

First WSRFWorkspace Service

release

EC2 gatewayavailable

Support forEC2 interfaces

2003 20092006

EC2 goes online

First STARproductionrun on EC2

Nimbus Cloudcomes online

Context Brokerrelease

Page 6: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6

Nimbus Overview

Goal: cloud computing for science Open source, extensible IaaS implementation

A platform for experimentation with features for scientific needs and interoperability

Set up private clouds (privacy, expense considerations) Workspace Service

Orchestration tools Focus on end-to-end picture Context Broker, gateway

http://workspace.globus.org/

Page 7: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 7

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

VWSService

The Workspace Service

Page 8: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 8

The Workspace Service

Poolnode

Trusted Computing Base (TCB)

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

The workspace service publishesinformation on each workspace

Users can query this information to find out

things about theirworkspace (e.g. what IP

the workspace was bound to)

Users can interact directly with their

workspaces the same way the would with a

physical machine.

VWSService

Page 9: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 9

Workspace Service Interfaces and Clients

Web Services based Web Service Resource Framework (WSRF)

GT-based

Elastic Computing Cloud (EC2) Supported: ec2-describe-images, ec2-run-instances, ec2-

describe-instances, ec2-terminate-instances, ec2-reboot-instances, ec2-add-keypair, ec2-delete-keypair

Unsupported: availability zones, security groups, elastic IP assignment, REST

Used alongside WSRF interfaces E.g., the University of Chicago cloud allows you to connect

via the cloud client or via the EC2 client

Page 10: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 10

Security

GSI authentication and authorization PKI credential required

Works with Grid proxies

VOMS, Shibboleth (via GridShib), custom PDPs

Secure access to VMs EC2 key generation or accessed from .ssh

Validating images and image data Collaboration with Vienna University of Technology

Paper: Descher et al., Retaining Data Control in Infrastructure Clouds

Page 11: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 11

Networking

Network configuration External: public IPs or private IPs (via VPN)

Internal: private network via a local cluster network

Each VM can specify multiple NICs mixing private and public networks (WSRF only) E.g., cluster worker nodes on a private

network, headnode on both public and private network

Page 12: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 12

The Back Story

Poolnode

Trusted Computing Base (TCB)

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

Poolnode

VWSService

Workspace WSRF front-endthat allows clients

to deploy and manage virtual workspaces

Resource manager for a pool of physical nodesDeploys and manages

Workspaces on the nodes

Each node must have a VMM (Xen)‏ installed, as

well as the workspace controlprogram that manages

individual nodes

Workspace back-end:

Page 13: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 13

Workspace Components

workspacecontrol

workspaceresourcemanager

workspacepilot

workspaceclient

workspaceservice

EC2

WSR

F

Page 14: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 14

Cloud Closure

workspacecontrol

workspaceresourcemanager

workspacepilot

workspaceclient

cloudclient

storageservice

workspaceservice

EC2

WSR

F

Page 15: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 15

The IaaS Gateway

IaaSgateway EC2

potentially other providers

workspacecontrol

workspaceresourcemanager

workspacepilot

workspaceclient

cloudclient

storageservice

workspaceservice

EC2

WSR

F

Page 16: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 16

IP1IP1 HK1HK1 IP2IP2 HK2HK2 IP3IP3 HK3HK3

MPIMPI

Virtual Clusters

What makes a cluster a cluster? Shared trust/security context Shared configuration/context information

Reciprocal exchange of information: networking and security

Tightly-coupled clusters

Page 17: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 17

IP1IP1 HK1HK1

IP1IP1

IP2IP2

IP3IP3

HK1HK1

HK2HK2

HK3HK3

IP1IP1 HK1HK1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

IP1IP1 HK1HK1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

IP1IP1

Context Context BrokerBroker

IP2IP2 HK2HK2

IP1IP1

IP2IP2

IP3IP3

HK1HK1

HK2HK2

HK3HK3

IP3IP3 HK3HK3

IP1IP1

IP2IP2

IP3IP3

HK1HK1

HK2HK2

HK3HK3

Context Broker

Parameterizable appliance Context information exchange

Page 18: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 18

Context Broker Goals

Can work with every appliance Appliance schema, can be implemented in

terms of many configuration systems

Can work with every cloud provider Simple and minimal conditions on generic

context delivery

Can work across multiple cloud providers, in a distributed environment

Page 19: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 19

Status for Context Broker

Release history: In alpha testing since August ‘07

First released summer July ‘08 (v 1.3.3)

Latest update January ‘09 (v 2.2)

Used to contextualize 100s of nodes for EC2 HEP STAR runs, Hadoop nodes, HEP Alice nodes…

Contextualized images on workspace marketplace

Working with rPath to make contextualization easier for the user OVF extensions to be submitted to DMTF

Page 20: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 20

End of Nimbus Tour

workspacecontrol

workspaceresourcemanager

workspacepilot

workspaceservice

workspaceclient

cloudclient

IaaSgateway

cont

ext b

roke

r

contextclient

EC2potentially other providers

storageservice

EC2

WSR

F

Page 21: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 21

Science Clouds Goals

Make it easy for scientific projects to experiment with cloud computing Can cloud computing be used for science?

Evolve software in response to the needs of scientific projects Start with EC2-like functionality and evolve to serve

scientific projects: virtual clusters, diverse resource leases

Federating clouds: moving between cloud resources in academic and commercial space

Provide a laboratory for exploration of cloud interoperability issues

Page 22: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 22

Science Cloud Resources

University of Chicago (Nimbus): first cloud, online since March 4th 2008 16 nodes of UC TeraPort cluster, public IPs

University of Florida Online since 05/08 16-32 nodes, access via VPN

Other Science Clouds Masaryk University, Brno, Czech Republic (08/08), Purdue

(09/08) Installations in progress: IU, Grid5K, Vrije, others

Using EC2 for overflow Minimal governance model http://workspace.globus.org/clouds

Page 23: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 23

Who Runs on Nimbus?

HadoopAliEnGT-scalabilitySTARMontage workflowsGridFTP testingworkspace-teamTestingOSGgeofestbioinformaticsOther

Project diversity: Science, CS, education, build&test…

Page 24: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 24

Hadoop over ManyClouds

Building clouds on top of clouds Virtual workspace: ViNE router + application VMs Need access to distributed resources, and high level of privilege

to run a ViNE router Papers:

“Sky Computing”, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes. Submitted to IEEE Internet Computing.

“CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications” by A.Matsunaga, M. Tsugawa and J. Fortes. eScience 2008.

U of FloridaU of Chicago

ViNErouter

ViNErouter

Page 25: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 25

Alice HEP Experiment at CERN

Collaboration with CERNVM project (CHEP09 paper)

Page 26: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 26

STAR STAR: a high-energy physics experiment Need resources with the right configuration

Complex environments Consistent environments

A virtual OSG STAR cluster OSG cluster: OSG CE (headnode), gridmapfiles, host

certificates, NSF, Torque, worker nodes: SL4 + STAR

Requirements One-click virtual cluster deployment Science Clouds -> EC2

From proof-of-concept to productions runs Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia

Didenko Results to be published at Quark Matter conference and

CHEP

Page 27: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 27

Infrastructure Testing Motivation

Test middleware scalability, use of different platforms, etc.

Workspaces Globus 101 and several different environments

Requirements very short-term but flexible access to diverse platforms

Work by various members of the Globus community (Tom Howe and John Bresnahan), short-lived “communities of one”

Resulted in provisioning a private cloud for Globus

Page 28: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 28

Montage Workflows

Evaluating a cloud from user’s perspective Paper: “Exploration of the Applicability of Cloud

Computing to Large-Scale Scientific Workflows”, C. Hoffa, T. Freeman, G. Mehta, E. Deelman, K. Keahey, SWBES08: Challenging Issues in Workflow Applications

Page 29: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 29

Friends and Family

Committers: Kate Keahey & Tim Freeman (ANL/UC), Ian Gable (UVIC)

A lot of help from the community, see: http://workspace.globus.org/people.html

Collaborations: Cumulus: S3 implementation (Globus team) EBS implementation with IU Appliance management: rPath and Bcfg2 projects Virtual network overlays: University of Florida Security: Vienna University of Technology

Page 30: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 30

Nimbus Users

Applications users Scientific projects Use clouds, Nimbus client side, user guides

Cloud administrators Resource providers Install Nimbus, administrator guides

Communities extending Nimbus Extensions for research or usability Develop code, extensibility guides

Page 31: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 31

Open Source IaaS Implementations

OpenNebula Open source datacenter implementation University of Madrid, I. Llorente & team, 03/2008

Eucalyptus Open source implementation of EC2 UCSB, R. Wolski & team, 06/2008

Cloud-enabled Nimrod-G Open source implementation of EC2 Monash University, MeSsAGE Lab, 01/2009

Industry efforts openQRM, Enomalism

Page 32: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 32

User Environments

Cloud Computing Ecosystem

VMM/datacenter/IaaSVMM/datacenter/IaaS

Appliance Providersmarketplaces

commercial providerscommunities (CERNVM) Deployment Orchestrator

orchestrate the deployment of environments across possibly

many cloud providers

Page 33: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 33

IaaS Clouds vs Grids

Interfaces to resources in the cloud Significant new abstraction: provisioning resources

and environments/configurations Configuration control Resource control, better articulated SLAs

Revolution or evolution? We can leverage much of the infrastructure

developed in the context of grid computing There is new potential to explore

Page 34: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 34

Some Thoughts

Cloud computing has an enormous potential for Science

New roles: appliance providers Or maybe not so new…

Interoperability Academic vs commercial resources Standards: “rough consensus & working code”

Importance of open source Drive academic needs into the marketplace Drive the development of standards

End-2-end tools Combine with what we have Explore new potential

Page 35: Cloud Computing with Nimbus 020409 - The Networking and ...€¦ · 8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 6 Nimbus Overview Goal: cloud computing for science Open

8/25/2009 The Nimbus Toolkit: http//workspace.globus.org 35

Acknowledgements

NSF CSR “Virtual Playgrounds”

NSF SDCI “Missing Links”

DOE CEDPS proposal

Various other collaborations with TeraGrid, OOI, and other communities