UABgrid : A campus-wide distributed computational infrastructure

Internet2 Meeting 2006 UABgrid : A campus-wide distributed computational infrastructure University of Alabama at Birmingham UABgrid Architecture Team Jill Gemmill Purushotham Bangalore John-Paul Robinson



TRANSCRIPT

Page 1: UABgrid : A campus-wide distributed computational infrastructure

Internet2 Meeting 2006

UABgrid : A campus-wide distributed computational infrastructure

University of Alabama at Birmingham

UABgrid Architecture Team
Jill Gemmill
Purushotham Bangalore
John-Paul Robinson

Page 2: UABgrid : A campus-wide distributed computational infrastructure

Acknowledgments

This work has been supported by:
- Office of the Vice President for Information Technology
- Department of Computer & Information Sciences, School of Natural Sciences and Mathematics
- Enabling Technology Laboratory, School of Engineering
- National Science Foundation
  - ANI-0330543 “NMI Enabled Open Source Collaboration Tools for Virtual Organizations”
  - NSF ANI-0123937 via SURA-2002-103 Subcontract “UAB Middleware Testbed Program: Integrated Directory Services, PKI, Video, and Parallel Computing”
  - NSF CNR-0420614 “Computer and Information Sciences Grid Node Research Facility”

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Page 3: UABgrid : A campus-wide distributed computational infrastructure

UAB Background

- 36-year-old urban medical research university
- 82 city blocks
- 13 schools (= 13 deans)
- 17,000 students; 16,000 employees
- Students are 26.3 percent African American and 60.2 percent female
- 70 research centers
- 20th in NIH funding (4th in the Southeast)
- $433 million in research funding; doubling every 10 years
- No history of centrally supported HPC or other research-oriented computing services (e.g., statistics)
- The Alabama Supercomputer Authority

Page 4: UABgrid : A campus-wide distributed computational infrastructure

What’s a Campus Grid?

Strategic View:
- Maximize use of the university’s investment in computational resources
- Minimize administrative effort involved in campus-wide resource sharing
- Do so by leveraging investments in Identity Management, WebISO, Directories, and Network infrastructures

UABgrid is a federation of resource owners who happen to share a common identity provider.

Page 5: UABgrid : A campus-wide distributed computational infrastructure

UABgrid Partners

- Office of VPIT: Sheila Sanders, VPIT; IT Academic Computing: David L. Shealy, Jill Gemmill, John-Paul Robinson
  - 128-node cluster; 64-node P3 cluster; desktop Condor pool; 6 terabytes IBP storage
- Department of Computer and Information Sciences: Tony Skjellum, CIS Chair; Puri Bangalore, Asst. Prof.
  - 256-processor and 64-processor clusters; Viz Wall; Parallel Storage System
- Engineering Enabling Technology Lab: Bharat Soni, Chair, Mechanical Engineering; Alan Shih, ETLab Director
  - 256-processor and 128-processor clusters; Viz Wall; High Speed Storage Systems

Page 6: UABgrid : A campus-wide distributed computational infrastructure

Current UABgrid Applications

- BioInformatics: BLAST, Gene Sequence Analysis, Structural Biology, Micro-Array Data Analysis, Visualization
- PDE: Automotive & Industrial, Surface Simulations, Optimization
- Grid and Middleware Research: Scheduling, Load Balancing, Granular Authorization

Page 7: UABgrid : A campus-wide distributed computational infrastructure

UABgrid Architecture Today: Phase I

[Figure: Phase I architecture diagram; network label: GigaBit]

Page 8: UABgrid : A campus-wide distributed computational infrastructure

UABgrid Phase II

[Figure: Phase II architecture diagram; labels: Additional Grid Nodes, 10 GigE]

Page 9: UABgrid : A campus-wide distributed computational infrastructure


Factors Supporting Resource Sharing

Provost and VP Research are being inundated with competing school requests to purchase clusters;

Deans who’ve gotten clusters find themselves losing classroom space to equipment racks and facing large power and AC bills;

Clusters, large databases, schedulers, etc. require expensive expertise

Page 10: UABgrid : A campus-wide distributed computational infrastructure

Grid User Management

- Grid identity comes from the enterprise authentication system (“BlazerID”)
- WebISO is leveraged to provide the digital certificate, private key, and proxy certificates behind the scenes
- Grid portal and per-system user accounts are provisioned automatically, saving much administrative effort (Phase I: grid-mapfiles; Phase II: LDAP-stored POSIX accounts + GridShib)
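
The Phase I provisioning step can be pictured with a small sketch. A Globus grid-mapfile pairs a quoted certificate subject DN with a local account name, one mapping per line. Everything else here is an assumption made for illustration: the `GridUser` record, the `DN_PREFIX` value, and the choice to reuse the campus login as the local account name are hypothetical and are not taken from UABgrid itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GridUser:
    blazer_id: str    # campus login, reused here as the local account name (assumption)
    common_name: str  # name placed in the certificate subject (assumption)


# Hypothetical DN prefix; a real campus CA defines its own subject naming.
DN_PREFIX = "/C=US/O=Example University/OU=ExampleGrid"


def subject_dn(user: GridUser) -> str:
    """Build the certificate subject DN for a user under the assumed prefix."""
    if '"' in user.common_name:
        raise ValueError("common name must not contain a double quote")
    return f"{DN_PREFIX}/CN={user.common_name}"


def grid_mapfile_line(user: GridUser) -> str:
    """One grid-mapfile entry: quoted subject DN, a space, the local account."""
    return f'"{subject_dn(user)}" {user.blazer_id}'


def render_grid_mapfile(users) -> str:
    """Render the whole file, sorted so regenerated files are easy to diff."""
    lines = sorted(grid_mapfile_line(u) for u in users)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    roster = [GridUser("jdoe", "Jane Doe"), GridUser("asmith", "Alex Smith")]
    print(render_grid_mapfile(roster), end="")
```

Regenerating the file from the identity system on each change, rather than editing it by hand, is the kind of step that removes per-system account administration.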

Page 11: UABgrid : A campus-wide distributed computational infrastructure


Page 12: UABgrid : A campus-wide distributed computational infrastructure


Grids for Mere Mortals

For jobs run repeatedly where only the database or query varies, it is worthwhile to build a user-friendly interface and also to optimize use of resources

Example: BLAST (National Library of Medicine gene sequence matching software)

http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html

Page 13: UABgrid : A campus-wide distributed computational infrastructure

Improving the Interface : GridBLAST

- Access using BlazerID and password
- Queries and results easily uploaded and downloaded
- Web UI can be hosted on any server
- Web UI can be written in any development language

Page 14: UABgrid : A campus-wide distributed computational infrastructure

Improving Performance: G-BLAST

- A native Grid Service Interface for BLAST
- G-BLAST provides automatic BLAST algorithm selection based on the number of queries, length of queries, size of the database used, and machines available
- BLAST algorithms employed: multi-threaded BLAST, database-splitting BLAST (e.g., mpiBLAST), query-splitting BLAST
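
As a rough illustration of what automatic selection among these three approaches could look like, the sketch below picks a strategy from the four inputs the slide names. The decision rules and the `query_work_threshold` value are invented for this example and are not G-BLAST's actual logic; all class and function names are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Strategy(Enum):
    MULTI_THREADED = "multi-threaded BLAST"
    DATABASE_SPLITTING = "database-splitting BLAST"
    QUERY_SPLITTING = "query-splitting BLAST"


@dataclass(frozen=True)
class BlastJob:
    num_queries: int         # number of query sequences
    mean_query_length: int   # average query length, in residues or bases
    database_bytes: int      # size of the target database
    free_nodes: int          # machines currently available
    node_memory_bytes: int   # memory per machine


def choose_strategy(job: BlastJob, query_work_threshold: int = 100_000) -> Strategy:
    """Pick a BLAST strategy using simple, illustrative rules (assumptions)."""
    # With a single machine there is nothing to distribute across.
    if job.free_nodes <= 1:
        return Strategy.MULTI_THREADED
    # A database that does not fit in one node's memory is split across nodes.
    if job.database_bytes > job.node_memory_bytes:
        return Strategy.DATABASE_SPLITTING
    # A large total query workload is divided among nodes instead.
    if job.num_queries * job.mean_query_length >= query_work_threshold:
        return Strategy.QUERY_SPLITTING
    # Small jobs are not worth the distribution overhead.
    return Strategy.MULTI_THREADED


if __name__ == "__main__":
    job = BlastJob(num_queries=1000, mean_query_length=500,
                   database_bytes=1 * 2**30, free_nodes=8,
                   node_memory_bytes=4 * 2**30)
    print(choose_strategy(job).value)
```

Fixed thresholds like these are the simplest possible policy; a production selector would more likely rely on measured timings for each algorithm.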

Page 15: UABgrid : A campus-wide distributed computational infrastructure

G-BLAST architecture

[Figure: G-BLAST architecture diagram. Components: Users, Client Program, Web Interface, Grid Service Interface, Invoker, Scheduler, GIS, Resource Information, AIS, Application Information, Grid Service, BLAST1 … BLASTn. Labeled messages: Query (1), (7); Query (2); Response (3); Dispatch (4); Result (5); Notify (6).]

Page 16: UABgrid : A campus-wide distributed computational infrastructure

G-BLAST Scheduler Architecture

[Figure: G-BLAST scheduler diagram. Components: Jobs, Job IDs (JIDs), Analyzer, BLAST Benchmark database, Resource Broker, Resource Information, Job Submission Agent, Resources, GIIS/GRIS, AIS.]

Page 17: UABgrid : A campus-wide distributed computational infrastructure

UABgrid Funding and Management Today

- All equipment has been purchased with various grant funds
- ETLab has been designated as a campus resource; ETLab has contracted for 50% of one IT-provided Unix administrator to manage its clusters
- Academic Computing has 2.3 employees and provides other support in addition to HPC
- Computer Science / NS&M resources are available to other campus computational scientists
- Computer Science has 1 administrator for all CIS systems
- Each research department hires its own programmer(s)
- Developing sustainable funding model(s) is challenging

Page 18: UABgrid : A campus-wide distributed computational infrastructure

Federated Grids

Exploring cross-domain resource sharing scenarios:
- Federated Identity: experiences in SURAgrid
- Federated Attributes: myVocs and GridShib

Page 19: UABgrid : A campus-wide distributed computational infrastructure

SURAgrid (www.sura.org)

[Figure: SURAgrid federation diagram. Site labels, in source order: BlazerID and password, Grid Portal, Resources; Kerberos Login, Grid Portal, Resources, Louisiana State University; Digital Certificate Login, Grid Portal, Resources, University of Virginia. Other labels: SURAGrid CA Bridge, SURAGrid Portal, Texas Advanced Computing Center, UABgridCA, UVACA, LSUCA.]

Page 20: UABgrid : A campus-wide distributed computational infrastructure

myVocs: a Virtual Organization Service Center

- Use of Shibboleth in grids provides attribute-based access control (not just identity)
  - Example: faculty may be assigned higher priority in job queues than students
- For VOs, the most important attribute is “member of VO ABC”, and VO memberships typically cross domains
- myVocs offers easy self-management for VOs and expects a web browser as the primary means of access to resources
- Combined with GridShib, myVocs enables VO membership-based access to grid resources
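
The difference between identity-only and attribute-based decisions can be shown in a few lines. The sketch below is not the Shibboleth, GridShib, or myVocs API; the `UserAttributes` record, the affiliation values, and the priority numbers are all hypothetical stand-ins for attributes that an identity provider and a VO service would assert.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class UserAttributes:
    principal: str                            # who the user is (identity)
    affiliation: str                          # e.g. "faculty" or "student" (assumed values)
    vo_memberships: frozenset = frozenset()   # VO names asserted for the user


# Hypothetical queue priorities; larger means scheduled sooner.
PRIORITY_BY_AFFILIATION = {"faculty": 10, "staff": 5, "student": 1}


def may_use(resource_vo: str, user: UserAttributes) -> bool:
    """Access depends on VO membership, not on which campus issued the identity."""
    return resource_vo in user.vo_memberships


def queue_priority(user: UserAttributes) -> int:
    """Map the affiliation attribute to a job-queue priority (0 if unknown)."""
    return PRIORITY_BY_AFFILIATION.get(user.affiliation, 0)


def admit(resource_vo: str, user: UserAttributes) -> Optional[int]:
    """Return the queue priority if the user may use the resource, else None."""
    if not may_use(resource_vo, user):
        return None
    return queue_priority(user)


if __name__ == "__main__":
    prof = UserAttributes("prof@campus-a.example", "faculty", frozenset({"ABC"}))
    print(admit("ABC", prof))
```

Because the decision reads only attributes, a member of VO ABC from another campus is handled by the same rule as a local member.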

Page 21: UABgrid : A campus-wide distributed computational infrastructure

Inside myVocs: Attribute Aggregation

Page 22: UABgrid : A campus-wide distributed computational infrastructure


Page 23: UABgrid : A campus-wide distributed computational infrastructure


Q & A

Jill Gemmill [email protected]

Further Information: http://uabgrid.uab.edu