xsede: an ecosystem of advanced digital services accelerating scientific discovery

34
September 18, 2014 XSEDE: an ecosystem of advanced digital services accelerating scientific discovery John Towns PI and Project Director, XSEDE Executive Director, Science & Technology, NCSA [email protected]

Upload: john-towns

Post on 14-Jun-2015

119 views

Category:

Technology


3 download

TRANSCRIPT

Page 1: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

September 18, 2014

XSEDE: an ecosystem of advanced digital

services accelerating scientific discovery

John Towns

PI and Project Director, XSEDE

Executive Director, Science & Technology, NCSA

[email protected]

Page 2: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

License terms

• Please cite as: Towns, John. XSEDE: an ecosystem of advanced digital services accelerating scientific discovery , September 2014, [http://www.slideshare.net/jtownsil/xsedeecosystemsept2014]

• ORCID ID: http://orcid.org/0000-0001-7961-2277 • Except where otherwise noted, by inclusion of a source URL or some other

note, the contents of this presentation are © by the Board of Trustees of University of Illinois. This content is released under the Creative Commons Attribution 3.0 Unported license (http://creativecommons.org/licenses/by/3.0/). This license includes the following terms: You are free to share – copy and redistribute the material in any medium or format; and to adapt – remix, transform, and build upon the material for any purpose, even commercially.

• This can be done under the following conditions: attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

Page 3: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE in Context

• XSEDE is an award made under the eXtreme Digital solicitation

– TeraGrid Phase III: eXtreme Digital Resources for Science and Engineering (XD), NSF 08-571

• Consistent with NSF’s vision and strategy statements

3

Page 4: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

NSF’s Strategic Planning Documents

• Investing in Science, Engineering, and Education for the Nation's Future - National Science Foundation Strategic Plan for 2014-2018

• www.nsf.gov/pubs/2014/nsf14043/nsf14043.pdf

– Vision: A Nation that creates and exploits new concepts in science and engineering and provides global leadership in research and education.

• Cyberinfrastructure Framework for 21st Century Science and Engineering

• www.nsf.gov/cif21

• NSF’s Advanced Computing Infrastructure: Vision and Strategic Plan

• www.nsf.gov/pubs/2012/nsf12051/nsf12051.pdf

4

Page 5: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Original Motivation for XSEDE

• Scientific advancement requires a variety of resources and services – and thus availability of comprehensive cyberinfrastructure

composed of heterogeneous digital resources

• Computational science better served if we leverage aggregate expertise of a small number of leading institutions – not fully centralized at a single institution; not fully

decentralized – full centralization less agile, single point of failure – different sites each offer a unique perspective and talent

to address a particular suite of community needs, – best to have several leadership perspectives for addressing

the broad range of disciplinary needs

5

Page 6: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE – accelerating scientific discovery

XSEDE’s Vision: a world of digitally-enabled scholars, researchers, and engineers participating in multidisciplinary collaborations while seamlessly accessing computing resources and sharing data to tackle society’s grand challenges.

XSEDE’s Mission: to substantially enhance the productivity of a growing community of scholars, researchers, and engineers through access to advanced digital services that support open research; and to coordinate and add significant value to the leading cyberinfrastructure resources funded by the NSF and other agencies.

6

Page 7: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Strategic Goal #1:

Deepen and Extend Use

XSEDE will: a. deepen the use—make more effective use—of the

advanced digital services ecosystem by existing scholars, researchers, and engineers, and

b. extend the use to new communities.

c. We will contribute to preparation—workforce development—of the current and next generation of scholars, researchers, and engineers in the use of advanced digital services via training, education, and outreach; and

d. we will raise the general awareness of the value of advanced digital services.

7

Page 8: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Strategic Goal #2:

Advance the Ecosystem

Exploiting its internal efforts and drawing on those of others, XSEDE will advance the broader ecosystem of advanced digital services by:

a. creating an open and evolving e-infrastructure, and

b. enhancing the array of technical expertise and support services offered

8

Page 9: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Strategic Goal #3:

Sustain the Ecosystem

XSEDE will sustain the advanced digital services ecosystem by:

a. assuring and maintaining a reliable, efficient, and secure infrastructure, and

b. providing excellent user support services

c. XSEDE will further operate an effective, productive, and

d. innovative virtual organization

9

Page 10: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE Factoids: high-order bits

• 5 year, US$121M project – plus US$9M, 5 year Technology Investigation Service

• separate award from NSF

– option for additional 5 years of funding upon major review after PY3

• No funding for major hardware – coordinate, support and create a national/international

cyberinfrastructure – coordinate allocations, support, training and documentation for

>$100M of concurrent project awards from NSF

• ~112 FTE /~250 individuals funded across 20 partner institutions – this requires solid partnering!

10

Page 11: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Total Research Funding Supported by XSEDE

in Program Year 3

11

$767 million in research supported by XSEDE

in PY3 (July 2013-June 2014)

Research funding only. XSEDE leverages and integrates additional infrastructure, some

funded by NSF (e.g. Track 2 systems) and some not (e.g. Internet2).

Page 12: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

What is XSEDE?

• An ecosystem of advanced digital services accelerating scientific discovery – support a growing portfolio of resources and services

• advanced computing, high-end visualization, data analysis, and other resources and services

• interoperability with other infrastructures

• A virtual organization (partnership!) providing – dynamic distributed infrastructure – support services and technical expertise to enable researchers

engineers and scholars • addressing the most important and challenging problems facing the

nation and world

• More than just a project funded by the National Science Foundation – XSEDE is a path-finding experiment in how to develop, deploy

and support e-science infrastructure

12

Page 13: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

• World-class leadership

– partnership led by NCSA, NICS, PSC, TACC and SDSC

• CI centers with deep experience

– partners who strongly complement these CI centers with expertise in science, engineering, technology and education

XSEDE’s Distinguishing Characteristics:

Governance

13

Page 14: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE offers efficient and effective

integrated access to a variety of resources

• Leading-edge distributed memory systems

• Very large shared memory systems

• High throughput systems, including Open Science Grid (OSG)

• Visualization engines

• Accelerators like GPUs and Xeon PHIs

Many scientific problems have components that call for use of more than one architecture.

14

Page 15: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDEnet – Using Internet2’s AL2S Service

Page 16: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

What do you mean by “Advanced Digital

Services?”

• Often use the terms “resources” and “services” – these should be interpreted very broadly – most are likely not operated by XSEDE

• Examples of resources – compute engines: HPC, HTC (high throughput computing), campus,

departmental, research group, project, … – data: simulation output, input files, instrument data, repositories, public

databases, private databases, … – instruments: telescopes, beam lines, sensor nets, shake tables, microscopes, … – infrastructure: local networks, wide-area networks, …

• Examples of services – collaboration: wikis, forums, telepresence, … – data: data transport, data management, sharing, curation, provenance, … – access/used: authentication, authorization, accounting, … – coordination: meta-queuing, … – support: helpdesk, consulting, ECSS, training, … – And many more: education, outreach, community building, …

16

Page 17: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE User Portal: THE User Site

portal.xsede.org

• XSEDE User Portal (XUP) is designed to be the only site a user needs to use XSEDE

• XUP presents information relevant to users – user info is easier to find – XUP also provides dynamic data about XSEDE systems – capabilities to manage usage, files, data

• As a user you can – request an allocation, and manage allocations – sign up for training – request help – manage file and data, and much more!

– Portal provides single sign-on to all XSEDE resources

Page 18: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Mao Ye (U. of Illinois)

Computational Finance

18

• Showed that by using odd lots and rapid trading, traders were able to mask what they were doing.

• His first findings contributed to a change in NASDAQ and New York Stock Exchange rules, such that all trades are reportable and visible moment-by-moment.

• Later work suggests that ever-faster trades may be destabilizing the markets.

Page 19: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

CIPRES Science Gateway for Phylogenetics

• Currently accounts for ~30% of XSEDE users • Served 10,000th user since birth as a gateway in Dec 2009

– over 350k total job submissions – averaging 250 new users per month/ with 1000 users total

submitting 12-14,000 jobs per month – users from 80% of EPSCoR states, significant classroom use

• ~1200 publications including Science, PNAS and Cell • “I did not know there were this many scientists out there to

serve, but there is (still) no sign of usage leveling off.” – Mark Miller, PI

• “CIPRES has moved the needle of what phylogenetic scientists think is possible.” – Brent Mishler, UCB, original CIPRES member

19

Page 20: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Direct interactions with the Community

• Facilitated broad range of ground-breaking research – provided in-depth support contributing to improved user

productivity – supported over 10,600 publications

• Seamlessly integrated and retired resources – transitioned users smoothly

• Pursued new disciplinary areas – increased the diversity of disciplines utilizing advanced

digital services

• Campus Champions reached new heights – over 200 Champions at more then 165 institutions – expanding program: Regional, Student, and Domain

Champions

20

Page 21: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE Computational User Census

21

Page 22: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Over 10k publications, conference papers and

presentations reported by XSEDE users, PY1-3

22

Start of XSEDE

Page 23: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Services Provided to the Community

• Established the XSEDE User Portal as the place for users to go to get information and support – a single location for their needs

– create a single account that gives you access to all XSEDE resources: over 18,000 accounts!

• Over 90,000 training registrations in past year! – HPCU and CI-Tutor, as well as center trainings, have

been used in universities around the country to prepare students to use the nation's pre-eminent computational resources

• Field and route over 10,000 tickets annually

23

Page 24: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Some Unexpected Challenges and Outcomes:

XSEDE is a socio-technical ecosystem

• Highly distributed organization – challenges in managing a project that involves staff at 19

partnering institutions

• A completely virtual organization – breaking new ground from an organizational structure and

management point of view

• Highly distributed engineering project – developing new methodologies to adapt traditional practices to

the unusual context of XSEDE

• Providing value-add to communities we did not expect! – systems engineering, virtual organizations, organizational

studies – 40+ letters provided for proposals

24

Page 25: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

XSEDE Value Added

• Benefits that accrue from having one coordinated and consolidated organization that would not happen in a scenario of multiple organizations.

• Two main categories of value added: – value that results from there being a single entity

serving the national research community and providing leadership with which other projects can align

– value that results from the scale of XSEDE

25

Page 26: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Benefits of a Single Coordinating

Organization • Uniformity of computing environment improves user

productivity – single allocation process – coordinated help desk support – unified authentication mechanism – unified set of tools for data management

• Security effectiveness – unified security team has proved highly effective in responding

to security incidents • Disaster resilience

– key core services operated in one region of the US with backup elsewhere

• Software optimization and support • optimization of widely used community codes

26

Page 27: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

One Resource With Which the Nation Can

Align • Several science projects use XSEDE as a resource

– more than 40 Letters of Support written, LIGO, National Center for Genome Analysis Support

• Science Gateways – one national cyberinfrastructure environment to serve as a back end for gateways

• Campus Bridging – aligns national CI – single entity to bridge to – enables the national research community to better leverage

non-federal investments in campus CI – extends the value of software created by community through

XCBC

• XSEDE becomes a coordinating point for national and international entities who want to engage the NSF HPC programs.

27

Page 28: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Benefits from Scale

• Scale of Staff Overall – disciplinary breadth of expertise

• allows coverage of domains composed of diverse sub-domains that 2 or 4 centers could not cover

– Novel and Innovative Projects – support of emerging & innovative research

• National impact of training, education and outreach programs – student programs serve a more diverse group of students – better ability to cover the entire nation in outreach:

• XSEDE Conference • users in all 50 states, D.C., and US territories • Campus Champions - 49 States

28

Page 29: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Diverse ECSS Expertise Possible Because of

Scale

• Fields of expertise: astrophysics,

bioinformatics, CFD, chemistry, computer science, climate modeling, engineering, genomics, hydrology, humanities , machine learning, molecular dynamics, phylogenetics, physics, seismology, statistics.

• Technologies: clusters, large

shared memory systems, MICs, GPUs

• Languages: C, C++,Fortran, MPI,

OpenMP, Java, JavaScript, shell programming, CUDA, OpenACC, Python, R, MATLAB

• Techniques: benchmarking, cloud

computing, Condor, data mining, databases, FFTs, finite element methods, grid generation, grid middleware, Lattice Boltzmann methods, libraries, linear algebra, Monte Carlo methods, parallel debugging, parallel I/O, petascale computing, scheduling, science gateways, visualization, workflows

29

Page 30: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Convenience Requirements will Always Increase

• Each generation of users requires more convenience than the former

• We must always be adding new capabilities while maintaining and extending existing reliability

• XSEDE has learned from the past – adds value in how we address

going forward

30

Change is the only Constant

– Heraclitis 535BC-475BC

No, his mind is not for rent

To any god or government.

Always hopeful, yet discontent,

He knows changes aren't permanent,

But change is.

– Rush - Tom Sawyer

Page 31: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery
Page 32: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery
Page 33: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery

Questions?

Page 34: XSEDE: an ecosystem of advanced digital services accelerating scientific discovery