challenges and opportunities in preparing students for...

6
22 THIS ARTICLE HAS BEEN PEER-REVIEWED. COMPUTING IN SCIENCE & ENGINEERING P ETASCALE C OMPUTING Global competitiveness requires a 21st century approach to educating students and preparing them to exploit next-generation technologies. The Blue Waters project aims to revolutionize undergraduate and graduate education by promoting new educational resources, models, and methods that transcend traditional boundaries of discipline and institution. Challenges and Opportunities in Preparing Students for Petascale Computational Science and Engineering C omputer modeling, simulation, scien- tific visualization, information analysis, and advancements in computing tech- nologies are among the most signifi- cant developments for advancing scientific inquiry in the 20th century. The integration of these ca- pabilities into comprehensive computing environ- ments are providing the means for understanding and predicting the behavior of real-world natural and engineered systems based on a knowledge of science’s fundamental laws. Advances in computa- tional methods, algorithms, and computer speeds are making possible quantitative predictions and problem-solving approaches in all fields of study. Such advances directly benefit today’s global so- ciety, offering improved weather and climate predictions, healthcare and disease prevention, structural and materials design, natural resources management, new energy sources, and food pro- duction and safety. Given these benefits, compu- tational science is now considered one of the three essential pillars of science, complementing theory and experimentation/observation. Powerful new computing technologies are coming to science and engineering in the form of many-core CPUs, graphics processors, and complex petascale architectures. To fully utilize these resources, the core competencies of high- performance computing must evolve—and high-performance computing education and train- ing must evolve with it. A critical mass of expertise in “bleeding edge” HPC is something few universi- ties have; most also lack formal and informal cur- ricula to educate and train students to exploit new computing technologies for scientific discovery and engineering innovation. Indeed, national and inter- national studies report that students are unprepared to use HPC as a research tool. 1–5 Such studies point to several gaps in student education, including difficulty in balancing domain topics and sci- entific computation in a way that provides both depth and breadth; 1521-9615/09/$26.00 © 2009 IEEE COPUBLISHED BY THE IEEE CS AND THE AIP Sharon C. Glotzer University of Michigan, Ann Arbor Bob Panoff and Scott Lathrop Shodor Education Foundation

Upload: vanminh

Post on 25-Aug-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

22 This arTicle has been peer-reviewed. Computing in SCienCe & engineering

P e t a s c a l eC o m p u t i n g

Global competitiveness requires a 21st century approach to educating students and preparing them to exploit next-generation technologies. The Blue Waters project aims to revolutionize undergraduate and graduate education by promoting new educational resources, models, and methods that transcend traditional boundaries of discipline and institution.

Challenges and Opportunities in Preparing Students for Petascale Computational Science and Engineering

Computer modeling, simulation, scien-tific visualization, information analysis, and advancements in computing tech-nologies are among the most signifi-

cant developments for advancing scientific inquiry in the 20th century. The integration of these ca-pabilities into comprehensive computing environ-ments are providing the means for understanding and predicting the behavior of real-world natural and engineered systems based on a knowledge of science’s fundamental laws. Advances in computa-tional methods, algorithms, and computer speeds are making possible quantitative predictions and problem-solving approaches in all fields of study. Such advances directly benefit today’s global so-ciety, offering improved weather and climate predictions, healthcare and disease prevention,

structural and materials design, natural resources management, new energy sources, and food pro-duction and safety. Given these benefits, compu-tational science is now considered one of the three essential pillars of science, complementing theory and experimentation/observation.

Powerful new computing technologies are coming to science and engineering in the form of many-core CPUs, graphics processors, and complex petascale architectures. To fully utilize these resources, the core competencies of high- performance computing must evolve—and high-performance computing education and train-ing must evolve with it. A critical mass of expertise in “bleeding edge” HPC is something few universi-ties have; most also lack formal and informal cur-ricula to educate and train students to exploit new computing technologies for scientific discovery and engineering innovation. Indeed, national and inter-national studies report that students are unprepared to use HPC as a research tool.1–5 Such studies point to several gaps in student education, including

difficulty in balancing domain topics and sci-•entific computation in a way that provides both depth and breadth;

1521-9615/09/$26.00 © 2009 ieee

CopubliShed by the ieee CS and the aip

Sharon C. GlotzerUniversity of Michigan, Ann ArborBob Panoff and Scott LathropShodor Education Foundation

September/oCtober 2009 23

a lack of software engineering skills needed to •write, modify, verify, and validate robust sys-tems and applications codes that will address community needs over the long term; a lack of understanding of underlying algo-•rithms and their applicability in a highly paral-lel multiscale environment, and a tendency to use codes as “black boxes”; andinsufficient knowledge underlying program-•ming and optimizing for performance.

These gaps will only continue to widen with the advent of petascale computing unless the nation embarks on a commitment to accelerate HPC education and training.

To address these gaps in workforce develop-ment, computational science and engineering (CSE) education must keep pace with scientific and technological advances, which requires new approaches for teaching HPC in light of new ar-chitectures and new algorithms to address new applications. The US currently faces both a criti-cal need and—given the increased national invest-ment and attention to hardware development—a unique opportunity to prepare all students to en-gage in computational science using quantitative reasoning, computational thinking, and multiscale modeling. Thus prepared, our students can fully contribute to advancing scientific discovery us-ing emerging computational environments. In the US as elsewhere, we also have a responsibility and new opportunities to expose a larger, more diverse community of learners to anticipated advances in petascale-based science and engineering.

To accomplish all this, we must leverage faculty expertise to establish best practices, identify and fill gaps in education and training, and modernize the CSE curriculum across all science, technology, engineering, and mathematics (STEM) fields, all learning levels, and all educational institutions. As we describe here, these goals constitute the found-ing vision of the Blue Waters project, which sup-ports undergraduate and graduate research and education programs in collaboration with academia, industry, government agencies, and the public.

opportunities in petascale educationPetascale computing education offers us the op-portunity to rethink how we approach highly par-allel, multiscale modeling in an interdisciplinary way. Because computational scientists must rethink basic algorithmic approaches for using emerging petascale and exascale computing systems, we have the opportunity to revisit fundamental questions about how we might incorporate the inherent

parallelism of nature into the next generation of models and simulations. For example, researchers currently solve many N-body problems by parallel-izing an older serial code through summing pairs of potentials. The real world is a many-body prob-lem and interactions between objects—be they atoms or galaxies—should be calculated in paral-lel. As current and future generations of students address scientific problems of significantly greater size and complexity, they must envision, develop, and deploy new ideas, algorithms, and methods for analyzing nature’s fundamentals.6

Already, faculty members have successfully in-troduced scientific visualizations of complex prob-lems at levels ranging from advanced high school courses to graduate school. Students have shown they can bypass the repetitive, tedious analysis of columns of numbers and derive insight from the color, texture, and intensity of 2D and 3D visual-izations. Using dynamic modeling tools, students can control more complex graphs and visualiza-tion and correlate model parameter variations with predictions. The need for further advancements in visualization education is becoming more widely recognized. In 2007, a Gordon Research Confer-ence focused on visualization in science and edu-cation (http://community.middlebury.edu/~grc). Building on its nationally recognized leadership in scientific visualization, the US National Center for Supercomputing Application (NCSA) is mak-ing scientific visualization an integral component of Blue Waters programs as well as other activities that it manages.

Blue Waters education programHelping the workforce obtain the knowledge and skills needed to conduct petascale-enabled science and engineering requires a long-term coordinat-ed effort to educate and train current and future generations of scientists and engineers. This ef-fort must excite, recruit, educate, train, and retain students as well as research and education pro-fessionals. Such an effort is integral to the Blue Waters project and includes active involvement

petascale computing education offers the

opportunity to rethink how we approach

highly parallel, multiscale modeling in an

interdisciplinary way.

24 Computing in SCienCe & engineering

from partners in the Great Lakes Consortium for Petascale Computation (GLCPC).

Blue Waters seeks to introduce computational thinking in general, and CSE in particular, at the undergraduate and graduate levels to prepare stu-dents to advance scientific discovery and engineering innovation. These efforts build on many ongoing ef-forts, including those at the Ralph Regula School of Computational Science,7 which is working to build interdisciplinary competency in undergraduate CSE education in Ohio. The Blue Waters Education Pro-gram is engaging science and education communities to extend these competencies to include HPC—and in particular, petascale HPC—competencies.

The Blue Waters project will measure the suc-cess of its efforts based on improvements in stu-dent learning and achievement. Three examples illustrate how we’ll accomplish this. First, the IEEE Computing in Science & Engineering special issue on HPC in education8 covered a spectrum of projects—from high school through professional education—focused on various approaches for as-sessing what students are learning. One article, “Transforming Chemistry Education through Computational Science,” for example, discussed an NCSA-led education project.9 Over the first two years, external evaluators reported that par-ticipating high school teachers were able to attain significant achievement among their students on the American Chemical Society standardized test, showing a 45 percent improvement in student gains compared to the control group.

Second, evaluation and assessment of our Virtual School of Computational Science and Engineering programs is guiding our activities. For example, dur-ing the Blue Waters program’s first year, the external evaluator for the Virtual School’s summer school course on using GPUs for scientific simulation found that most participants reported that the pro-gram contributed a great deal to their understanding and abilities to apply accelerators in their research.

Finally, the Computational Science Education Reference Desk (www.shodor.org/refdesk) and Shodor tracked substantial use of the digital li-brary materials from both CSERD,10 a National Science Digital Library pathways portal, and HPC University (www.hpcuniv.org), a digital repository of reference materials. The introductory CSE ma-terials receive over three million page views per month (www.shodor.org/webalizer). The HPCU team will continue to expand the content in response to community needs to provide a rich ar-ray of HPC resources as a partner in the Virtual School of Computational Science and Engineering. With internal self-assessment modules built into

the materials, all new petascale education materials will be scaffolded to ensure that students can use more basic, introductory materials that also align with more advanced tutorials as needed. Shodor and its partners are devising new metrics that will examine use of the more advanced tutorials.

undergraduate petascale education programThe Undergraduate Petascale Education Pro-gram of Blue Waters focuses on preparing the next generation of undergraduate students and K-12 teachers, as well as an educated workforce. UPEP involves all types of two- to four-year un-dergraduate institutions, including those focused on serving women, minorities, and students with disabilities as well as those institutions involved in the Experimental Program to Stimulate Compet-itive Research. UPEP is pursuing three initiatives for engaging the national community:

professional development workshops for under-•graduate faculty,undergraduate materials developed by under-•graduate faculty, andresearch internships for undergraduate students.•

UPEP provides undergraduate faculty and stu-dents with resources, tools, and methodologies for educating students in computational thinking. It will support materials development by under-graduate faculty, while also seeking compelling ex-amples in STEM fields, including everything from desktop to petascale computing that incorporates computational thinking—especially in a multicore environment. Materials will address a specific sci-entific question in physics, chemistry, or biology, for example. They’ll also explore the computational models used to address the question, along with computer algorithms, software implementations, example problems, references, and assessment ru-brics. The materials will aim to prepare diverse stu-dents for petascale computing across all fields and will be broadly disseminated through the digital libraries. Prior to publication, all materials will be subject to a formal validation, verification, and ac-creditation11 review focused on three questions:

Validation:• Is the right problem solved?Verification:• Is the problem solved right?Accreditation:• Is the problem aimed at the right education level?

UPEP will provide models for engaging un-dergraduate students in petascale computing

September/oCtober 2009 25

research and education projects. Blue Waters will offer full-year internships that include sum-mer and academic year support, with an emphasis on under-represented students and students from under-represented institutions. Student interns will participate in a petascale research group, work in a research or education team, and have multiple op-portunities to interact with all other students. In-terns will also be expected to attend and report on a national conference, such as the SC Education Pro-gram, the education, outreach, and training com-ponent of the ACM/IEEE Computer Society’s SC Conference (http://sc09.supercomputing.org). The program consists of a series of pathways workshops that bring together domain experts across the com-putational sciences to develop and deploy effective curriculum. It also seeks to offer professional de-velopment workshops to expose undergraduate fac-ulty to computational science use in the classroom. The program makes extensive use of the CSERD’s digital library, and the program’s faculty and work-shop instructors submit materials they develop to the library for broad dissemination.

UPEP offers a series of workshops throughout the year to prepare faculty for petascale comput-ing education and research. Participants engage in incorporating computational thinking and petascale resources in their undergraduate class-es. The topics include quantitative reasoning, computational thinking, and multiscale model-ing across all STEM fields. The workshops also address petascale approaches in upper-division courses, emphasize modeling over programming, and develop competence and confidence to men-tor undergraduate research and collaborate with research universities.

Virtual school of computational science and engineeringWe established the Virtual School of Computation-al Science and Engineering to bring together facul-ty at universities across the Midwest and ultimately the nation to address the unique opportunities and challenges associated with petascale computing and petascale-enabled science and engineering. The Virtual School will support the creation and inte-gration of informal and formal courses and curri-cula tailored to the education and training needs of 21st century computational scientists and engineers and deliver them using new instructional technolo-gies. Our initial focus is on providing immediate training opportunities on core aspects of HPC that fall outside traditional formal courses, and that often fall into the abyss between computer science and domain sciences. Students urgently need this

training to use current and next-generation HPC architectures. Although we’re initially targeting graduate-level and postdoctoral education, efforts in undergraduate and K-12 education will fol-low through collaborations with programs such as UPEP.

The Virtual School’s founding members are institutions within the Big 10 Plus, and we are en-listing institutions worldwide that are interested in contributing to HPC and petascale education advancement. The Virtual School is the first of its kind in the US to bring together faculty, programs, and expertise across many states to create critical mass in an emerging area of CSE. It leverages uni-versities’ major investments in cyberinfrastruc-ture and will enable 24/7 access to cutting-edge curricula, as well as drive the development of new instructional technologies for teaching core HPC modeling and simulation principles, especially as they relate to petascale computing.

In the Virtual School’s first phase, activities in-clude creating summer school courses and related online learning materials to address critical gaps in HPC and petascale curricula, especially in pro-gramming new and emerging HPC architectures. We’re identifying these gaps through national and international reports, planning workshops, and surveys of students, faculty, and employers conducted as part of a preliminary market analy-sis. Examples of the types of knowledge and skills desired by and of CSE students include1

multicore and many-core computer architec-•tures for scientific computing; modern languages and APIs for multicore •architectures;programming for performance; •debugging, performance, and validation tools; •big data I/O, storage, analysis, and visualization;•coprocessors used for scientific applications, in-•cluding graphics processors (GPUs) and field-programmable gate arrays; application development environments and ap-•plication frameworks; andmodern software engineering.•

The virtual school brings together faculty and

experts across many institutions to address the

unique opportunities and challenges associated

with petascale computing.

26 Computing in SCienCe & engineering

In August 2008, the Virtual School held its first week-long summer school on GPUs and many-core architectures. Response to the summer school call for participants was outstanding, with 179 ap-plicants from over 50 institutions applying in the span of a few weeks. Forty-four graduate students participated on site at the University of Illinois, Urbana-Champaign (UIUC) NCSA facility, and a dozen more participated virtually at the Univer-sity of Illinois at Chicago Electronic Visualiza-tion Laboratory. Roughly half of these students were from Big 10 Plus institutions. Additional participants accessed course materials such as PowerPoint slides, exercises, and real-time video/audio transmission, and asked questions via a live, Web-based discussion board manned by graduate student and postdoctoral teaching assistants at the NCSA host site. Course content covered GPU and many-core programming generally, with primary emphasis on learning CUDA (www.nvidia.com/cudazone) for GPU programming. Each day in-cluded lectures by course developers and instruc-tors UIUC’s Wen-mei Hwu and Nvidia’s David Kirk, lectures by CUDA users on various appli-cations, and hands-on lab exercises. All students (onsite and remote) were given access to NCSA’s GPU cluster for the summer school’s duration. Af-ternoon and evening social events provided ample opportunity for students to interact with each oth-er and with instructors and lecturers.

We conducted pre- and post-summer school surveys and assessments, which are informing our next steps. For example, students surveyed indi-cated overwhelmingly that while online participa-tion in virtual lectures and courses to learn HPC would be useful, they highly value opportunities to interact face to face with both peers and expert instructors. The challenge, therefore, is to provide such a high-quality learning experience not just to 44 students, but to several hundred or thou-sand at a time. As a first step toward this goal, an expanded summer school program is being held in 2009, with GLCPC university campuses host-ing real-time, interactive presentations in high definition to scale-up delivery while maximiz-ing educational impact and student interaction. To support this, we’re using student feedback to develop wiki-based tutorials on HPC and GPU computing for launch on the Virtual School Hub. A dozen summer schools focusing on vari-ous core competencies will be developed for 2010 and beyond.

We anticipate that, ultimately, thousands of graduate and postdoctoral students will benefit directly from Virtual School activities each year.

The Virtual School is also eager to partner with and leverage other programs, projects, individu-als, and teams working to develop and dissemi-nate new approaches to HPC education.

Global competitiveness in today’s tech-nology-rich society requires a 21st cen-tury approach to educating students at all levels and across all fields of study.

The Blue Waters project is working to meet these requirements by promoting models for revolution-izing undergraduate and graduate education. In so doing, we hope to create a larger, more diverse, and better prepared US workforce capable of making quantum leaps in science discovery and engineer-ing innovation across all fields well into the future. We invite the community to join us in tackling the educational challenges we all face, to ensure that we can collectively and significantly accelerate the way we prepare students to use HPC—especially at the petascale—as a research tool.

ReferencesS.C. Glotzer et al., 1. International Assessment of R&D

in Simulation-Based Engineering and Science, World

Technology Evaluation Center, 2009; www.wtec.org/

sbes/SBES-GlobalFinalReport.pdf.

J.T. Oden et al., 2. Simulation-Based Engineering Science:

Revolutionizing Engineering Science through Simula-

tion, US Nat’l Science Foundation, 2006; www.ices.

utexas.edu/events/SBES_Final_Report.pdf.

D.E. Atkins et al., 3. Revolutionizing Science and Engineer-

ing through Cyberinfrastructure, US Nat’l Science Foun-

dation, 2003; www.nsf.gov/od/oci/reports/toc.jsp.

M.R. Benioff and E.D. Lazowska, 4. Computational Sci-

ence: Ensuring America’s Competitiveness, President’s

Info. Tech. Advisory Committee (PITAC), 2005; www.

nitrd.gov/pitac/reports/20050609_computational/

computational.pdf.

SIAM Working Group on CSE in Education, “Gradu-5.

ate Education in Computational Science and

Engineering,” SIAM Rev., vol. 43, no. 1, 2001,

pp. 163–177.

R.M. Panoff, “The Four As of CSE Education: Applica-6.

tion, Algorithm, Architecture, and Active Learning,”

IEEE Computational Science & Eng., vol. 2, nos. 4–6,

1995, p. 6–9.

S. Gordon, K. Carey, and I. Vakalis, “A Shared, Insti-7.

tutional Undergraduate Minor Program in Computa-

tional Science,” Computing in Science & Eng.,

vol. 10, no. 5, 2008, pp. 12–16.

S. Lathrop and T. Murphy, “High-Performance Com-8.

puting Education,” Computing in Science and Eng.,

vol. 10, no. 5, 2008, pp. 9–11.

September/oCtober 2009 27

S.C. Sendinger et al., “High-Performance Computing 9.

Education,” Computing in Science & Eng., vol. 10,

no. 5, 2008, pp. 34–39.

D. Tanase, D. Joiner, and J. Stuart-Moore, “Computa-10.

tional Science Educational Reference Desk: A Digital

Library for Students, Educators, and Scientists,”

D-Lib Magazine, Sept. 2006; http://dlib.org/dlib/

september06/tanase/09tanase.html.

D. Joiner et al., “Applying Verification, Validation, 11.

and Accreditation Processes to Digital Libraries,”

Proc. 5th ACM/IEEE-CS Joint Conf. Digital Libraries,

ACM Press, 2005, p. 382.

sharon c. Glotzer is a professor of chemical engineer-ing, materials science and engineering, physics, applied physics, and macromolecular science and engineering at the University of Michigan. She is a cofounder (with Thom Dunning) and director of the Virtual School of Computational Science and Engineering. Her research interests include computational nanoscience and soft matter simulation. She is a member of the CiSE edito-rial board. Glotzer has a PhD in physics from Boston University. Contact her at [email protected].

bob panoff is the director of undergraduate educa-tion for Blue Waters. In addition to being founder and executive director of Shodor, a national resource in computational science education, he leads the National Computational Science Institute and directs the computational science portal for the National Science Digital Library. His research interests in-clude stochastic optimization and condensed matter theory. Panoff has a PhD in theoretical physics from Washington University in St. Louis. Contact him at [email protected].

scott lathrop is the TeraGrid director of educa-tion, outreach, and training, and the Blue Waters technical program manager for education. He coor-dinated the creation of the SC07-11 Education Pro-grams through SC to assist undergraduate faculty and high school teachers with integrating compu-tational science resources, tools, and methods into the curriculum. Lathrop is the SC11 general confer-ence chair. He has a BA in mathematics from the University of Rochester. Contact him at [email protected].