the john wesley powell center for analysis and … · 26/03/2015 · hands-on unlimited computing...
TRANSCRIPT
1
THE JOHN WESLEY
POWELL CENTER FOR ANALYSIS
AND SYNTHESIS
Enhancing scientific discovery and problem-solving through integrated research
About the Powell Center Background/Need for Synthesis
All about Working Groups
What the Center offers (and expects)
Current activities and some statistics
Scientific Computing at Powell Center Current USGS/NSF coordination
Future Opportunities/Potential Collaborations
2
In 2007, the U.S. Geological Survey (USGS) developed a science strategy outlining the major natural-science issues facing the Nation in the next decade
Increased recognition of need for integrated science led to realignment Mission Areas instead of disciplines Organized around these major issues
Recommendation for a place apart from everyday work pressures where synthesis of multi-disciplinary data could take place
3
4
"We are drowning in information, while starving for wisdom. The world will henceforth be run by synthesizers, people able to put together the right information at the right time, think critically about it, and make important choices wisely." E.O. Wilson, 1988
Synthesis driven by both need and desire to address questions on the leading edge of science as well as pressing global change, societal and human health issues.
Synthesis centers promote collaborative discovery.
Synthesis centers are efficient and cost-effective. They leverage and extract additional value from existing investment in researchers and host institutions.
5
Concept became a reality in 2009
Key people and ideas from the Science Strategy visioning team converged to make it happen
Executive champion – Kevin Gallagher
Co-directors and chief scientists – Dr. Jill Baron and Dr. Marty Goldhaber
Space made available in USGS’ Fort Collins Science Center for Working Groups to gather
6
Working Groups are different than workshops!
Teams of 8-15 people who collaborate intensively around one or more questions ripe for synthesis that may or may not require intensive data manipulation
Made up of diverse group of participants
At least one PI must be a USGS employee
WGs typically meet in Fort Collins several times over two or more years
Work virtually in between face to face meetings
7
The opportunity for synthesis Time for in-depth analysis without distractions All-expense paid collaboration with colleagues Virtual in-between Powell Center meetings Fellows for scientific support Hands-on unlimited computing capabilities Page charges
8
Webex, phone conferencing myUSGS, GoogleDocs common space for your
Working Group ScienceBase data cataloging/data management Temporary onsite data downloads and storage Database setup and administration Website development Geospatial support Automate Workflow Services (e.g Kepler or VisCalc) Modeling services (coding/compiling) High performance/high throughput computing (from
Condor to supercomputing) VisWall
9
10
Induced Seismicity Bayesian Models
Amphibian Decline
Global Earthquakes
Biodiversity and N Deposition
Annual call for proposals – due April 30
All proposals go through Powell Center review and are ranked by the Science Advisory Board
Each proposal reviewed by at least 4 SAB members
Proposals discussed and ranked
Top proposals are funded
Constructive review comments returned for unsuccessful proposals
11
Science Advisory Board currently made up of ~10 scientists from USGS, FS, NSF Broad range of expertise
NSF Geosciences is a formal sponsor and supports some, but not all, proposals USFS, EPA, Smithsonian, others have given in-
kind support
12
Big important ideas o May require large, complicated, and
heterogeneous data sets o Or not – conceptual advancement is equally valued,
(but harder to sell)
Regional, national, global implications Clear, logical arguments Well-stated goals, objectives, methods,
timeline Proof of expertise Potential for high impact
13
Full intellectual engagement of all participants Collegiality, constructive discussions High profile publications Many publications Open access to your metadata at end of Working
Group
14
15
Synthesis
Center
Initial Year
of Center
Total Pubs
Average
number of
authors
# in high
impact
journals
NCEAS 1995 2500 9 295
NESCent 2004 677 5 105
NIMBios 2008 422 4 35
Powell 2009 54 9 9
CESAB 2011 20 8 3
2014 Fall Meeting
21 Presentations
2 Sessions
504
Working Group
Participants
16
Product Subproduct Number Notes
Publications Published 54 293 total citations
259 unique citations
In press/submitted 3/17
Data and Metadata 5 Science Base
Outside Support Travel $44,500 NSF, FS, PAGES,
Smithsonian, GEM
Fellow salary/tuition $220,000 NSF, EPA, FS, CU-Boulder,
UC-Riverside, NRP
New Research Funds Proposals funded $6,777,000 USGS, NPS, NASA, DOE,
Belmont Forum, NIMBios, USDA/NIFA, NSF
Project websites Global Croplands, Western
Mercury 2
Honors and awards Commentaries, Excellence in Res. Application Award
3 Science, Nature, USDA
Forest Service
Measures of Success (as of 022715)
17
Scientific Computing and Informatics at
the Powell Center
18
Earth Science
Computational Science
Data Science
• Modeling and synthesis methods • Computer science research and
development • Computer engineering • Technology-enabled science response • High volume, high speed computing
for science
• Water Census • Hazards & Risk • Ecosystems Understanding • Energy & Minerals • Climate & Land Use • Wildlife & Human health
• Data analysis and synthesis • Data collection, acquisition,
and management • Data transformation, and
visualization • Data documentation (fitness for
use) • Derive new knowledge and
new products through integration
18
Spatio-Temporal Exploratory Models predict the probability of occurrence of bird species across the United States at a 35 km x 35 km grid.
Data Analysis Examples – endless possibilities with science data
Land Cover
Potential Uses- • Examine patterns of migration • Infer impacts of climate change • Measure patterns of habitat
usage • Measure population trends
Model results eBird
Meteorology
MODIS – Remote sensing data
Occurrence of Indigo Bunting (2008)
Jan Sep Dec Jun Apr
Provide a robust scientific cyberinfrastructure in support of data integration, complex modeling, data analysis and visualization. This includes: high performance computing (HPC) and storage capabilities, expertise, education, and reliable cyberinfrastructure to USGS scientists, researchers and collaborators.
Decrease “time to solution”
Faster results
Increase “scope of question”
Complex questions
Higher accuracy
Address growing “data” issues
“Big Data” Challenges
Data transfer
Communities of practice
People
Availability
Cyberinfrastructure platform
Sustainable
Reliable
Time Data
Scale Access
21
BioEco Initiative
JWP/EarthCube Digital Crust
JWP Advisory Board participation
DataONE
InterOP - Virtual Data Center
EarthCube
XSEDE
Semantics - Sonet, RPI, SEEK
Ecoinformatics Dear Colleague Letter/Joint Funding
22
Local representative and liaison between XSEDE and campus (or organization
Local source of knowledge concerning HPC and other digital resources available through XSEDE
USGS Student hires MOU signed by Kevin Gallagher May 2014
establishing two USGS Champions
23
Planning, reviewing EcoINFORMA
Interagency coordination: Ecosystems.
data.gov
Biodiversity Informatics
GBIF, IPBES, EU
24
1 of 4 original PIs for Phase 1 and USGS involved in Phase 2 Working Group Leads Usability, Education
Leadership Team
Member Node
USGS Workshops
Contributing software developer time
Complementary agreements with UNM, UC, RPI, UT, DOE ORNL
25
❖ Mission closely aligns Core Science Systems
❖ Involved in charrettes, governance, and review panels
❖ Letters of support (RCNs, Projects) = Resulting projects
27
❖ Big Data Senior Steering Group works under NITRD to develop big data and data science strategies for the U.S. ❖USGS is a member of the BDSSG and is using the
collaboration to advance development of its own data science practice
❖US Group on Earth Observations (GEO) and the Big Earth Data Initiative (BEDI) ❖USGS co-chairs the Data Management Working Group,
developing Earth Observation System Common Framework to enhance interobility - will impact future data management strategies for all fed-funded programs
28
Joint funding of technologically innovative proposals
NSF CISE representation on Powell Center Science Advisory Board
Enhanced HPC collaboration: Clear HPC allocations; attract proposals with HPC components.
Special XSEDE allocation; XSEDE proposal timeline doesn’t mesh with JWP timeline
On site HPC expansion--USGS buys hardware; XSEDE ECSS program funds domain experts for on-site support
29
Data visualization -- support on site operation, tools and support
Participating with local universities (CU, CSU) for HPC projects
Connect DataONE - increase support potentially in community area, potentially around data management, culture, joint tools, etc.
Workshop around informatics support for workshop recommendations
on-going efforts in the area of semantics, CI, DM, etc
30
Conduct synthesis work at the Powell Center to test a methodology you are interested in (Big Data, Earth Cube, Analytical Modelling) Take the Proposal/WG route - submit a proposal through annual cycle
Take the Workshop route – use the Powell Center facility for your group to meet
Develop advanced analytical methods/capabilities Contribute funds for computing infrastructure for modelling or other analyses
Working Groups can use equipment and test innovative methods using the data they bring to the Powell Center for analysis and synthesis
Develop visualization capabilities Contribute to funds for Vis-wall and/or person who can run it at Powell Center
31
32
Cooperators and additional support from CUAHSI EPA Forest Service Smithsonian Tropical Forest Research Institute Global Earthquake Model Foundation PAGES-IGBP (Past Global Changes)
MOU signing USGS (Kevin Gallagher) and NSF Geosciences Directorate (Roger Wakimoto)
updated 2012
We Gratefully Acknowledge
For more info contact Marcia McNiff at [email protected]