geon: geosciences network
DESCRIPTION
GEON: Geosciences Network. Chaitan Baru Director, Science R&D San Diego Supercomputer Center University of California, San Diego. Domain-specific Cybertools (software). Shared Cybertools (software). Distributed Resources (computation, storage, communication, etc.). - PowerPoint PPT PresentationTRANSCRIPT
Slide 1
GEON: Geosciences Network
Chaitan BaruDirector, Science R&D
San Diego Supercomputer CenterUniversity of California, San Diego
Slide 2
Hardware
Integrated Cyberinfrastructure System: Meeting the needs of multiple communities
Source: Dr. Deborah Crawford, Chair, NSF CyberInfrastructure Working Group
Middleware Services
DevelopmentTools & Libraries
Applications• Geosciences• Environmental Sciences• Neurosciences• High Energy Physics … •
Domain-specific Cybertools (software)
Domain-specific Cybertools (software)
Shared Cybertools (software)
Shared Cybertools (software)
Distributed Resources (computation, storage, communication, etc.)
Distributed Resources (computation, storage, communication, etc.)
Ed
uca
tion a
nd
Tra
inin
g
Dis
covery
& In
novati
on
E-science
Slide 3
Community Cyberinfrastructure Projects
Middleware Services
DevelopmentTools & Libraries
Distributed Computing, Instruments and Data Resources
Shared Tools
ScienceDomains
Shared Tools
ScienceDomains
Your Specific Tools & User Apps.
Your Specific Tools & User Apps.
Friendly Work-Facilitating PortalsAuthentication - Authorization - Auditing - Workflows - Visualization - Analysis
Bio
med
ical
In
form
atic
s (B
IRN
)
Hig
h E
neg
y P
hys
ics
(Gri
Ph
yN)
Geo
scie
nce
s (G
EO
N)
Eco
log
ical
Ob
serv
ato
ries
(N
EO
N)
Ear
thq
uak
e E
ng
inee
rin
g (
NE
ES
)
Oce
an O
bse
rvin
g (
OR
ION
)
Hardware
Adapted from: Prof. Mark Ellisman, UC San Diego
Slide 4
GEON Cyberinfrastructure
• Funded by NSF IT Research program (~$11.5M)• Multi-institution collaboration between IT and
Earth Science researchers• GEON Cyberinfrastructure provides:
• Authenticated access to data and Web services• Registration of data sets, tools, and services with
metadata• Search for data, tools, and services, using ontologies• Scientific workflow environment• Data and map integration capability• Scientific data visualization and GIS mapping
Slide 5
Key Informatics Areas• Portals
• Authenticated, role-based access to cyber resources: data, tools, models, model outputs, collaboration spaces, …
• Data Integration• Search, discover and integrate data from heterogeneous information
sources (“mediation” and “semantic integration”)• Modeling and simulation environments based on “scientific
workflow” software• Users can “program” and steer computations at a higher level of
programming abstraction• Share models (not only data), and support generation and sharing of
provenance information• Geospatial information and Geographic Information Systems (GIS)
• Spatial statistics, spatiotemporal data mining• Visualization of 2D, 2.5D, 3D, 4D data, and multidimensional
information spaces
Slide 6
GEON Science Projects
• Paleo Integration Project• Integrated access to paleogeography, paleobiology,
geochronology databases and services
• LiDAR Integration Project• Development of a national framework for LiDAR data
management. Integration of LiDAR products with other geophysics datasets.
• Geophysics Integration Project• Integration of gravity and magnetic data with seismic
modeling programs, for developing models for crustal structure. Could include geodynamics models.
Slide 7
LiDAR Workflow
• Current implementation• 32-way IBM multiprocessor, 128GB, 8TB SAN• ~2TB point cloud data, ~6B rows in database• ~20TB orthophotos
• Migrating to…• 16-way Linux cluster, 64-bit Intel processors• Central warehouse + replicas for failover and
load balancing
• Provide national framework for• On-demand access & analysis of LiDAR and
other remote sensing data
R. Haugerud, U.S.G.S D. Harding, NASA
Survey Process & Classify
Interpolate/Grid
Point Cloud
Point Cloudx, y, zn, …
Analyze/ Interpret
Courtesy: Chris Crosby &Prof. Ramon Arrowsmith, Arizona State
Meeting in August with USGS EROS Data Center to make continental-scale datasets open to GEON and other user communities (NEON, hazards, …)
Collaboration with AIST, Japan on ASTER and other remote sensing datasets
Slide 8
International GEON (iGEON): India
• Collaboration with University of Hyderabad• Prof. K.V. Subbarao, Professor-in-Charge, Center for Earth and Space
Sciences• Prof. Arun Agarwal, Professor-in-Charge, Center for Modeling, Simulation,
and Design & Head, Department of Computer Science
• Conducted GEON Cyberinfrastructure Workshop, following PRAGMA meeting, Oct. 2005, Hyderabad
• iGEON-India recently funded as a Knowledge Networked R&D Center, by the Indo-US Science and Technology Forum• Deploy GEON Node at UofHyd and an India-based portal• Continue GEON Cyberinfrastructure workshops• Partner with Indian geoscience institutions, including:
• National Geophysical Research Institute, National Remote Sensing Agency, Indian National Center for Ocean Information Systems, Wadia Institute of Himalayan Geology, Birbal Sahni Institute of Paleo-Botany
Slide 9
International GEON (iGEON): Japan
• Collaboration with AIST, Tokyo• Dr. Satoshi Sekiguchi
• Initiating a GEOGrid in Japan. Inauguration in early October, 2006
• Will make various remote sensing data available using GEON technologies, and via GEON
Slide 10
Opportunities for iGEON-China
• Deploy GEON node in China• Share data sets from China
• From agencies as well as individual PI’s—by registering them into GEON
• Use iGEON-China as a way to bring together activities within China
• Share Geoscience tools and applications• Make tools and applications developed by Chinese
research community available as Web/Grid services for remote access
• Share Grid computing resources
Slide 11
Thanks!
Slide 12
Data, Informatics and Cyberinfrastructure
Storage hardware
Networked Storage (SAN)
Grid StorageFilesystems, Database Systems
Data Mining, Simulation Modeling, Analysis, Data Fusion
Applications: Medical informatics,Biosciences, Ecoinformatics,…
Knowledge-Based Integration Advanced Query Processing
Visualization
High speed networking
sensornets
How do we configure computer architectures to optimally support
data-oriented computing?
How do we collect, accessand organize data?
How do we obtain usableinformation from data?
How do we detect trends and relationships in data?
How do we represent data, information and knowledge
to the user?
How do we combine data, knowledge
and information management with simulation and modeling?
instrumentsHPC