the new e-science (bangalore edition)

Post on 26-Jan-2015

109 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Keynote talk at IEEE e-Science Conference, Bangalore, December 2007 (the original Powerpoint 2007 version is available on www.semanticgrid.org).

TRANSCRIPT

Bangalore Edition

Due to the complexity of the software and the backend infrastructural requirements, e-Science projects usually involve large teams managed and developed by research laboratories, large universities or governments.

e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.

How do we know when e-Science has succeeded?

Not just accelerated but new

A. When everyone is using the Grid

B. When there are routine scientific advances that would not have happened otherwise

How do we move from heroic scientists doing heroic science with heroic infrastructure to everyday scientists doing science they couldn’t do before?humanists

archaeologistsgeographersmusicologists...researchers!

research

It’s the democratisation of e-Science!

scientists

LocalWeb

Repositories

Digital Libraries

Graduate Students

Undergraduate Students

Virtual Learning Environment

Technical Reports

Reprints

Peer-Reviewed Journal &

Conference Papers

Preprints &

Metadata

Certified Experimental

Results & Analyses

experimentation

Data, Metadata Provenance WorkflowsOntologies

The social process of science

Between 19th October and23rd November 2007

I attended sixinternational meetings

related to e-Science

Grid 2007Scientific and Scholarly Workflows

e-Social Science 2007W3C

Open Grid ForumMicrosoft e-Science

This is what I found

Everyday researchers doing everyday research

Everyday researchers doing everyday research

• Not just a specialist few doing heroic science with heroic infrastructure

• Chemists are blogging the lab• Everyone is mashing up• Everday hardware – multicore

machines and mobile devices

11

A data-centric perspective, like researchers

A data-centric perspective, like researchers

• Data is large, rich, complex and real-time

• There is new value in data, through new digital artefacts and through metadata e.g. context, provenance, workflows

• This isn’t “anti-computation” –design interaction around data

22

Collaborative and participatoryCollaborative and participatory

• The social process of science revisited in the digital age

• Collaborative tools – blogsand Wikis

• e-Science now focuseson publishing as well as consuming

• Scholarly lifecycle perspective

33

Benefitting from the scale of digital science activity to support science

Benefitting from the scale of digital science activity to support science

• This is new and powerful!• Community intelligence• Review• Usage informing

recommendation• e.g. OpenWetWare• e.g. myExperiment

44

Increasingly openIncreasingly open

• Preprints servers and institutional repositories

• Open journals• Open access to data• Science Commons• Object Reuse & Exchange

55

Better not PerfectBetter not Perfect

• The technologies people are using are not perfect

• They are better• They are easy to use• They are chosen by

scientists

66

Empowering researchersEmpowering researchers

• The success stories come from the researchers who have learned to use ICT

• Domain ICT experts are delivering the solutions

• Anything that takes away autonomy will be resisted

77

About pervasive computingAbout pervasive computing

• e-Science is about the intersection of the digital and physical worlds

• Sensor networks• Mobile handheld

devices

88

1. Everyday researchers doing everyday research2. A data-centric perspective, like researchers3. Collaborative and participatory4. Benefitting from the scale of digital science

activity to support science 5. Increasingly open6. Better not Perfect7. Empowering researchers8. About pervasive computing

Signs of the TimesSigns of the Times

• e-Science is now enabling researchers to do some completely new stuff!

• As the individual pieces become easy to use, researchers can bring them together in new ways and ask new questions

• “The next level”

Onward and UpwardOnward and Upward

“Standing on theshoulders of giants”

1. Everyday researchers doing everyday researchBUT heroic Grid infrastructure not being adopted

2. A data-centric perspective, like researchersBUT Grid gives APIs to computation not data

3. Collaborative and participatoryBUT Grid has deeply rooted service provider mindset

6. Better not PerfectBUT Grid aims to provide well-engineered perfect solution

7. Giving autonomy to researchersBUT Grid imposes institutional control (at this time)

8. About pervasive computingBUT Grid is about portals,not the next generation of users

The Grid ProblemThe Grid Problem

e-ScienceTechnologyCreators& Integrators

ApplicationsResearch

EEResearch

Socio-economic&CommercialInnovation

e-Sciencebespoketailoring

MassUse byResearchers

5 years 5 years 5 years

CSResearch

e-Science

10s ofintegrators

100s ofembeddedconsultants

1000s ofresearch

users

The Arrow ProblemThe Arrow Problem

e-Science Pipeline

Malcolm Atkinson

Web Services RESTful APIs cmd lines ssh http

Web Browser Mobile phone iPod Car Equipment PDA

P2P

mashups

workflows

services

applicationsSubjectICT experts Computer

Scientists

Software Companies

Workflowtools

Ruby on Rails

ecosystem

Scientists

open sourceSoftwareEngineers

nesc

• It’s about empowerment as well as provision• People power• Hence usability:

– Simple/familiar interfaces for users– Simple/familiar interfaces for developers– No need for a summer school!

• Step into user space and look back• Computer Scientists as facilitators and

problem solvers(?)

For a flourishing ecosystem...For a flourishing ecosystem...

• Wikis• Mashups• REST APIs• Google Maps• Technologies:

– AJAX, JSON, Ruby on Rails, ...

• Social networking• Web as a distributed application platform

– Amazon S3 and EC2

But what about Web 2.0?!But what about Web 2.0?!

1. Everyday researchers doing everyday research2. A data-centric perspective, like researchers3. Collaborative and participatory4. Benefitting from the scale of digital science

activity to support science 5. Increasingly open6. Better not Perfect7. Empowering researchers8. About pervasive computing

Signs of the TimesSigns of the Times

The Long TailData is the Next “Intel Inside”Users add valueNetwork effects by default

Some Rights ReservedThe Perpetual BetaCooperate, don’t ContolSoftware above the level of the single device

Web 2.0 patternsWeb 2.0 patterns

use Web 2.0 here?

Grid

use Web 2.0

here?

Grid

use Web 2.0 here

Grid

A utility is a directly and immediately useable service with established functionality, performance and dependability, illustrating the emphasis on user needs and issues such as trust

Services are knowledge-assisted (‘semantic’) to facilitate automation and advanced functionality, the knowledge aspect reinforced by the emphasis on delivering high level services to the user

Service-Oriented Knowledge UtilityService-Oriented Knowledge Utility

The architecture comprises services which may be instantiated and assembled dynamically, hence the structure, behaviour and location of software is changing at run-time

• Web 2.0 is not high performance– It improves the performance of science and people!

• Web 2.0 is not a properly engineered solution– Scientists want better, not perfect. And agility.

• Web 2.0 is not secure– People do lots of “secure” things on the Web

• Web 2.0 is a fad that will pass– It’s inevitable and it’s already happened!

• Web 2.0 works for teenagers but it won’t for scientists– Let’s find out...!

MythsMyths

myexperiment.org

Workflows are the new rock and roll

Machinery for coordinating the execution of (scientific) services and linking together (scientific) resources

The era of Service Oriented Applications

Repetitive and mundane boring stuff made easier

E. Science laboris E. Science laboris

Paul writes workflows for identifying biological pathways implicated in resistance to Trypanosomiasis in cattle

Paul meets Jo. Jo is investigating Whipworm in mouse.

Jo reuses one of Paul’s workflow without change.

Jo identifies the biological pathways involved in sex dependence in the mouse model, believed to be involved in the ability of mice to expel the parasite.

Previously a manual two year study by Jo had failed to do this.

Recycling, Reuse, RepurposingRecycling, Reuse, Repurposing

20072006200520042003

40

Taverna downloads per day

Taverna downloads per day

• Independent third party world-wide service providers of applications, tools and data sets. In the Cloud.– 850 databases, 166 web

servers Nucleic Acids Research Jan 2006

• My local applications, tools and datasets. In the Enterprise. In the laboratory.

• Easily incorporate new service without coding. So even more services from the cloud and enterprise.

e-Services in the Cloude-Services in the Cloud

Kepler

Triana

BPEL

Ptolemy II

myExperiment.org is… “Facebook for Scientists” A community social network. A gateway to other publishing

environments A federated repository A platform for launching

workflows Publishing self-describing

Encapsulated myExperiment Objects

Mindful publication Started March 2007 Closed beta since July 2007 Open beta November 2007

myExperiment.org is...myExperiment.org is...

Google GadgetGoogle Gadget

Challenge: Policy and Permissions without TearsOwnership and AttributionOwnership and Attribution

24/5/2007 | myExperiment | Slide 40

`

users

descriptions

groups

friendships

tags

Enactor

blobsworkflows

HTMLXML

Snapshot map of resources with their relationships and versions

scientists

LocalWeb

Repositories

Graduate Students

Undergraduate Students

Virtual Learning Environment

Technical Reports

Reprints

Peer-Reviewed Journal &

Conference Papers

Preprints &

Metadata

Certified Experimental

Results & Analyses

experimentation

Data, Metadata Provenance WorkflowsOntologies

Digital Libraries

The social process of science 2.0

• e-Science is about doing new science• Grid is just one part of the solution• Users are not just consumers of

infrastructure. Empower them.• Web 2.0 is a set of design patterns• Think Web 2.0 on top of Grid and other

services• Workflows make e-Science easier, and

Web 2 makes workflows easier

Take Homes 2.0Take Homes 2.0

Contact

David De Rouredder@ecs.soton.ac.uk

Carole Goblecarole.goble@manchester.ac.uk

Thanks

Geoffrey Fox, Savas Parastatides,myExperiment team & myGrid team

top related