the new e-science (bangalore edition)

44
Bangalore Edition

Upload: david-de-roure

Post on 26-Jan-2015

109 views

Category:

Technology


0 download

DESCRIPTION

Keynote talk at IEEE e-Science Conference, Bangalore, December 2007 (the original Powerpoint 2007 version is available on www.semanticgrid.org).

TRANSCRIPT

Page 1: The New e-Science (Bangalore Edition)

Bangalore Edition

Page 2: The New e-Science (Bangalore Edition)

Due to the complexity of the software and the backend infrastructural requirements, e-Science projects usually involve large teams managed and developed by research laboratories, large universities or governments.

e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.

Page 3: The New e-Science (Bangalore Edition)

How do we know when e-Science has succeeded?

Not just accelerated but new

A. When everyone is using the Grid

B. When there are routine scientific advances that would not have happened otherwise

Page 4: The New e-Science (Bangalore Edition)

How do we move from heroic scientists doing heroic science with heroic infrastructure to everyday scientists doing science they couldn’t do before?humanists

archaeologistsgeographersmusicologists...researchers!

research

It’s the democratisation of e-Science!

Page 5: The New e-Science (Bangalore Edition)

scientists

LocalWeb

Repositories

Digital Libraries

Graduate Students

Undergraduate Students

Virtual Learning Environment

Technical Reports

Reprints

Peer-Reviewed Journal &

Conference Papers

Preprints &

Metadata

Certified Experimental

Results & Analyses

experimentation

Data, Metadata Provenance WorkflowsOntologies

The social process of science

Page 6: The New e-Science (Bangalore Edition)

Between 19th October and23rd November 2007

I attended sixinternational meetings

related to e-Science

Grid 2007Scientific and Scholarly Workflows

e-Social Science 2007W3C

Open Grid ForumMicrosoft e-Science

This is what I found

Page 7: The New e-Science (Bangalore Edition)

Everyday researchers doing everyday research

Everyday researchers doing everyday research

• Not just a specialist few doing heroic science with heroic infrastructure

• Chemists are blogging the lab• Everyone is mashing up• Everday hardware – multicore

machines and mobile devices

11

Page 8: The New e-Science (Bangalore Edition)

A data-centric perspective, like researchers

A data-centric perspective, like researchers

• Data is large, rich, complex and real-time

• There is new value in data, through new digital artefacts and through metadata e.g. context, provenance, workflows

• This isn’t “anti-computation” –design interaction around data

22

Page 9: The New e-Science (Bangalore Edition)

Collaborative and participatoryCollaborative and participatory

• The social process of science revisited in the digital age

• Collaborative tools – blogsand Wikis

• e-Science now focuseson publishing as well as consuming

• Scholarly lifecycle perspective

33

Page 10: The New e-Science (Bangalore Edition)

Benefitting from the scale of digital science activity to support science

Benefitting from the scale of digital science activity to support science

• This is new and powerful!• Community intelligence• Review• Usage informing

recommendation• e.g. OpenWetWare• e.g. myExperiment

44

Page 11: The New e-Science (Bangalore Edition)

Increasingly openIncreasingly open

• Preprints servers and institutional repositories

• Open journals• Open access to data• Science Commons• Object Reuse & Exchange

55

Page 12: The New e-Science (Bangalore Edition)

Better not PerfectBetter not Perfect

• The technologies people are using are not perfect

• They are better• They are easy to use• They are chosen by

scientists

66

Page 13: The New e-Science (Bangalore Edition)

Empowering researchersEmpowering researchers

• The success stories come from the researchers who have learned to use ICT

• Domain ICT experts are delivering the solutions

• Anything that takes away autonomy will be resisted

77

Page 14: The New e-Science (Bangalore Edition)

About pervasive computingAbout pervasive computing

• e-Science is about the intersection of the digital and physical worlds

• Sensor networks• Mobile handheld

devices

88

Page 15: The New e-Science (Bangalore Edition)

1. Everyday researchers doing everyday research2. A data-centric perspective, like researchers3. Collaborative and participatory4. Benefitting from the scale of digital science

activity to support science 5. Increasingly open6. Better not Perfect7. Empowering researchers8. About pervasive computing

Signs of the TimesSigns of the Times

Page 16: The New e-Science (Bangalore Edition)

• e-Science is now enabling researchers to do some completely new stuff!

• As the individual pieces become easy to use, researchers can bring them together in new ways and ask new questions

• “The next level”

Onward and UpwardOnward and Upward

“Standing on theshoulders of giants”

Page 17: The New e-Science (Bangalore Edition)
Page 18: The New e-Science (Bangalore Edition)

1. Everyday researchers doing everyday researchBUT heroic Grid infrastructure not being adopted

2. A data-centric perspective, like researchersBUT Grid gives APIs to computation not data

3. Collaborative and participatoryBUT Grid has deeply rooted service provider mindset

6. Better not PerfectBUT Grid aims to provide well-engineered perfect solution

7. Giving autonomy to researchersBUT Grid imposes institutional control (at this time)

8. About pervasive computingBUT Grid is about portals,not the next generation of users

The Grid ProblemThe Grid Problem

Page 19: The New e-Science (Bangalore Edition)

e-ScienceTechnologyCreators& Integrators

ApplicationsResearch

EEResearch

Socio-economic&CommercialInnovation

e-Sciencebespoketailoring

MassUse byResearchers

5 years 5 years 5 years

CSResearch

e-Science

10s ofintegrators

100s ofembeddedconsultants

1000s ofresearch

users

The Arrow ProblemThe Arrow Problem

e-Science Pipeline

Malcolm Atkinson

Page 20: The New e-Science (Bangalore Edition)

Web Services RESTful APIs cmd lines ssh http

Web Browser Mobile phone iPod Car Equipment PDA

P2P

mashups

workflows

services

applicationsSubjectICT experts Computer

Scientists

Software Companies

Workflowtools

Ruby on Rails

ecosystem

Scientists

open sourceSoftwareEngineers

nesc

Page 21: The New e-Science (Bangalore Edition)

• It’s about empowerment as well as provision• People power• Hence usability:

– Simple/familiar interfaces for users– Simple/familiar interfaces for developers– No need for a summer school!

• Step into user space and look back• Computer Scientists as facilitators and

problem solvers(?)

For a flourishing ecosystem...For a flourishing ecosystem...

Page 22: The New e-Science (Bangalore Edition)

• Wikis• Mashups• REST APIs• Google Maps• Technologies:

– AJAX, JSON, Ruby on Rails, ...

• Social networking• Web as a distributed application platform

– Amazon S3 and EC2

But what about Web 2.0?!But what about Web 2.0?!

Page 23: The New e-Science (Bangalore Edition)

1. Everyday researchers doing everyday research2. A data-centric perspective, like researchers3. Collaborative and participatory4. Benefitting from the scale of digital science

activity to support science 5. Increasingly open6. Better not Perfect7. Empowering researchers8. About pervasive computing

Signs of the TimesSigns of the Times

The Long TailData is the Next “Intel Inside”Users add valueNetwork effects by default

Some Rights ReservedThe Perpetual BetaCooperate, don’t ContolSoftware above the level of the single device

Web 2.0 patternsWeb 2.0 patterns

Page 24: The New e-Science (Bangalore Edition)

use Web 2.0 here?

Grid

Page 25: The New e-Science (Bangalore Edition)

use Web 2.0

here?

Grid

Page 26: The New e-Science (Bangalore Edition)

use Web 2.0 here

Grid

Page 27: The New e-Science (Bangalore Edition)

A utility is a directly and immediately useable service with established functionality, performance and dependability, illustrating the emphasis on user needs and issues such as trust

Services are knowledge-assisted (‘semantic’) to facilitate automation and advanced functionality, the knowledge aspect reinforced by the emphasis on delivering high level services to the user

Service-Oriented Knowledge UtilityService-Oriented Knowledge Utility

The architecture comprises services which may be instantiated and assembled dynamically, hence the structure, behaviour and location of software is changing at run-time

Page 28: The New e-Science (Bangalore Edition)

• Web 2.0 is not high performance– It improves the performance of science and people!

• Web 2.0 is not a properly engineered solution– Scientists want better, not perfect. And agility.

• Web 2.0 is not secure– People do lots of “secure” things on the Web

• Web 2.0 is a fad that will pass– It’s inevitable and it’s already happened!

• Web 2.0 works for teenagers but it won’t for scientists– Let’s find out...!

MythsMyths

Page 29: The New e-Science (Bangalore Edition)

myexperiment.org

Page 30: The New e-Science (Bangalore Edition)

Workflows are the new rock and roll

Machinery for coordinating the execution of (scientific) services and linking together (scientific) resources

The era of Service Oriented Applications

Repetitive and mundane boring stuff made easier

E. Science laboris E. Science laboris

Page 31: The New e-Science (Bangalore Edition)

Paul writes workflows for identifying biological pathways implicated in resistance to Trypanosomiasis in cattle

Paul meets Jo. Jo is investigating Whipworm in mouse.

Jo reuses one of Paul’s workflow without change.

Jo identifies the biological pathways involved in sex dependence in the mouse model, believed to be involved in the ability of mice to expel the parasite.

Previously a manual two year study by Jo had failed to do this.

Recycling, Reuse, RepurposingRecycling, Reuse, Repurposing

Page 32: The New e-Science (Bangalore Edition)

20072006200520042003

40

Taverna downloads per day

Taverna downloads per day

Page 33: The New e-Science (Bangalore Edition)

• Independent third party world-wide service providers of applications, tools and data sets. In the Cloud.– 850 databases, 166 web

servers Nucleic Acids Research Jan 2006

• My local applications, tools and datasets. In the Enterprise. In the laboratory.

• Easily incorporate new service without coding. So even more services from the cloud and enterprise.

e-Services in the Cloude-Services in the Cloud

Page 34: The New e-Science (Bangalore Edition)

Kepler

Triana

BPEL

Ptolemy II

Page 35: The New e-Science (Bangalore Edition)

myExperiment.org is… “Facebook for Scientists” A community social network. A gateway to other publishing

environments A federated repository A platform for launching

workflows Publishing self-describing

Encapsulated myExperiment Objects

Mindful publication Started March 2007 Closed beta since July 2007 Open beta November 2007

myExperiment.org is...myExperiment.org is...

Page 36: The New e-Science (Bangalore Edition)
Page 37: The New e-Science (Bangalore Edition)
Page 38: The New e-Science (Bangalore Edition)

Google GadgetGoogle Gadget

Page 39: The New e-Science (Bangalore Edition)

Challenge: Policy and Permissions without TearsOwnership and AttributionOwnership and Attribution

Page 40: The New e-Science (Bangalore Edition)

24/5/2007 | myExperiment | Slide 40

Page 41: The New e-Science (Bangalore Edition)

`

users

descriptions

groups

friendships

tags

Enactor

blobsworkflows

HTMLXML

Snapshot map of resources with their relationships and versions

Page 42: The New e-Science (Bangalore Edition)

scientists

LocalWeb

Repositories

Graduate Students

Undergraduate Students

Virtual Learning Environment

Technical Reports

Reprints

Peer-Reviewed Journal &

Conference Papers

Preprints &

Metadata

Certified Experimental

Results & Analyses

experimentation

Data, Metadata Provenance WorkflowsOntologies

Digital Libraries

The social process of science 2.0

Page 43: The New e-Science (Bangalore Edition)

• e-Science is about doing new science• Grid is just one part of the solution• Users are not just consumers of

infrastructure. Empower them.• Web 2.0 is a set of design patterns• Think Web 2.0 on top of Grid and other

services• Workflows make e-Science easier, and

Web 2 makes workflows easier

Take Homes 2.0Take Homes 2.0

Page 44: The New e-Science (Bangalore Edition)

Contact

David De [email protected]

Carole [email protected]

Thanks

Geoffrey Fox, Savas Parastatides,myExperiment team & myGrid team