
Page 1: Deployment of BioXSD-enabled services on a Cloud

christophe.blanchet@ibcp.fr

Workshop for Web Service Providers in Bioinformatics, 2 June 2010, CBS, Lyngby

Page 2

Outline

• IBCP, provider of BioXSD-enabled services
• Cloud Computing
• RENABI GRISBI, French infrastructure

Page 3

Bioinformatics Integrated Tools

gbio-pbil.ibcp.fr/ws

Page 4

GBIO WS resources

[Figure: architecture of the GBIO Web services resources. Labels: Biologist's Machine; HTTP / SOAP; Web Services Host; Jobs; Computing Nodes (Physical Machines, Virtual Machines); Virtualization (VMware ESX/Xen); Central Storage (NFS / S3).]

Page 5

Web Services clients

Page 6

Page 7

• Protein sequence analysis at large scale with generic services

• User: get the most information possible regarding their protein(s)

• Bioinformatics tools
  • use case with common software: BLAST, ClustalW, GOR4, CATH-Gene3D HMMscan, ...

• Biological data
  • Analyzing large sets of proteins obtained, for example, from Next Generation Sequencing
  • Using international databases, e.g. UniProt

[Figure: protein similarity use case. Labels: Master; User's sequences (proteins, 10s to 1,000s); splitting; Tool instances; Tools (BLAST, FastA, SSearch); Shared Storage; Knowledge Bases (Swiss-Prot, TrEMBL; MB to 100s GB; read only); Results gathering.]

EMBRACE Usecase «Usage of Generic WS»

https://bioinformatics.bmc.uu.se/WP4/content/view/125/50/
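
The split, compute and gather pattern of this use case can be sketched as follows. This is a minimal illustration under stated assumptions, not the IBCP implementation: `split_sequences`, `run_tool`, `gather` and `analyze` are hypothetical names, and `run_tool` only reports sequence lengths where a real deployment would call BLAST, FastA or SSearch against the shared, read-only knowledge bases.

```python
from concurrent.futures import ThreadPoolExecutor


def split_sequences(sequences, chunk_size):
    """Master step: cut the user's sequences into chunks of at most chunk_size."""
    return [sequences[i:i + chunk_size] for i in range(0, len(sequences), chunk_size)]


def run_tool(chunk):
    """Stand-in for one tool instance (hypothetical): returns (id, length) pairs."""
    return [(seq_id, len(residues)) for seq_id, residues in chunk]


def gather(partial_results):
    """Gathering step: merge the per-chunk results into one dictionary."""
    merged = {}
    for part in partial_results:
        merged.update(part)
    return merged


def analyze(sequences, chunk_size=2, workers=4):
    """Split the input, run the tool on every chunk in parallel, gather the results."""
    chunks = split_sequences(sequences, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_results = list(pool.map(run_tool, chunks))
    return gather(partial_results)


# Toy input: (identifier, residues) pairs invented for this example.
proteins = [("P1", "MKT"), ("P2", "MKTAY"), ("P3", "M"), ("P4", "MKTAYIAK"), ("P5", "MK")]
result = analyze(proteins)
```

Threads stand in for the tool instances here; on a cloud, each chunk would instead be sent to a separate virtual machine.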

Page 8

BioXSD Services

Page 9

Benefits of Controlled Vocabulary, Ontology and Schema

Providers

• verify the quality of the input/output data
• expected data type: integer, string, ...
• expected value: size, allowed range, ...
• identify abnormal data
• verify the format of the input data
• return a detailed error message: what's wrong in the format
• convert the data into a suitable format
• evaluate complexity/cost for the data
• input data and its transfer/computation cost (time and €)
• output data and its storage/transfer cost (time and €)
• workflow
  • group different processes according to the data

Users

• make the Web services «user-friendly»
• Web services selection
  • BioCatalogue, Seekda
• Web services composition
  • Taverna, Triana, ...
• switch to another provider in case of failure
• Web services customization
  • to let users map/connect to their own ontology
• rich plugins to input data
  • specific forms, with data converters and adequate help/doc about the data to input, on different hardware/support
• data conversion in workflow
• rich plugins to display data
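
The provider-side checks listed on this slide (expected type, allowed range, detailed error message) can be illustrated with a small sketch. Everything in it is an assumption made for the example: the function `validate_request`, the field names `sequence` and `max_hits`, and the limits are hypothetical. A BioXSD-enabled service would rather validate the XML input against the schema itself.

```python
# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def validate_request(request, max_length=10000, max_hits_limit=500):
    """Return a list of readable error messages (empty if the input is valid)."""
    errors = []

    # Expected data type and size for the sequence.
    sequence = request.get("sequence")
    if not isinstance(sequence, str):
        errors.append("sequence: expected a string, got %s" % type(sequence).__name__)
    else:
        if not 1 <= len(sequence) <= max_length:
            errors.append("sequence: length %d outside allowed range 1-%d"
                          % (len(sequence), max_length))
        # Identify abnormal data: characters that are not amino-acid codes.
        bad = sorted(set(sequence.upper()) - AMINO_ACIDS)
        if bad:
            errors.append("sequence: unexpected characters %s" % ", ".join(bad))

    # Expected data type and allowed range for a numeric parameter.
    max_hits = request.get("max_hits")
    if isinstance(max_hits, bool) or not isinstance(max_hits, int):
        errors.append("max_hits: expected an integer, got %s" % type(max_hits).__name__)
    elif not 1 <= max_hits <= max_hits_limit:
        errors.append("max_hits: value %d outside allowed range 1-%d"
                      % (max_hits, max_hits_limit))

    return errors


ok = validate_request({"sequence": "MKTAY", "max_hits": 10})
problems = validate_request({"sequence": "MKT1", "max_hits": 0})
```

Returning every problem at once, instead of stopping at the first one, matches the idea of telling the user what is wrong in the format.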

Page 10

Outline

• IBCP, provider of BioXSD-enabled services
• Cloud Computing
• RENABI GRISBI, French infrastructure

Page 11

Cloud ?

Page 12

Cloud: principles

[Figure: cloud principles. Labels: Biologist's Machine; Web; HTTP; Web Services; CLOUD Portal and Services; Virtual Infrastructure; Physical Infrastructure; Cloud Nodes; CLOUD Site; Cloud Computing.]

Page 13

Page 14

Cloud: middlewares

• Open source
  • hipernet (HIPCAL project)
  • eucalyptus (Amazon EC2/S3 like)
  • open nebula (OGF, StratusLab EU FP7)
  • others: nimbus, cloudStack, ...

• Commercial
  • Amazon EC2
  • Google Apps
  • IBM
  • Microsoft
  • Yahoo, ...

Page 15

Cloud Comparison

Page 16

Cloud Console

Page 17

Deploying on Cloud

[Figure: the protein similarity use case deployed on a virtual infrastructure. Labels: Biologist's Machine; SSH, WS, ...; Master; User's sequences (10s to 1,000s); splitting; Tool instances; Tools (BLAST, FastA, SSearch); Shared Storage; Knowledge Bases (Swiss-Prot, TrEMBL; MB to 100s GB; read only); Results gathering; Virtual Infrastructure; IBCP's cloud: 5 servers, 40 cores, 160 GB.]

Page 18

Bioinformatics Services on a CLOUD

[Figure: workflow steps. Labels: Connect to Cloud; Choose VMs; Deploy VMs; Submit SOAP; Process SOAP; Compute job; Results. Side label: "Already done by IBCP".]

[Figure: architecture. Numbered labels: 1. & 5. HTTP/SOAP (Biologist's Machine); 2. Register Biological Data; 3. Jobs; 4. Get Results. Other labels: Cloud Console; Mgmt VMs; Bioinformatics Virtual Apps; Cloud Storage (S3); Computing Nodes (Physical Machines, Virtual Machines); Virtualization (VMware ESX/Xen); Central Storage (NFS / S3).]
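
As an illustration of the "Submit SOAP" step, the sketch below builds a SOAP 1.1 request envelope with the Python standard library. Only the envelope namespace is standard; the service namespace, the operation name `runSimilaritySearch` and the parameter names are hypothetical and do not describe the actual GBIO services.

```python
import xml.etree.ElementTree as ET

SOAP_ENV = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
SERVICE_NS = "http://example.org/hypothetical-service"  # assumption, not a real service


def build_soap_request(operation, parameters):
    """Build a SOAP 1.1 envelope whose body holds one operation call."""
    ET.register_namespace("soapenv", SOAP_ENV)
    envelope = ET.Element("{%s}Envelope" % SOAP_ENV)
    body = ET.SubElement(envelope, "{%s}Body" % SOAP_ENV)
    call = ET.SubElement(body, "{%s}%s" % (SERVICE_NS, operation))
    # One child element per parameter, in the service namespace.
    for name, value in parameters.items():
        child = ET.SubElement(call, "{%s}%s" % (SERVICE_NS, name))
        child.text = str(value)
    return ET.tostring(envelope, encoding="unicode")


# Hypothetical operation and parameters, for illustration only.
request_xml = build_soap_request(
    "runSimilaritySearch",
    {"sequence": "MKTAYIAK", "database": "Swiss-Prot"},
)
```

The resulting string would be sent as the body of an HTTP POST to the Web service endpoint deployed on the virtual machine.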

Page 19

Perspectives

[Figure: workflow steps. Labels: Connect to Cloud; Choose VMs; Deploy VMs; Submit SOAP; Process SOAP; Compute job; Results. Additional label: Virtual Appliances Deployment.]

[Figure: architecture. Numbered labels: 0. HTTP/S, SSH; 1. Register Biological Data; 2. HTTP/SOAP; 3. Get Results. Other labels: Biologist's Machine; Cloud Console (EC2, ONE, hipcal); Mgmt VMs; Bioinformatics Virtual Apps; Cloud Storage (S3); Jobs; Computing Nodes (Physical Machines, Virtual Machines); Virtualization (VMware ESX/Xen); Central Storage (NFS / S3).]

Page 20

What next ?

• Platform-as-a-Service
  • Provide scientists with pre-defined VMs to deploy
    • on the Research Infrastructure, e.g. RENABI GRISBI
    • on their own computer/cloud
  • Bioinformatics centers switch to being virtual appliance providers

• Infrastructure-as-a-Service
  • Users can connect to the community cloud to deploy the required VMs
  • Deploy the required infrastructure according to the workflow:
    • VXDL/VPXI developed by the HIPCAL project
    • Haizea with OpenNebula.org

Page 21

Outline

• IBCP, provider of BioXSD-enabled services
• Cloud Computing
• RENABI GRISBI, French infrastructure

Page 22

GRISBI - Grid Support to Bioinformatics - www.grisbio.fr

• National Research Infrastructure

• RENABI, IBISA 2008-2010, Institut des Grilles 2009-2010

• 6 centers from RENABI

• PRABI, MIGALE, GenOuest, CBIB Bordeaux, BIPS, CIB

• 8 sites, with 7 CNRS institutes: IBCP Lyon, SBR Roscoff, CBiB Bordeaux, CIB Lille, IRISA Rennes, LBBE Lyon, MIGALE Jouy-en-Josas, BIPS Strasbourg

• 40 participants

• Computing resources

• 1200 cores, 220 TB storage

Enable challenging bioinformatics applications dealing with large-scale biological systems

[Figure: map of the GRISBI centers (CIB, BIPS, GenOuest, Migale, PRABI, CBiB). Labels: 6 centers; 1000 cores; RAM 2 TB; Storage 150 TB. © RENABI GRISBI 2009 - www.grisbio.fr]

Page 23

GRISBI site

[Figure: a GRISBI site. Labels: GRID; UI, SE, CE; Virtual Machines; Virtualization (VMware ESX); Computing Nodes; Physical Machines.]

Page 24

GRISBI Infrastructure

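[Figure: map of the GRISBI infrastructure across the centers CIB, BIPS, GenOuest, Migale, PRABI and CBiB. Labels in extraction order (c = cores): 412 c, 75 TB; 48 c, 30 TB; 128 c, 1.5 TB; 216 c, 62 TB; 376 c, 50 TB; 32 c, 5 TB; 1212 c, 224 TB; 150 c, 10 TB; 06/2010: 15 c, 1 TB; 136 c, 3.2 GB; 426 c, 17 TB; 120 c, 2 TB; 09/2010; 06/2010: 192+2; 08/2010; 20 c, 5 TB.]

+ Core services (IDG): WMS (GRIF), LFC & VOMS (CC)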
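
A quick arithmetic check on this slide: six of its labels (412 c 75 TB, 48 c 30 TB, 128 c 1.5 TB, 216 c 62 TB, 376 c 50 TB, 32 c 5 TB) appear to add up to the "1212 c 224 TB" label, which suggests that label is the infrastructure total. The sketch below only redoes that sum. Reading "c" as cores is an assumption, and the pairing of each figure with a named site cannot be recovered from the transcript.

```python
# (cores, storage in TB) labels as they appear on the slide; which site each
# pair belongs to is not recoverable from the transcript.
site_labels = [(412, 75), (48, 30), (128, 1.5), (216, 62), (376, 50), (32, 5)]

total_cores = sum(cores for cores, _ in site_labels)
total_storage_tb = sum(storage for _, storage in site_labels)

print(total_cores)       # 1212
print(total_storage_tb)  # 223.5, shown as 224 TB on the slide
```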

Page 25

GRISBI usecase

Will be used tomorrow as hands-on material by LABRI people (RENABI GRISBI partner)

Page 26

Conclusions

• Standard integration of bioinformatics tools
  • Freely available to the community
  • Annotated tools with BioXSD/EDAM

• Cloud perspective
  • Use standard infrastructures
  • Managed infrastructure with Web interface
  • Platform- and Infrastructure-as-a-Service

• Perspectives
  • Research Infrastructures based on public/private cloud
  • Provide the community with bioinformatics Virtual Appliances

Page 27

Acknowledgment

CNRS - Centre National de la Recherche Scientifique

University of Lyon 1

The ANR through the HIPCAL project

GIS IBISA through the project GRISBI PF 2008

The European Commission through the EU FP7 EGEE III project, contract number INFSO-RI-222667.

CNRS IBCP: E. Bettler, C. Combet, G. Deléage, C. Eloto, C. Gauthey, A. Joseph, A. Michon, J. Pessey, F. Penin.

HIPCAL: Pascale Vicat-Blanc and partners

EMBRACE partners

GRISBI partners

CNRS IBCP, Institute of Biology and Chemistry of Proteins, 7 passage du Vercors, 69007 LYON, FRANCE

Page 28

Thanks

• Questions ?