(hls303) how cloud computing is redefining research: secure, collaborative science at scale | aws...

37
November 13, 2014 | Las Vegas, NV Angel Pizarro, Amazon Web Services | Omar Serang, DNAnexus | Jeffrey Reid, Regeneron Pharmaceuticals

Upload: amazon-web-services

Post on 30-Jun-2015

349 views

Category:

Technology


1 download

DESCRIPTION

Genome sequencing technologies have lowered costs and increased data output at a much faster rate than traditional hardware refresh cycles can service effectively. With lowered costs, more biomedical research and clinical operations are integrating sequencing into a larger portion of their projects, compounding an already tough problem for IT operations. When Regeneron committed to using Next Generation Sequencing technology for biopharmaceutical research, the decision was made to keep the server room empty and deliver all genomic analysis and storage services using AWS and DNAnexus. We review the consequences of this decision, illustrating how basic AWS core services are game-changers in the design and operation of a scientific infrastructure at scale. We show how Amazon S3 changes the storage performance equation, discuss the impact of new instance types on the cost of genomic analysis, and explain how secure collaboration enables Regeneron to advance biopharmaceutical research techniques and support the future of genomic medicine.

TRANSCRIPT

Page 1: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

November 13, 2014 | Las Vegas, NV

Angel Pizarro, Amazon Web Services | Omar Serang, DNAnexus | Jeffrey Reid,

Regeneron Pharmaceuticals

Page 2: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 3: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 4: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Facilities

Physical security

Compute infrastructure

Storage infrastructure

Network infrastructure

Virtualization layer (EC2)

Hardened service endpoints

Rich IAM capabilities

Network configuration

Security groups

OS firewalls

Operating systems

Applications

Proper service configuration

Auth & acct management

Authorization policies

+ =

Customer/Partner

• Re-focus your security professionals on a subset of the problem

• Take advantage of high levels of uniformity and automation

First global public cloud provider to achieve certification for security & quality management system

Page 5: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

EC2

10.0.2.12

AWS region – VPC network isolation

AZ A AZ B

VPC 10.0.0.0/16

SN 10.0.1.0/24 (DMZ) SN 10.0.2.0/24 (Private)

(23.20.103.11)

Internet

EC2

10.0.1.11

Internet GW Service

Virtual Gateway

Page 6: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

AWS system

entitlements

Roles

Root Account

Multi-factor

authentication

Groups

Administrators Developers Applications

Alyson

Anand

Tomcat

Jim Shandra

Xiao

Susan

Reporting

Console

Page 7: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

You get to choose who

can do what in your AWS

environment and from

where

Manage and operate

US EAST

A

VPC

Internet GW Service

Virtual GatewayB

SM

(EIP)

(EIP)

AWS account owner (master)

Network & security

Researcher Operations EMR

Page 8: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Amazon EBS

Amazon S3

• HTTPS

• Server-side encryption of each object

• AWS-managed keys

• Customer-managed keys

• End-to-end secure network traffic

• Whole volume encryption

• AWS-managed keys

• Encrypted incremental snapshots

• Minimal performance overhead (utilizes Intel AES-NI)

Page 9: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

• Records API calls, no matter how those API calls were

made (console, SDK, CLI)

• Who did what and when and from what IP address

• Logs saved to Amazon S3

• Includes Amazon EC2, Amazon EBS, Amazon VPC,

Amazon RDS, IAM, AWS STS, and Amazon Redshift

• Be notified of log file delivery by using the Amazon

Simple Notification Service (Amazon SNS)

• Aggregate log information across services into a

single S3 bucket

• Out of the box integration with log analysis tools

from AWS partners including Splunk, AlertLogic, and

SumoLogic

Page 10: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Virtual Private

Cloud

AWS Cloud

Amazon

S3Amazon

EBS

IAM

CloudTrail

Page 11: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 12: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

DNAnexus is a platform-as-a-service for genome

informatics. In production now with over 20% of the high

throughput sequencing installations globally.

Enables collaboration and datasharing within a secure

and compliant environment that helps attract new users.

Creates a “Network Effect” for genomics.

Platform launched in 2013. Located in Silicon Valley.

* Source: JP Morgan

Page 13: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Baylor College of Medicine

x14,691

INDIVIDUAL

GENOMES

x430TBANALYTIC

DATA

x3.3MCORE-HOURS

x20,800

CORES

x300+RESEARCHERS

1 customer. 1 project. 1 week.

Page 14: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

x

GEISINGER

x

REGENERON

$30B pharma company

focused on cholesterol,

oncology, and rheumatoid

arthritis

$3.5B health care

provider for

3M patients

Page 15: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 16: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 17: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 18: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 19: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 20: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 21: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 22: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Amazon S3: the center of the universe

Page 23: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Amazon EC2: job manager and payloads

Page 24: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Amazon EC2: application services

Page 25: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Leveraging the AWS detailed billing report

Page 26: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

VPC + Dx: connecting partners

Page 27: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

DNAnexus: connecting science

Page 28: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 29: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 30: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 31: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Samples (extracted DNA)

1.4M Sample BioBank

Library Prep Automation

Illumina HiSeq Fleet

Data

Page 32: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

Data QC

Page 33: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

40

140

Page 34: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 35: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

30

50

9/28/1998 9/28/1999 9/28/2000 9/28/2001 9/28/2002 9/28/2003 9/28/2004 9/28/2005 9/28/2006 9/28/2007 9/28/2008 9/28/2009 9/28/2010 9/28/2011 9/28/2012 9/28/2013

40

140

9/28/1998 9/28/1999 9/28/2000 9/28/2001 9/28/2002 9/28/2003 9/28/2004 9/28/2005 9/28/2006 9/28/2007 9/28/2008 9/28/2009 9/28/2010 9/28/2011 9/28/2012 9/28/2013

Page 36: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014
Page 37: (HLS303) How Cloud Computing is Redefining Research: Secure, Collaborative Science at Scale | AWS re:Invent 2014

http://bit.ly/awsevals