analyze genomes services for precision medicine

14
Analyze Genomes Services for Precision Medicine Dr. Matthieu-P. Schapranow Healthcare Information and Management Systems Society Conference, Las Vegas, NV Mar 2, 2016

Upload: matthieu-schapranow

Post on 09-Jan-2017

963 views

Category:

Technology


1 download

TRANSCRIPT

Analyze Genomes Services for Precision Medicine

Dr. Matthieu-P. Schapranow Healthcare Information and Management Systems Society Conference, Las Vegas, NV

Mar 2, 2016

■  Patients

□  Individual anamnesis, family history, and background

□  Require fast access to individualized therapy

■  Clinicians

□  Identify root and extent of disease using laboratory tests

□  Evaluate therapy alternatives, adapt existing therapy

■  Researchers

□  Conduct laboratory work, e.g. analyze patient samples

□  Create new research findings and come-up with treatment alternatives

The Setting Actors in Oncology

Schapranow, HIMSS, Mar 2, 2016

2

Analyze Genomes Services for Precision Medicine

IT Challenges Distributed Heterogeneous Data Sources

3

Human genome/biological data 600GB per full genome 15PB+ in databases of leading institutes

Prescription data 1.5B records from 10,000 doctors and 10M Patients (100 GB)

Clinical trials Currently more than 30k recruiting on ClinicalTrials.gov

Human proteome 160M data points (2.4GB) per sample >3TB raw proteome data in ProteomicsDB

PubMed database >23M articles

Hospital information systems Often more than 50GB

Medical sensor data Scan of a single organ in 1s creates 10GB of raw data Cancer patient records

>160k records at NCT Analyze Genomes Services for Precision Medicine

Schapranow, HIMSS, Mar 2, 2016

Schapranow, HIMSS, Mar 2, 2016

Our Approach Analyze Genomes: Real-time Analysis of Big Medical Data

4

In-Memory Database

Extensions for Life Sciences

Data Exchange, App Store

Access Control, Data Protection

Fair Use

Statistical Tools

Real-time Analysis

App-spanning User Profiles

Combined and Linked Data

Genome Data

Cellular Pathways

Genome Metadata

Research Publications

Pipeline and Analysis Models

Drugs and Interactions

Analyze Genomes Services for Precision Medicine

Drug Response Analysis

Pathway Topology Analysis

Medical Knowledge Cockpit Oncolyzer

Clinical Trial Recruitment

Cohort Analysis

...

Indexed Sources

Combined column and row store

Map/Reduce Single and multi-tenancy

Lightweight compression

Insert only for time travel

Real-time replication

Working on integers

SQL interface on columns and rows

Active/passive data store

Minimal projections

Group key Reduction of software layers

Dynamic multi-threading

Bulk load of data

Object-relational mapping

Text retrieval and extraction engine

No aggregate tables

Data partitioning Any attribute as index

No disk

On-the-fly extensibility

Analytics on historical data

Multi-core/ parallelization

Our Technology In-Memory Database Technology

+

+++

+

P

v

+++t

SQL

xx

T

disk

5

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

Our Software Architecture A Federated In-Memory Database System

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

6

Federated In-M em ory D atabase System

M aster D ata andS hared A lgorithm s

S ite A S ite BC loud Provider

C loud IM D BInstance

Local IM D BInstance

S ensitive D ata,e.g . P atient D ata

R

Local IM D BInstance

Sensitive D ata,e .g. P atien t D ata

R

Use Case: Precision Medicine in Oncology Identification of Best Treatment Option for Cancer Patient

■  Patient: 48 years, female, non-smoker, smoke-free environment

■  Diagnosis: Non-Small Cell Lung Cancer (NSCLC), stage IV

■  Markers: KRAS, EGFR, BRAF, NRAS, (ERBB2)

1.  Surgery to remove tumor

2.  Tumor sample is sent to laboratory to extract DNA

3.  DNA is sequenced resulting in 750 GB of raw data per sample

4.  Processing of raw data to perform analysis

5.  Identification of relevant driver mutations using international medical knowledge

6.  Informed decision making Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

7

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

8

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

9

App Example I: Integrating Processing and Real-time Analysis of Genome Data in the Clinical Routine

■  Control center for processing of raw DNA data, such as FASTQ, SAM, and VCF

■  Personal user profile guarantees privacy of uploaded and processed data

■  Supports reproducible research process by storing all relevant process parameters

■  Implements prioritized data processing and fair use, e.g. per department or per institute

■  Supports additional service, such as data annotations, billing, and sharing for all Analyze Genomes services

■  Honored by the 2014 European Life Science Award

Analyze Genomes Services for Precision Medicine

Standardized Modeling and runtime environment for

analysis pipelines

10

Schapranow, HIMSS, Mar 2, 2016

■  Query-oriented search interface

■  Seamless integration of patient specifics, e.g. from EMR

■  Parallel search in international knowledge bases, e.g. for biomarkers, literature, cellular pathway, and clinical trials

App Example II: Medical Knowledge Cockpit for Patients and Clinicians

Analyze Genomes Services for Precision Medicine

11

Schapranow, HIMSS, Mar 2, 2016

Real-time Data Analysis and Interactive Exploration

App Example III: Identifying Best Chemotherapy using Drug Response Analysis

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

Smoking status, tumor classification

and age (1MB - 100MB)

Raw DNA data and genetic variants

(100MB - 1TB)

Medication efficiency and wet lab results

(10MB - 1GB)

12

Patient-specific Data

Tumor-specific Data

Compound Interaction Data

■  For patients

□  Identify relevant clinical trials and medical experts

□  Become an informed patient

■  For clinicians

□  Identify pharmacokinetic correlations

□  Scan for similar patient cases, e.g. to evaluate therapy efficiency

■  For researchers

□  Enable real-time analysis of medical data, e.g. assess pathways to identify impact of detected variants

□  Combined mining in structured and unstructured data, e.g. publications, diagnosis, and EMR data

What to Take Home? Test it Yourself: AnalyzeGenomes.com

Schapranow, HIMSS, Mar 2, 2016

13

Analyze Genomes Services for Precision Medicine

Keep in contact with us!

Hasso Plattner Institute Enterprise Platform & Integration Concepts (EPIC)

Program Manager E-Health Dr. Matthieu-P. Schapranow

August-Bebel-Str. 88 14482 Potsdam, Germany

Dr. Matthieu-P. Schapranow [email protected] http://we.analyzegenomes.com/

Schapranow, HIMSS, Mar 2, 2016

Analyze Genomes Services for Precision Medicine

14