canada's irida platform for genomic epidemiology...canada's irida platform for genomic...

29
Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health Agency of Canada

Upload: others

Post on 13-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Canada's IRIDA platform for genomic

epidemiology Gary Van Domselaar Chief, Bioinformatics

National Microbiology Lab

Public Health Agency of Canada

Page 2: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Integrated Rapid Infectious Disease Analysis informatics platform to support real-time infectious disease outbreak investigations

Open source, standards compliant, resource for public health agencies and researchers that complements other initiatives

Page 3: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Platform Overview

IRIDA

Servlet Co

ntain

er

REST API Central File

Storage

Web Interface

Ap

plicatio

n Lo

gic

Compute Cluster

Galaxy

$ ~ >_ Galaxy

Page 4: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Getting data into IRIDA

• Manual web interface upload

• Automated instrument upload (Illumina MiSeq)

Page 5: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Data Management

Page 6: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

User Access Control System User Management

Project User Management

Page 7: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Getting data out of IRIDA

• Sharing project data

• Downloading

• Export to external Galaxy instance

• Exporting to the command-line

Page 8: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

NCBI SRA Upload

Page 9: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

NCBI SRA Upload

Accession

Page 10: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Data Sharing

Project X

Project Y

Project Z

Page 11: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Data Analysis

Galaxy

Assembly Tools

Variant Calling Tools

API

Worker Worker

IRIDA

Kmer=99 Min=500

Assembly

Page 12: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Sample Selection

Galaxy

Assembly Tools

Variant Calling Tools

API

Worker Worker

IRIDA

Kmer=99 Min=500

Assembly

Page 13: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Pipeline Selection

Galaxy

Assembly Tools

Variant Calling Tools

API

Worker Worker

IRIDA

Kmer=99 Min=500

Assembly

Page 14: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Analysis Execution

Galaxy

Assembly Tools

Variant Calling Tools

API

Worker Worker

IRIDA

Kmer=99 Min=500

Assembly

Page 15: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Variant Consolidation

HGT & Recombination

Filtering

Repeat region filtering

Meta-alignment generation

SNV Matrix

Whole Genome Phylogeny

Isolate Sequencing

Reads Variant Calling

Isolate Sequencing

Reads Variant Calling

The IRIDA SNVPhyl Pipeline

User

selects isolates

Phylogeny Viewer

selects reference

Reference Genome

Page 16: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Analysis Results

Page 17: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Automated Assemblies

Page 18: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Analysis Provenance

Page 19: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Auditing • Every creation,

modification, or deletion of data is audited.

• Data can be restored on accidental deletion or modification.

• Trace back data to justify decisions.

Page 20: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Sequencing Quality Control

Page 21: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Analytical Tool

Quality Control Module

Quality Metrics

Quality Control

Analysis QA/QC Model

Page 22: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Types of (Meta)Data Standardized Within IRIDA

Lab Analytics Genomics, PFGE

Serotyping, Phage typing MLST, AMR

Sample Metadata Isolation Source (Food, Host

Body Product, Environmental), BioSample

Epidemiology Investigation Exposures

Clinical Data Patient demographics, Medical

History, Comorbidities, Symptoms, Health Status

Reporting Case/Investigation Status

“Not just what data IS collected, but what SHOULD be collected”

Page 23: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Tools: GenGIS

Page 24: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Coming Soon: CARD – Comprehensive Antibiotic Resistance Database

Page 25: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Coming Soon: The Salmonella In Silico Typing Resource (SISTR)

SISTR: THE SALMONELLA IN SILICO TYPING RESOURCE │

https://lfz.corefacility.ca/sistr-app/ 25

In silico analysis of WGS data assembly statistics serovar prediction in silico typing (MLST, cgMLST) AMR prediction

Comparative genomic analyses cgMLST accessory gene content core SNPs

Epidemiologic analysis geospatial distribution temporal distribution source association

Page 26: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Coming Soon: IslandViewer and IslandCompare

Page 27: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

Outbreak investigation Routine surveillance

PulseNet Canada Deployment

Page 28: Canada's IRIDA platform for genomic epidemiology...Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health

http://irida.ca