proficiency tests in genomics – for gmi · 2018. 9. 28. · • gmi-134, gmi-148, gmi-150,...
TRANSCRIPT
-
9/28/2018
Prof. Rene S. Hendriksen, PhDResearch Group of Genomics EpidemiologyNational Food Institute, Technical University of Denmarkwww.compare-europe.euwww.genomicepidemiology.org
Proficiency Tests in genomics –for GMI
28th Sep 2018, DTU, Denmark
-
Objectives of GMI PT
• To facilitate the production of reliable laboratory results of consistently good quality within the area of whole genome sequencing (WGS) by
– Selecting two strains of three species of public health importance– Selecting species that range in sequencing difficulties– Assessing the sequencing quality based on a set of quality markers
e.g. N50, no of contigs etc. but also the ability to identify epidemiological markers such as MLST and resistance genes
– Identify participants underperforming
• To facilitate harmonization and standardization of whole genome sequencing and data analysis setting tentative arbitrary quality control thresholds
-
Structure of GMI PT, wet-lab
Component 1aMaterial provided: Bacterial cultures (lyophilized)• DNA extraction, purification• Library-preparation, and whole-genome-sequencing of six bacterial
cultures
Component 1bMaterial provided: Purified DNA (pre-prepared, dried)• Library-preparation, and whole-genome-sequencing of the same six
bacterial cultures
Results• Submission reads (via a portal or ftp site) • Survey response
– Method details – MLST (optional) – Resistance genes (optional)
http://www.google.dk/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&cad=rja&uact=8&ved=0ahUKEwjR-p-YttjTAhWDB5oKHUWeCJ0QjRwIBw&url=http://clipartall.com/clipart/5801-building-blocks-clipart.html&psig=AFQjCNGf0MiWKZckyv96wOvGSxVU6-J-lw&ust=1494062302703621
-
Development of GMI PT
2014 – pilot PT20152017
Salmonella (2)E. coli (2)S. aureus (2)
2016 K. pneumonia (2)L. monocytogenes (2)C. coli (1) C. jejuni (1)
http://www.google.dk/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&ved=0ahUKEwihmNysttjTAhWGA5oKHWqfCckQjRwIBw&url=http://www.clipartkid.com/germs-bacteria-cliparts/&psig=AFQjCNG0yNKxnmAoIUgjeCYzTxpYWD-lVQ&ust=1494062408405202
-
DTU Food, Technical University of Denmark
Participation in the 2016/2017 GMI PT
• 46 laboratories in 22 countries had provided data for at least one of the PT components
– Australia (3), Austria, Belgium (2), Canada (2), Denmark (3), Finland, France, Germany (3), Hong Kong, Italy (7), Latvia, Luxembourg, Mexico, the Netherlands (3), Poland, Portugal, Singapore (2), Sweden (2), Switzerland, Taiwan, the United Kingdom (2), the United States (6)
• 66 laboratories in 27 countries had provided data for at least one of the PT components
– Australia (5), Austria, Belgium (3), Canada (2), China, Denmark (3), Finland, France (2), Germany (3), India, Ireland (3), Italy (8), Japan (2), Luxembourg, Mexico (2), the Netherlands (3), Peru, Poland, Portugal, Scotland, Senegal, Singapore (2), Sweden, Taiwan, the United Kingdom (3), the United States (10)
-
DTU Food, Technical University of Denmark
Participation in the 2017 GMI PT
2016 2017
-
DTU Food, Technical University of Denmark
• Number of reads mapped to – reference total DNA sequence– reference chromosome– reference plasmid #1 – reference plasmid #2 – reference plasmid #3– and unmapped reads
• Proportion of reads mapped to the above• Depth of coverage, of the above• Size of assembled genome • Size of assembled genome per total size of DNA sequence (%)• Total number of contigs• Number of contigs > 200 bp• N50• NG50
Measured QC parameters
http://www.google.dk/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&cad=rja&uact=8&ved=0ahUKEwiMxLvTttjTAhWsApoKHaQbDWsQjRwIBw&url=http://alphaislandemirates.com/responsibilities/quality-control/&psig=AFQjCNGkPmI_NWJK4klLz5SOqIsg_sPrbA&ust=1494062473702915http://www.google.dk/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&cad=rja&uact=8&ved=0ahUKEwiMxLvTttjTAhWsApoKHaQbDWsQjRwIBw&url=http://alphaislandemirates.com/responsibilities/quality-control/&psig=AFQjCNGkPmI_NWJK4klLz5SOqIsg_sPrbA&ust=1494062473702915http://www.google.dk/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&cad=rja&uact=8&ved=0ahUKEwiMxLvTttjTAhWsApoKHaQbDWsQjRwIBw&url=http://alphaislandemirates.com/responsibilities/quality-control/&psig=AFQjCNGkPmI_NWJK4klLz5SOqIsg_sPrbA&ust=1494062473702915
-
Individual participants reports
Pending for the 2016/ 2017 trial
-
DTU Food, Technical University of Denmark
Sequencing method
Qubit Nanodrop DNA concentration notmeasured
Other0,00%
10,00%
20,00%
30,00%
40,00%
50,00%
60,00%
70,00%
80,00%
90,00%
For bacterial cultures, DNA concentration (ng/µl) prior to library preparation was measured on (please select one
answer)
-
DTU Food, Technical University of Denmark
Sequencing method
Bioanalyser Nanodrop DNA quality not measured Other0,00%
10,00%
20,00%
30,00%
40,00%
50,00%
60,00%
For bacterial cultures, DNA quality (e.g. RIN, 260/280 ratio and/or 260/230 ratio) prior to library preparation was
measured on (please select one answer)
-
DTU Food, Technical University of Denmark
Sequencing method
-
DTU Food, Technical University of Denmark
Sequencing method
Single-end Paired-end Not relevant0,00%
20,00%
40,00%
60,00%
80,00%
100,00%
120,00%
Sequencing details #1 (please select one answer)
-
DTU Food, Technical University of Denmark
QC parameters output
• 2 times standard deviation• 3 times standard deviation
-
DTU Food, Technical University of Denmark
• The proportion of reads produced which map directly to the closed genome of the same strain. (=> cannot exceed 100%)
Proportion of reads mapped to reference DNA sequence (%)
-
DTU Food, Technical University of Denmark
• The proportion of reads produced which map directly to the closed genome of the same strain. (=> cannot exceed 100%)
Proportion of reads mapped to reference DNA sequence (%)
% %
Outlier
Outlier
Consistency across samples• Indication of contamination or strain mix up
• #22 and 87 - missing reads• #31 – too many reads
Outlier
Outlier
-
DTU Food, Technical University of Denmark
• The proportion of contigs which map directly to the closed genome of the same strain (=> should not exceed 100%)
Size of assembled genome per total size of DNA sequence (%)
%%
-
DTU Food, Technical University of Denmark
• The proportion of contigs which map directly to the closed genome of the same strain (=> should not exceed 100%)
Size of assembled genome per total size of DNA sequence (%)
Outlier
%%
Outlier
• Assembly exceed the expected size of the reference - contamination
• #13, #32 Assembly lower that the expected size of the reference – poor performance
• #4; #5, #83
Outlier
Outlier
-
DTU Food, Technical University of Denmark
• Definition: The length for which the collection of all contigs of that length or longer contains at least half of the sum of the lengths of all contigs, and for which the collection of all contigs of that length or shorter also contains at least half of the sum of the lengths of all contigs. A N50 more than 15000 normally indicate good quality.
N50
bp bp
-
19
-
DTU Food, Technical University of Denmark
• Definition: The length for which the collection of all contigs of that length or longer contains at least half of the sum of the lengths of all contigs, and for which the collection of all contigs of that length or shorter also contains at least half of the sum of the lengths of all contigs. A N50 more than 15000 normally indicate good quality.
N50
15.000 Outlier
bp bp
Poor performance – short contigs• #22 for the Bact sample
15.000 Outlier
-
DTU Food, Technical University of Denmark
• The total number of contigs assembled. A number of contigs less than 200 normally indicate good quality.
Total number of contigs
bp bp
-
DTU Food, Technical University of Denmark
• The total number of contigs assembled. A number of contigs less than 200 normally indicate good quality.
Total number of contigs
Outlier
bp
200
bp
Poor performance – large number of contigs• #22, #23, #41, #37, #83
200 Outlier
Outlier
-
DTU Food, Technical University of Denmark
N50 No ctgs longest total bps species11905 793 60618 5413102 Escherichia coli58951 268 229096 5259803 Escherichia coli
490118 32 1549500 4703228 Salmonella enterica
”What comes together goes together”
-
DTU Food, Technical University of Denmark
Q1-ScorePhred quality score (Q score) between Q30 and Q40 indicate a 99.9 % base call accuracy with the probability of an incorrect base call in 1 out of 1,000 bp
-
DTU Food, Technical University of Denmark
Q-ScorePhred quality score (Q score) between Q30 and Q40 indicate a 99.9 % base call accuracy with the probability of an incorrect base call in 1 out of 1,000 bp
30
Some laboratories obtained a base calling accuracy, Phred quality score lower than 30 indicating some quality issue with the sequencing
Outlier
30 Outlier
-
DTU Food, Technical University of Denmark
SNP analysis
#83
Strain Sample Type Number of
SNPs
GMI16-001 Culture 3 DNA 0
Number of SNPs per strain
3 SNPs difference to the ref. (#83)
#83
Strain
Sample Type
Number of SNPs
GMI16-001
Culture
3
DNA
0
-
DTU Food, Technical University of Denmark
Overall Score – based on CDC scoring(different user codes)
-
DTU Food, Technical University of Denmark
Overall Score 2017, Salmonella 001/002
100 99-90 89-80 79-70 69-60 59-50 49-40 39-30 29-20 19-10 9-0
-
DTU Food, Technical University of Denmark
• GMI17-001 BACT:– GMI-150
• GMI17-001 DNA:– GMI-134, GMI-148, GMI-150, GMI-179
• GMI17-002 BACT:– GMI-150
• GMI17-002 DNA:– GMI-150
• 5 outliers
Summary 2017, Salmonella 001/002(different user codes)
-
DTU Food, Technical University of Denmark
Overall Score 2017, E. coli 003/004
100 99-90 89-80 79-70 69-60 59-50 49-40 39-30 29-20 19-10 9-0
-
DTU Food, Technical University of Denmark
• GMI17-003 BACT:– GMI-150, GMI-156,
• GMI17-003 DNA:– GMI-148, GMI-150, GMI-179
• GMI17-004 BACT:– GMI-150,
• GMI17-004 DNA:– GMI-134, GMI-148, GMI-150, GMI-179
• 5 outliers
Summary 2017, E. coli 003/004(different user codes)
-
DTU Food, Technical University of Denmark
Overall Score, S. aureus 005/006
100 99-90 89-80 79-70 69-60 59-50 49-40 39-30 29-20 19-10 9-0
-
DTU Food, Technical University of Denmark
• GMI17-005 BACT:
• GMI17-005 DNA:
• GMI17-006 BACT:– GMI-179
• GMI17-006 DNA:– GMI-179
• 1 outliers
Summary 2017, S. aureus 005/006(different user codes)
-
DTU Food, Technical University of Denmark
Summary of PT 2017
• Most laboratories performed well but six labs seems to have problems with
– Contaminations– Poor WGS performance – low yield – Need for optimization of WGS
• GMI-134, GMI-148, GMI-150, GMI-156, GMI-179
• The individual participants reports for 2017 pending
• PT report 2016 / 2017 published as a scientific paper in 2017
• No PT in 2018 – Consideration to terminate the PT– If continued in 2019
• focus on K. pneumonia, L. monocytogenes, C. coli / C. jejuni
-
AcknowledgementSusanne Klarsmose Pedersen (DTU Food)
Pimlapas Leekitcharoenphon (DTU Food)
Rolf Sommer Kaas (DTU Food)
Inge Marianne Hansen (DTU Food)
Jacob Dyring Jensen (DTU Food)
Frank Aarestrup (DTU Food)
Ole Lund (DTU Systems Biology)
Jose Luis Bellod Cisneros (DTU Systems Biology)
James Pettengill (US FDA)
Division of Microbiology (CFSAN/FDA)
Anthony Underwood (PHE)
Brian Beck (Microbiologics)
Isabel Cuesta de la Plaza (ISCIII)
Angel Zaballos (ISCIII)
Jorge De La Barrera Martinez (ISCIII)
…..and the rest of WG 4 (‘advisory group’)
GMI is supported by:
-
DTU Food, Technical University of Denmark
Thank you for your attention
Oksana Lukjancenko, PhD
Research Group Genomic Epidemiology
WHO Collaborating Centre for Antimicrobial Resistance in Food borne Pathogens
and Genomics
European Union Reference Laboratory for Antimicrobial Resistance
National Food Institute, Technical University of Denmark
mailto:[email protected]
Slide Number 1Objectives of GMI PTStructure of GMI PT, wet-labDevelopment of GMI PTParticipation in the 2016/2017 GMI PT�Participation in the 2017 GMI PT�Measured QC parametersIndividual participants reports�Sequencing methodSequencing methodSequencing methodSequencing methodQC parameters outputProportion of reads mapped to reference DNA sequence (%)Proportion of reads mapped to reference DNA sequence (%)Size of assembled genome per total size of DNA sequence (%)Size of assembled genome per total size of DNA sequence (%)N50Slide Number 19N50Total number of contigsTotal number of contigs”What comes together goes together”Slide Number 24Slide Number 25SNP analysisSlide Number 27Slide Number 28Slide Number 29Slide Number 30Slide Number 31Slide Number 32Slide Number 33Summary of PT 2017GMI is supported by:Slide Number 36