1 comprehensive evolution and molecular characteristics of ... · 24/04/2020 · in addition, for...
TRANSCRIPT
1
Comprehensive evolution and molecular characteristics of a large number of 1
SARS-CoV-2 genomes revealed its epidemic trend and possible origins 2
3
Yunmeng Bai1#, Dawei Jiang1#, Jerome R Lon1#, Xiaoshi Chen1, Meiling Hu1, Shudai Lin1, Zixi Chen1, 4
Xiaoning Wang1,2, Yuhuan Meng3*, Hongli Du1* 5
1School of Biology and Biological Engineering, South China University of Technology, Guangzhou 6
510006, China 7
2State Clinic Center of Gearitic, Chinese PLA General Hospital, Beijing 100853, China 8
3Guangzhou KingMed Transformative Medicine Institute Co., Ltd, Guangzhou 510330, China 9
# Equal contribution. 10
* Corresponding authors. 11
E-mail: [email protected], [email protected] 12
13
Abstract 14
Objectives: 15
To reveal epidemic trend and possible origins of SARS-CoV-2 by exploring its evolution and 16
molecular characteristics based on a large number of genomes since it has infected millions of 17
people and spread quickly all over the world. 18
Methods: 19
Various evolution analysis methods were employed. 20
Results: 21
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
2
The estimated Ka/Ks ratio of SARS-CoV-2 is 1.008 or 1.094 based on 622 or 3624 SARS-CoV-2 22
genomes, and the time to the most recent common ancestor (tMRCA) was inferred in late 23
September 2019. Further 9 key specific sites of highly linkage and four major haplotypes H1, H2, 24
H3 and H4 were found. The Ka/Ks, detected population size and development trends of each 25
major haplotype showed H3 and H4 subgroups were going through a purify evolution and almost 26
disappeared after detection, indicating H3 and H4 might have existed for a long time, while H1 27
and H2 subgroups were going through a near neutral or neutral evolution and globally increased 28
with time. Notably the frequency of H1 was generally high in Europe and correlated to death rate 29
(r>0.37). 30
Conclusions: 31
In this study, the evolution and molecular characteristics of more than 16000 genomic sequences 32
provided a new perspective for revealing epidemiology of SARS-CoV-2. 33
34
KEYWORDS: SARS-CoV-2; evolution; classification; haplotype 35
36
Introduction 37
The global outbreak of SARS-CoV-2 is currently and increasingly recognized as a serious, public 38
health concern worldwide. Coronaviruses exist widely around the world, and to date, a total of seven 39
types of coronaviruses that can infect humans have been found, of which four coronaviruses including 40
hCoV-229E, hCoV-NL63, hCoV-OC43 and hCoV-HKU1 cause a cold, while the other three including 41
SARS-CoV, MERS-CoV and SARS-CoV-2 usually cause mild to severe respiratory diseases. The total 42
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
3
number of SARS-CoV and MERS-CoV infections are only 8,069 and 2,494 with reproduction number 43
(R0) fluctuates 2.5-3.9 and 0.3-0.8, respectively. However, SARS-CoV-2 has infected 2471136 people 44
in 212 countries up to April 22th(WHO, 2020), with the basic R0 ranging from 1.4 to 6.49(Liu, et al., 45
2020). Among these three typical coronavirus, MERS-CoV has the highest death rate of 34.40%, 46
SARS-CoV has the modest death rate of 9.59%, SARS-CoV-2 is about 6.99% and 7.69% death rate of 47
the global and Wuhan, but is quite high death rate in some countries of Europe, such as Belgium, Italy, 48
United Kingdom, Netherlands, Spain and France, which even reaches 14.95%, 13.39%, 13.48%, 49
11.61%, 10.42% and 13.60% respectively according to the data of April 22th, 2020. Except for the 50
shortage of medical supplies, aging and other factors, it is not clear whether there is virus mutation 51
effect in these countries with such a significantly increased death rate. 52
It has been reported that SARS-CoV-2 belongs to beta-coronavirus and is mainly transmitted by the 53
respiratory tract, which belongs to the same subgenus (SarbeCoVirus) as SARS-CoV(Lu, et al., 2020). 54
Through analyzing the genome and structure of SARS-CoV-2, its receptor-binding domain (RBD) was 55
found to bind with angiotensin-converting enzyme 2 (ACE2), which is also one of the receptors for 56
binding SARS-CoV(Wrapp, et al., 2020). Some early genomic studies have shown that SARS-CoV-2 is 57
similar to certain bat viruses (RaTG13, with the whole genome homology of 96.2%(Zhou, et al., 2020)) 58
and Malayan pangolins coronaviruses (GD/P1L and GDP2S, with the whole genome homology of 59
92.4%(Lam, et al., 2020)). They have speculated several possible origins of SARS-CoV-2 based on its 60
spike protein characteristics (cleavage sites or the RBD)(Lam, et al., 2020, Zhang and Holmes, 2020, 61
Zhou, et al., 2020), in particular, SARS-CoV-2 exhibits 97.4% amino acid similarity to the Guangdong 62
pangolin coronaviruses in RBD, even though it is most closely related to bat coronavirus RaTG13 at 63
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
4
the whole genome level. However, it is not enough to present genome-wide evolution by a single gene 64
or local evolution of RBD, whether bats or pangolins play an important role in the zoonotic origin of 65
SARS-CoV-2 remains uncertain(Andersen, et al., 2020). Furthermore, since there is little molecular 66
characteristics analysis of SARS-CoV-2 based on a large number of genomes, it is difficult to 67
determine whether the virus has significant variation that affects its phenotype. Thereby, it is necessary 68
to further reveal the phylogenetic evolution and molecular characteristics of the whole genome of 69
SARS-CoV-2 in order to develop a comprehensive understanding of the virus and provide a basis for 70
the prevention and treatment of SARS-CoV-2. 71
72
Results 73
Genome sequences 74
We got a total of 1053 genomic sequences up to March 22th, 2020. According to the filter criteria, 37 75
sequences with ambiguous time, 314 with low quality and 78 with similarity of 100% were removed. A 76
total of 624 sequences were obtained to perform multiple sequences alignment. Two highly divergent 77
sequences (EPI_ ISL_414690, EPI_ISL_415710) according to the firstly constructed phylogenetic tree 78
were also filtered out (Table S1). The remaining 622 sequences were used to reconstruct a phylogenetic 79
tree. In addition, total of 3624 and 16373 genomic sequences were redownloaded up to April 6th, 2020 80
and May 10th, 2020 respectively for further exploring the evolution and molecular characteristics of 81
SARS-CoV-2. 82
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
5
Estimate of evolution rate and the time to the most recent common ancestor for SARS-CoV, 83
MERS-CoV, and SARS-CoV-2 84
In this study, date for SARS-CoV-2 ranged from 2019/12/26 to 2020/03/18 was collected. The average 85
Ka/Ks of all the coding sequences was closer to 1 (1.008), which indicating the genome was going 86
through a neutral evolution. We also reevaluated the Ka/Ks of SARS-CoV and MERS-CoV through the 87
whole period, and found the ratio were smaller than SARS-CoV-2 (Table 1). To estimate more credible 88
Ka/Ks for SARS-CoV-2, we recalculated it using redownloaded 3624 genome sequences ranged from 89
2019/12/26 to 2020/04/06, and the average Ka/Ks of it was 1.094 (Table 1), which was almost same 90
with the above result. 91
We assessed the temporal signal using TempEst v1.5.3(Rambaut, et al., 2016). All three datasets 92
exhibit a positive correlation between root-to-tip divergence and sample collecting time (Figure S1), 93
hence they are suitable for molecular clock analysis in BEAST(Bouckaert, et al., 2019, Rambaut, et al., 94
2016). The substitution rate of SARS-CoV-2 genome was estimated to be 1.601 x 10-3 (95% CI: 1.418 x 95
10-3 - 1.796 x 10-3, Table 2, Figure S2A) substitution/site/year, which is as in the same order of 96
magnitude as SARS-CoV and MERS-CoV. The tMRCA was inferred on the late September, 2019 (95% 97
CI: 2019/08/28- 2019/10/26,Table 2, Figure S2B), about 2 months before the early cases of 98
SARS-CoV-2(Huang, et al., 2020). 99
Phylogenetic tree and clusters of SARS-CoV-2 100
The no-root phylogenetic trees constructed by the maximum likelihood method with PhyML 101
3.1(Guindon, et al., 2010) and MEGA(Kumar, et al., 2018) were showed in Figure 1, Figure S3A and 102
3B. According to the shape of phylogenetic trees, we divided 622 sequences into three clusters: Cluster 103
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
6
1 including 76 sequences mainly from North America, Cluster 2 including 367 sequences from all 104
regions of the world, and Cluster 3 including 179 sequences mainly from Europe (Table S2). 105
The specific sites of each Cluster 106
The Fst and population frequency of a total of 9 sites (NC_045512.2 as reference genome) were 107
detected (Table 3, Table S3). Thereinto, three (C17747T, A17858G and C18060T) are the specific sites 108
of Cluster 1, and four (C241T, C3037T, C14408T and A23403G) are the specific sites of Cluster 3. 109
Notably, C241T was located in the 5’-UTR region and the others were located in coding regions (6 in 110
ofr1ab gene, 1 in S gene and 1 in ORF8 gene). Five of them were missense variant, including C14408T, 111
C17747T and A17858G in ofr1ab gene, A23403G in S gene, and T28144C in ORF8 gene. The PCA 112
results showed that these 9 specific sites could clearly separate the three Clusters, while all SNV 113
dataset could not clearly separate Cluster 1 and Cluster 2 (Figure S4), which further suggested that 114
these 9 specific sites are the key sites for separating the three Clusters. 115
Linkage of specific sites 116
We found the 9 specific sites are highly linkage based on 622 genome sequences (Figure 2A), then we 117
carried out a further linkage analysis using the 3624 genome sequences (Figure 2B). As a result, for the 118
3624 genome sequences, 3 specific sites in the Cluster 1 were almost complete linkage, and haplotype 119
CAC and TGT accounted for 98.65% of all the 3 site haplotypes. The same phenomenon was also 120
found in 4 specific sites of Cluster 3, and haplotype CCCA and TTTG accounted for 97.68% of all the 121
4 site haplotypes. Intriguingly, the 9 specific sites were still highly linkage, and four haplotypes, 122
including TTCTCACGT (H1), CCCCCACAT (H2), CCTCTGTAC (H3) and CCTCCACAC (H4), 123
accounted for 95.89% of all the 9 site haplotypes. Thereinto, H1 and H3 had completely different bases 124
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
7
at the 9 specific sites. The frequencies of each site and major haplotype for each country were showed 125
in Figure 3 and Table S4. The data showed that the haplotype TTTG of the 4 specific sites in Cluster 3 126
had existed globally at present, and still exhibited high frequencies in most European countries but 127
quite low in Asian countries. While the haplotype TGT of the 3 specific sites in Cluster 1 existed 128
almost only in North America and Australia. For the 9 specific sites, most countries had only two or 129
three major haplotypes, except America and Australia had all of four major haplotypes and with 130
relative higher frequencies. 131
Characteristics and epidemic trends of major haplotype subgroups 132
All haplotypes of 9 specific sites for 3624 or 16373 genomes and the numbers of them were shown in 133
Table S5. Four major haplotypes H1, H2, H3 and H4, three minor haplotypes H5, H7 and H8 close to 134
H1 and one minor haplotype H6 close to H3 were found in both datasets. The numbers of these 135
haplotypes for 16373 genomes with clear collection data detected in each country in chronological 136
order were shown in Figure 4A. From these results, H2 and H4 haplotype subgroups had existed for a 137
long time (2019-12-24 to 2020-04-28), the detected population size of H2 subgroup was far greater 138
than that of H4 subgroup, and the H3 haplotype subgroup almost disappeared after detection 139
(2020-02-18 to 2020-04-28), while H1 haplotype subgroup was globally increasing with time 140
(2020-02-18 to 2020-05-05), which indicated H1 subgroup had adapted to the human hosts, and was 141
under an adaptive growth period worldwide. However, due to the nonrandom sampling on early phase 142
(only patients having a recent travel to Wuhan were detected), some earlier cases of H3 may be lost, the 143
high proportion of H3 subgroup during February 18 to March 10 could confirm this point. 144
H3 and H4 subgroups had the lowest Ka/Ks ratio in 3624 or 16373 genomes among the 4 major 145
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
8
subgroups (Table 4), suggesting that H3 and H4 subgroups might be going through a purifying 146
evolution and have existed for a long time, while H2 and H1 subgroups might be going through a near 147
neutral or neutral evolution according to their Ka/Ks ratios (Table 4), which were consistent with the 148
above phenomenon that only H1 and H2 subgroups were expanding with time around the world. From 149
the whole genome mutations in each major haplotype subgroup (Figure 4B, Table S6), we found that 150
except the 9 specific sites, there was no common mutations with frequencies more than 0.05 in these 151
haplotype subgroups but between H2 and H4 subgroups. 152
Phylogenetic network of haplotype subgroups 153
Phylogenetic networks were inferred with 697 mutations called from 3624 genomes dataset. For these 154
datasets, the network structures of TCS and MSN were similar. The major haplotype subgroups H4 and 155
H2 were in the middle of the network, while H1 and H3 were in the end nodes of the network (Figure 156
5). According to the phylogenetic networks, we proposed four hypothesis: (1) the ancestral haplotypes 157
evolved in four directions to obtain H1, H2, H3 and H4 respectively or evolved in two or more 158
directions to obtain two or more major haplotypes and then involved into the other major haplotype(s); 159
(2) H2 or H4 evolved in two directions respectively and finally generated H1 and H3; (3) H1 evolved 160
in one direction to generate H2 and H4, and then evolved into H3; (4) H3 evolved in one direction to 161
generate H4 and H2, and then evolved into H1. Although we cannot exclude the first hypothesis based 162
on the present data, but if there are evolutionary relationships among the four major haplotypes, 163
according to the Ka/Ks, detected population size and development trends in chronological order of each 164
major haplotype subgroup, we speculate that the most likely evolution hypothesis is that H3 and H4 are 165
the earliest haplotypes, which is gradually eliminated with selection, while H2 is the transitional 166
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
9
haplotype in the evolution process, and H1 may be the haplotype to be finally fixed. 167
Correlation analysis of specific sites with death rate and infectivity 168
To explore the relationship between death rate and the 9 specific sites, we used Pearson method to 169
calculate the correlation coefficient between death rate and frequency of each specific site or major 170
haplotype in 17 countries with 3624 genomes at early stage. As a result, all r values of 241T, 3037T, 171
14408T, 23403G and haplotype TTTG and H1 were more than 0.4 (Figure 6, Table S4). We also 172
evaluated the correlation coefficient with 16373 genomes in 30 countries, the r values of haplotype 173
TTTG and H1 were still more than 0.37 (Table S4), which might be increased when the death rates of 174
some countries were further stabilized with time. These finding integrating their high frequencies in 175
most European countries indicated that the 4 sites and haplotype TTTG and H1 might be related to the 176
pathogenicity of SARS-CoV-2. 177
To explore the relationship between infectivity and the 9 specific sites, we used population size of 178
major haplotypes to deduce the possible specific sites related infectivity. We assumed that these major 179
haplotype in each country were subject to similar virus transmission and control patterns, while the 180
population sizes of H1 and H2 subgroups were far greater than those of H3 and H4 subgroups (Figure 181
4A, Table S4), then the common different specific sites of H1 and H2 subgroups with H3 and H4 182
subgroups, C8782T and T28144C, might be related to the infectivity of SARS-CoV-2, and the virus 183
with C8782 and T28144 might be more infectious than those virus with 8782T and 28144C. 184
185
Discussion 186
SARS-CoV-2 poses a great threat to the production, living and even survival of human beings(Guo, et 187
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
10
al., 2020). With the further outbreak of SARS-CoV-2 in the world, further understanding on the 188
evolution and molecular characteristics of SARS-CoV-2 based on a large number of genome sequences 189
will help us cope better with the challenges brought by SARS-CoV-2. 190
Exploring evolution rate, tMRCA and phylogenetic tree of SARS-CoV-2 would help us better 191
understand the virus(Yuen, et al., 2020). The average Ka/Ks for all the coding sequences of 622 and 192
3624 SARS-CoV-2 genomes was 1.008 and 1.094, which was higher than those of SARS-CoV and 193
MERS-CoV, indicating that the SARS-CoV-2 was going through a neutral evolution. In general, 194
SARS-CoV would show purifying selection pressures and the Ka/Ks would go down at the middle or 195
end of the epidemic(He, et al., 2004). Therefore, it seemed to be reasonable considering that 196
SARS-CoV-2 was under an exponential growth period around the world. Interestingly, we also found 197
that the subgroups of different haplotypes (H1, H2, H3 and H4) seemed to experience different 198
evolutionary patterns according to their Ka/Ks. The H3 haplotype subgroup disappeared soon after 199
detection (2020-02-18 to 2020-04-28, Figure 4A), while H1 haplotype subgroup was globally 200
increasing with time, these evolution and changes should be considered in developing therapeutic drugs 201
and vaccines. The tMRCA of SARS-CoV-2 was inferred on the late September, 2019 (95% CI: 202
2019/08/28- 2019/10/26), about 2 months before the early cases of SARS-CoV-2(Huang, et al., 2020). 203
We also estimated tMRCA of SARS-CoV and MERS-CoV with the same methods, both were about 3 204
months later than the corresponding tMRCAs estimated by previous studied(Cotten, et al., 2014, Zhao, 205
et al., 2004). A recent study used TreeDater method to estimate tMRCA for more than 7000 206
SARS-CoV-2 genomes and indicated the tMRCA of SARS-CoV-2 was around 6 October 2019 to 11 207
December 2019, which were about one and a half months later than ours and in broad agreement with 208
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
11
six previous studies all performed on no more than 120 early SARS-CoV-2 genomes with BEAST 209
method(van Dorp, et al., 2020). While we used the most common method BEAST to estimate tMRCA 210
of SARS-CoV-2 based on 622 genomes and combined with the comparison of tMRCA of SARS-CoV 211
and MERS-CoV, indicating that the estimated tMRCA in the present study are more reliable. 212
A recent study clustered 160 SARS-CoV-2 whole-genome sequences into A, B and C groups by a 213
phylogenetic network analysis by taking bat RaTG13 as root (Forster, et al., 2020). The cluster result 214
was similar to our study: both the samples in Cluster A and our Cluster 1 were mainly from the United 215
States; the samples in Cluster C and our Cluster 3 were mainly from European countries, while Cluster 216
B and our Cluster 2 were mainly from China and the other regions. It was interesting that the markers 217
C8782T and T28144C, which were also discovered by Yu et al(Yu, et al., 2020) , in Cluster B were 218
also found in our study, but the other markers in Cluster A (T29095C) and Cluster C (G26144T) were 219
not significantly in our study. That may be caused by different sample sizes and different constructing 220
methods of phylogenetic tree. Based on the base substitution model, the ML method avoids the 221
possible "long-branch attraction" problem in the maximum parsimony method and is faster than the 222
Bayesian method(Holder and Lewis, 2003), hence it could be used as a reliable method for 223
phylogenetic analysis. Some studies used the genome of bat SARS-like-CoV(Zhang, et al., 2020), 224
RaTG13(Zhang, et al., 2020) or MT019529 (https://bigd.big.ac.cn/ncov/tree) as the root of 225
phylogenetic tree. Unfortunately, there was no obvious evidence showing that SARS-CoV-2 was from 226
the bat coronavirus even the identity between SARS-CoV-2 and RaTG13 was up to 96.2%(Zhou, et al., 227
2020). In our study, the tMRCA of SARS-CoV-2 was inferred on the late September 2019, which 228
indicated there might exist an earlier SARS-CoV-2 strain we didn't find. Then in the case of unclear 229
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
12
source of SARS-CoV-2 and high homology of its genomes (> 99.9% homology mostly), it may be 230
inappropriate to identify the evolutionary characteristics inside the genomes by taking bat 231
SARS-like-CoV, RaTG13 or MT019529 as root. Therefore, the maximum likelihood method was used 232
in current study to construct a no-root tree to obtain the reliable clusters with different characteristics. 233
Based on the no-root tree, we identified 9 specific sites of highly linkage that play a decisive role in the 234
classification of clusters successfully. Among the four major haplotypes, H1 and H3 were in Cluster 3 235
and Cluster 1, respectively, while there were two haplotypes H2 and H4 in Cluster 2 (Figure 1, Figure 236
S3B). 237
Among the 9 specific sites, 8 of them were located in coding regions (6 in ofr1ab gene, 1 in S gene 238
and 1 in ORF8 gene), and 5 of them were missense variant, including C14408T, C17747T and 239
A17858G in ofr1ab gene, A23403G in S gene, and C28144T in ORF8 gene. Orf1ab is composed of 240
two partially overlapping open reading frames (orf), orf1a and 1b. It is proteolytic cleaved into 16 241
non-structural proteins (nsp), including nsp1 (suppress antiviral host response), nsp9 (RNA/DNA 242
binding activity), nsp12 (RNA-dependent RNA polymerase), nsp13 (helicase) and others(Chan, et al., 243
2020), indicating the vital role of it in transcription, replication, innate immune response and 244
virulence(Graham, et al., 2008). C14408T with high frequencies of T in European countries were 245
located at nsp12 region, indicating that this missense variant might influence the role of RNA 246
polymerase. Spike glycoprotein, the largest structural protein on the surface of coronaviruses, 247
comprises of S1 and S2 subunits which mediating binding the receptor on the host cell surface and 248
fusing membranes, respectively(Li, 2016). It has been reported that S protein of SARS-CoV-2 can bind 249
angiotensin-converting enzyme 2 (ACE2) with higher affinity than that of SARS(Wrapp, et al., 2020, 250
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
13
Zhou, et al., 2020). Whether the missense variant of A23403G in S gene, which with high frequencies 251
of G in European countries, will change the affinity between S protein and ACE2 remains to be further 252
investigated. 253
It seems to take a long time to finally fix mutations according to the mutation frequency of each 254
subgroup. For example, H2 and H4 subgroups, which have been detected for more than four months 255
from December 24, 2019 to May 5, 2020 (Figure 4A), have more mutations with higher frequencies, 256
but the highest mutation frequency is only 0.486 at the position of 11083 (Figure 4B, Table S6). From 257
these phenomena, it can be inferred that it takes a long time for the specific sites of each major 258
subgroup to be fixed, but it may be faster if the early population is small. In addition, there is also the 259
possibility that an ancestor strain evolved in four directions by obtaining the specific mutations directly 260
and produced the four current major haplotypes, so the evolution time for obtaining four major 261
haplotypes may be shorter, which seems to be consistent with the phenomenon that four major 262
haplotypes can be detected in two months (Figure 4A). However, if there exist evolution relationship 263
among the major haplotypes of SARS-CoV-2, it is difficult to complete the evolution among the four 264
major haplotypes within two months (2019-12-24 to 2020-02-18, Figure 4A) at the current evolution 265
rate of each major haplotype population (Table 4). Therefore, we speculate that the transformation 266
among the four major haplotypes may have been completed for a long time, which have not been 267
detected. What's interesting is that only United States and Australia among 29 countries had all of four 268
major haplotypes and with relative higher frequency (Figure 4A, Table S4), which indicated that the 269
two countries are the most likely places where the virus appeared earlier based on all the present data. 270
Previous studies showed that the death rate of SARS-CoV-2 is affected by many factors, such as 271
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
14
medical supplies, aging, life style and other factors(Liu, et al., 2020, Malavolta, et al., 2020). However, 272
our study found that the 4 specific sites (C241T, C3037T, C14408T and A23403G) in the Cluster 3 273
were almost complete linkage, and the frequency of haplotype TTTG was generally high in European 274
countries and correlated to death rate (r>0.37) both based on 3624 or 16373 SARS-CoV-2 genomes, 275
which provides a new perspective to the reasons of relatively high death rate in Europe, and these 276
specific sites should be considered in designing new vaccine and drug development of SARS-CoV-2. 277
Two possible specific sites C8782T and T28144C related to the infectivity of SARS-CoV-2 were also 278
deduced in the present study, which would provide some basis for SARS-CoV-2 epidemiology. 279
280
Conclusion and prospective 281
The Ka/Ks ratio of SARS-CoV-2 and tMRCA of SARS-CoV-2 (95% CI: 2019/08/28- 2019/10/26) 282
indicated that SARS-CoV-2 might have completed the selection pressure of cross-host evolution in the 283
early stage and be going through a neutral evolution at present. The 9 specific sites with highly linkage 284
were found to play a decisive role in the classification of clusters. Thereinto, 3 of them are the specific 285
sites of Cluster 1 mainly from North America and Australia, 4 of them are the specific sites of Cluster 3 286
mainly from Europe in the early stage, both of these 3 and 4 specific sites are almost complete linkage. 287
The frequencies of haplotype TTTG for the 4 specific sites and H1 for 9 specific sites were generally 288
high in European countries and correlated to death rate (r>0.37) based on 3624 or 16373 SARS-CoV-2 289
genomes, which suggested these haplotypes might relate to pathogenicity of SARS-CoV-2 and 290
provided a new perspective to the reasons of relatively high death rates in Europe. The relationship 291
between the haplotype TTTG or H1 and the pathogenicity of SARS-CoV-2 needs to be further verified 292
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
15
by clinical samples or virus virulence test experiment, and the 9 specific sites with highly linkage may 293
provide a starting point for the traceability research of SARS-CoV-2. Given that the different evolution 294
patterns of different haplotypes subgroups, we should consider these evolution and changes in the 295
development of therapeutic drugs and vaccines. 296
297
Materials and methods 298
Genome sequences 299
The complete genome sequences of SARS-CoV-2 were downloaded from China National Center for 300
Bioinformation (https://bigd.big.ac.cn/ncov/release_genome/) and GISAID (https://www.gisaid.org/) 301
up to March 22th, 2020. The sequences were filtered out according to the following criteria: (1) 302
sequences with ambiguous time; (2) sequences with low quality which contained the counts of 303
unknown bases > 15 and degenerate bases > 50 (https://bigd.big.ac.cn/ncov/release_genome); (3) 304
sequences with similarity of 100% were removed to unique one. Finally, 624 high quality genomes 305
with precise collection time were selected and aligned using MAFFT v7 with automatic parameters. 306
Besides, the genome sequences of 7 SARS-CoV and 475 MERS-CoV were also downloaded from 307
NCBI (https://www.ncbi.nlm.nih.gov/), and the MERS-CoV dataset including samples collected from 308
both human and camel. In addition, for further exploring evolution and molecular characteristics of 309
SARS-CoV-2 based on the larger amount of genomic data, we redownloaded validation datasets of the 310
genome sequences from GISAID up to April 6th, 2020 and May 10th, 2020 respectively. 311
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
16
Estimate of evolution rate and the time to the most recent common ancestor for SARS-CoV, 312
MERS-CoV, and SARS-CoV-2 313
The average Ka, Ks and Ka/Ks for all coding sequences were calculated using KaKs_Calculator 314
v1.2(Zhang, et al., 2006), and the substitution rate and tMRCA were estimated using BEAST 315
v2.6.2(Bouckaert, et al., 2019). The temporal signal with root-to-tip divergence was visualized in 316
TempEst v1.5.3(Rambaut, et al., 2016) using a ML whole genome tree with bootstrap value as input. 317
For SARS-CoV and SARS-CoV-2, we selected a strict molecular clock and Coalescent Exponential 318
Population Model. For MERS-CoV, we selected a relaxed molecular clock and Birth Death Skyline 319
Serial Cond Root Model. We used the tip dates and chose the HKY as the site substitution model in all 320
these analyses. Markov Chain Monte Carlo (MCMC) chain length was set to 10,000,000 steps 321
sampling after every 1000 steps. The output was examined in Tracer v1.6 322
(http://tree.bio.ed.ac.uk/software/tracer/). 323
Variants calling of SARS-CoV-2 genome sequences 324
Each genome sequence was aligned to the reference genome (NC_045512.2) using bowtie2 with 325
default parameters(Langmead and Salzberg, 2012), and variants were called by samtools (sort; mpileup 326
-gf) and bcftoots (call -vm). The merge VCF files were created by bgzip and bcftools (merge 327
--missing-to-ref)(Li, 2011, Li, et al., 2009). 328
Phylogenetic tree construction and virus isolates clustering for SARS-CoV-2 329
After alignment of 624 SARS-CoV-2 genomes with high quality and manually deleted 2 highly 330
divergent genomes (EPI_ISL_415710, EPI_ISL_414690) according to the firstly constructed 331
phylogenetic tree, the aligned dataset of 622 sequences was phylogenetically analyzed. The SMS 332
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
17
method was used to select GTR+G as the base substitution model(Lefort, et al., 2017), and the PhyML 333
3.1(Guindon, et al., 2010) and MEGA(Kumar, et al., 2018) were used to construct the no-root 334
phylogenetic tree by the maximum likelihood method with the bootstrap value of 100. The online tool 335
iTOL(Letunic and Bork, 2019) was used to visualized the phylogenetic tree. The clusters were defined 336
by the shape of phylogenetic tree. 337
Detection of specific sites from each Cluster 338
Information (ID, countries/regions and collection time) and variants (NC_045512.2 as reference 339
genome) of each genome from each Cluster were extracted. The allele frequency and nucleotide 340
divergency (pi) for each site in the virus population of each Cluster were measured by 341
vcftools(Danecek, et al., 2011). The Fst were also calculated by vcftools(Danecek, et al., 2011) to 342
assess the diversity between the Clusters. Sites with high level of Fst together with different major 343
allele in each Cluster were filtered as the specific sites. PCA were analyzed by the GCTA 344
v1.93.1beta(Yang, et al., 2011) with the specific sites and all SNV dataset respectively. 345
Linkage analysis of specific sites and characteristics of major haplotype subgroups 346
The linkage disequilibrium of the specific sites were analyzed by haploview(Barrett, et al., 2005), and 347
the statistics of the haplotype of the specific sites for each Cluster or country were used in-house perl 348
script. 349
Phylogenetic network of haplotype subgroups 350
The phylogenetic networks were inferred by PopART package v1.7.2(Leigh, et al., 2015) using TCS 351
and minimum spanning network (MSN) methods respectively. 352
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
18
Frequencies of specific sites or haplotypes and correlation with death rate 353
The frequencies of specific sites for each country were calculated. The death rate was estimated with 354
Total Deaths /Confirmed Cases based on the data from Johns Hopkins resources on May 12th, 2020 355
(https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6).The 356
correlation coefficient between death rate and frequencies of specific site or haplotype in different 357
countries was calculated using Pearson method. 358
359
Authors’ contributions 360
HD conceived the study. YM, YB, DJ, JL and XC carried out the data analysis and wrote the 361
manuscript. MH, SL, and ZC collected data and revised the manuscript. XW attended the discussions. 362
HD and YM supervised the whole work and revised the manuscript. 363
Conflict of Interest 364
The authors declare no conflict of interest. 365
Funding sources 366
This work was supported by the National Key R&D Program of China [2018YFC0910201], the Key 367
R&D Program of Guangdong Province [2019B020226001], the Science and the Technology Planning 368
Project of Guangzhou [201704020176 and 2020Q-P013]. 369
Ethical Approval 370
Not required. 371
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
19
372
References 373
Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF. (2020) The proximal origin of 374 SARS-CoV-2. Nature Medicine, 10.1038/s41591-020-0820-9 375 Barrett JC, Fry B, Maller J, Daly MJ. (2005) Haploview: analysis and visualization of LD and 376 haplotype maps. Bioinformatics, 21:263-5. 377 Bouckaert R, Vaughan TG, Barido-Sottani J, et al. (2019) BEAST 2.5: An advanced software platform 378 for Bayesian evolutionary analysis. PLoS Comput Biol, 15:e1006650. 379 Chan JF, Kok KH, Zhu Z, et al. (2020) Genomic characterization of the 2019 novel human-pathogenic 380 coronavirus isolated from a patient with atypical pneumonia after visiting Wuhan. Emerg Microbes 381 Infect, 9:221-36. 382 Cotten M, Watson SJ, Zumla AI, et al. (2014) Spread, circulation, and evolution of the Middle East 383 respiratory syndrome coronavirus. mBio, 5: 384 Danecek P, Auton A, Abecasis G, et al. (2011) The variant call format and VCFtools. Bioinformatics 385 (Oxford, England), 27:2156-8. 386 Forster P, Forster L, Renfrew C, Forster M. (2020) Phylogenetic network analysis of SARS-CoV-2 387 genomes. Proceedings of the National Academy of Sciences, 10.1073/pnas.2004999117202004999. 388 Graham RL, Sparks JS, Eckerle LD, Sims AC, Denison MR. (2008) SARS coronavirus replicase 389 proteins in pathogenesis. Virus Res, 133:88-100. 390 Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. (2010) New algorithms and 391 methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst 392 Biol, 59:307-21. 393 Guo YR, Cao QD, Hong ZS, et al. (2020) The origin, transmission and clinical therapies on 394 coronavirus disease 2019 (COVID-19) outbreak - an update on the status. Mil Med Res, 7:11. 395 He J, Peng G, Min J. (2004) Molecular Evolution of the SARS Coronavirus During the Course of the 396 SARS Epidemic in China. Science, 303: 397 Holder M, Lewis PO. (2003) Phylogeny estimation: traditional and Bayesian approaches. Nat Rev 398 Genet, 4:275-84. 399 Huang C, Wang Y, Li X, et al. (2020) Clinical features of patients infected with 2019 novel coronavirus 400 in Wuhan, China. Lancet, 395:497-506. 401 Kumar S, Stecher G, Li M, Knyaz C, Tamura K. (2018) MEGA X: Molecular Evolutionary Genetics 402 Analysis across Computing Platforms. Mol Biol Evol, 35:1547-9. 403 Lam TT, Shum MH, Zhu HC, et al. (2020) Identifying SARS-CoV-2 related coronaviruses in Malayan 404 pangolins. Nature, 10.1038/s41586-020-2169-0 405 Langmead B, Salzberg SL. (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods, 9:357-9. 406 Lefort V, Longueville JE, Gascuel O. (2017) SMS: Smart Model Selection in PhyML. Mol Biol Evol, 407 34:2422-4. 408
Leigh JW, Bryant D, Nakagawa S. (2015) popart: full‐feature software for haplotype network 409
construction. Methods in Ecology and Evolution, 6:1110-6. 410
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
20
Letunic I, Bork P. (2019) Interactive Tree Of Life (iTOL) v4: recent updates and new developments. 411 Nucleic Acids Res, 47:W256-W9. 412 Li F. (2016) Structure, Function, and Evolution of Coronavirus Spike Proteins. Annu Rev Virol, 413 3:237-61. 414 Li H. (2011) A statistical framework for SNP calling, mutation discovery, association mapping and 415 population genetical parameter estimation from sequencing data. Bioinformatics (Oxford, England), 416 27:2987-93. 417 Li H, Handsaker B, Wysoker A, et al. (2009) The Sequence Alignment/Map format and SAMtools. 418 Bioinformatics (Oxford, England), 25:2078-9. 419 Liu Y, Du X, Chen J, et al. (2020) Neutrophil-to-lymphocyte ratio as an independent risk factor for 420 mortality in hospitalized patients with COVID-19. J Infect, 10.1016/j.jinf.2020.04.002 421 Liu Y, Gayle AA, Wilder-Smith A, Rocklov J. (2020) The reproductive number of COVID-19 is higher 422 compared to SARS coronavirus. J Travel Med, 27: 423 Lu R, Zhao X, Li J, et al. (2020) Genomic characterisation and epidemiology of 2019 novel 424 coronavirus: implications for virus origins and receptor binding. Lancet, 395:565-74. 425 Malavolta M, Giacconi R, Brunetti D, Provinciali M, Maggi F. (2020) Exploring the Relevance of 426 Senotherapeutics for the Current SARS-CoV-2 Emergency and Similar Future Global Health Threats. 427 Cells, 9: 428 Rambaut A, Lam TT, Max Carvalho L, Pybus OG. (2016) Exploring the temporal structure of 429 heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol, 2:vew007. 430 van Dorp L, Acman M, Richard D, et al. (2020) Emergence of genomic diversity and recurrent 431 mutations in SARS-CoV-2. Infection, Genetics and Evolution, 432 https://doi.org/10.1016/j.meegid.2020.104351104351. 433 WHO. (2020) Coronavirus disease 2019 (COVID-19) Situation Report – 93. WHO, 434 Wrapp D, Wang N, Corbett KS, et al. (2020) Cryo-EM structure of the 2019-nCoV spike in the 435 prefusion conformation. Science, 367:1260-3. 436 Yang J, Lee SH, Goddard ME, Visscher PM. (2011) GCTA: a tool for genome-wide complex trait 437 analysis. Am J Hum Genet, 88:76-82. 438 Yu WB, Tang GD, Zhang L, Corlett RT. (2020) Decoding the evolution and transmissions of the novel 439 pneumonia coronavirus (SARS-CoV-2 / HCoV-19) using whole genomic data. Zool Res, 41:247-57. 440 Yuen K-S, Ye Z-W, Fung S-Y, Chan C-P, Jin D-Y. (2020) SARS-CoV-2 and COVID-19: The most 441 important research questions. Cell Biosci, 10:40. 442 Zhang L, Shen F-m, Chen F, Lin Z. (2020) Origin and Evolution of the 2019 Novel Coronavirus. 443 Clinical Infectious Diseases, 10.1093/cid/ciaa112 444 Zhang YZ, Holmes EC. (2020) A Genomic Perspective on the Origin and Emergence of SARS-CoV-2. 445 Cell, 181:223-7. 446 Zhang Z, Li J, Zhao XQ, Wang J, Wong GK, Yu J. (2006) KaKs_Calculator: calculating Ka and Ks 447 through model selection and model averaging. Genomics Proteomics Bioinformatics, 4:259-63. 448 Zhao Z, Li H, Wu X, et al. (2004) Moderate mutation rate in the SARS coronavirus genome and its 449 implications. BMC Evol Biol, 4:21. 450 Zhou P, Yang XL, Wang XG, et al. (2020) A pneumonia outbreak associated with a new coronavirus of 451 probable bat origin. Nature, 579:270-3. 452
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
21
453
Figure Legends 454
Figure 1 Phylogenetic tree and clusters of 622 SARS-CoV-2 genomes 455
The 622 sequences were clustered into three clusters: Cluster 1 were mainly from North America, 456
Cluster 2 were from regions all over the world, and Cluster 3 were mainly from Europe. 457
Figure 2 Linkage disequilibrium plot of haplotypes of the 9 specific sites 458
A. The plot for 622 genome sequences; B. The plot for 3624 genome sequences. 459
Figure 3 The frequencies of both the 9 specific sites and haplotypes 460
The frequencies of the 9 specific sites (A) and haplotypes (B) in each country for 3624 genomes. 461
Figure 4 The characteristics of haplotype subgroups 462
A. The numbers of haplotypes of the 9 specific sites for 16373 genomes with clear collection data 463
detected in each country in chronological order; B. The whole genome mutations in each major 464
haplotype subgroup. 465
Figure 5 Phylogenetic network of haplotype subgroups for 3624 genomes 466
The network was inferred by POPART using TCS method. Each colored vertex represents a haplotype, 467
with different colors indicating the different sampling areas. Hatch marks along edge indicated the 468
number of mutations. Small black circles within the network indicated unsampled haplotypes. H1-H5 469
subgroups were point out according haplotypes of the 9 specific sites, and other small subgroups were 470
not specially pointed out. 471
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
22
Figure 6 The correlation between death rate and frequencies of both the 9 specific sites and 472
haplotypes 473
474
Tables 475
Table 1 Statistics of Ka, Ks and Ka/Ks ratios for all coding regions of the SARS-CoV, 476
MERS-CoV and SARS-CoV-2 genome sequences 477
Table 2 Substitution rate and tMRCA estimated by BEAST v2.6.2 478
Table 3 The information of the 9 specific sites in each Cluster 479
Table 4 Statistics of Ka, Ks and Ka/Ks ratios for all coding regions of each 4 major haplotype 480
subgroup with 3624 and 16373 genomes respectively 481
482
Supplementary material 483
Figure S1 Regression analyses of the root-to-tip divergence with the maximum-likelihood 484
phylogenetic tree of SARS-CoV (A), MERS-CoV (B) and SARS-CoV-2 (C) 485
The root-to-tip genetic distances were inferred in TempEst v1.5.3. Three data sets exhibit a positive 486
correlation between root-to-tip divergence and sample collecting time, and they appear to be suitable 487
for molecular clock analysis. 488
Figure S2 Substitution rate and tMRCA estimate of SARS-CoV-2 489
A. Estimated substitution rate of SARS-CoV-2 using BEAST v2.6.2 under a strict molecular clock. The 490
mean rate was 1.601 x 10-3 (substitutions per site per year). B. The estimated tMRCA of SARS-CoV-2 491
using BEAST v2.6.2 under a strict molecular clock. The mean tMRCA was 2019.74 (date: 492
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
23
2019-09-27). 493
Figure S3 Phylogenetic tree of 622 SARS-CoV-2 genomes 494
A. Phylogenetic tree constructed by PhyML with bootstrap value of 100; B. Phylogenetic tree 495
constructed by MEGA with bootstrap value of 100. 496
Figure S4 The PCAs analysis result based on the 9 specific sites (A) and all SNVs (B) 497
Table S1 The information of the first downloaded complete genome sequences 498
Table S2 The sequence IDs in each Cluster 499
Table S3 The details information of the SNVs in all Clusters 500
Table S4 The information of death rate, frequencies of the 9 specific sites and major haplotypes 501
in each country for 3624 and 16373 genomes respectively 502
Table S5 All haplotypes of the 9 specific sites and numbers of them for 3624 and 16373 genomes 503
respectively 504
Table S6 Whole genome mutations and frequencies in each major haplotype subgroup of 3624 505
and 16373 genomes respectively 506
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
EPI.ISL.413573
EPI.ISL.414624
EPI.ISL.413018
EPI.ISL.415605
EPI.ISL.413577
EPI.ISL.412873
EPI.ISL.41
5707
EPI.ISL.403963
EPI.ISL.416338
MT152824
EPI.ISL.415154
EPI.ISL.4140
20
EPI.ISL.414617
EPI.ISL.410531
EPI.IS
L.4156
41
EPI.ISL.416415
70-
20
03
10
06
CD
MN
EPI.ISL.413015
EPI.ISL.415626
EPI.IS
L.4164
82
EPI.ISL.415711
EPI.ISL.413853
EPI.ISL.416448
MN994468
EPI.ISL.415471
EPI.ISL.412872
EPI.ISL.414537
EPI.ISL.413459
EPI.ISL.416411
EPI.ISL.413864
EPI.ISL.416442
EPI.ISL.414563
EPI.ISL.414571
EPI.ISL.416036
EPI.ISL.414544
EPI.ISL.414638
MT159712
EPI.ISL.413599
EPI.ISL.41
4011
EPI.ISL.414554
EPI.ISL.414630
EPI.IS
L.4154
56
EPI.ISL.413214
EPI.ISL.413582
EPI.ISL.415621
EPI.ISL.
415128
EPI.ISL.415700
EPI.ISL.416373
EPI.ISL.411915
MT106053
EPI.ISL.412981
MT019529
EPI.ISL.415640
MT072688
EPI.ISL.407896
EPI.ISL.416451
EPI.ISL.415602
EPI.ISL.415625
MN988713
EPI.ISL.406533
EPI.ISL.415596
EPI.ISL.415543
EPI.ISL.414429
EPI.ISL.407893
EPI.ISL.415606
EPI.ISL.416521
EPI.ISL.406973
EPI.ISL.413861
EPI.ISL.415608
EPI.ISL.41637679
44
14.
LSI.I
PE
EPI.ISL.
414005
EPI.ISL.408514
EPI.IS
L.4105
46
EPI.IS
L.4145
24
EPI.IS
L.4157
09
EPI.IS
L.4140
45
EPI.ISL.416336
MT163718
MT159718
EPI.ISL.413555
EPI.ISL.415609
13
55
14.
LSI.I
PE
EPI.ISL.414632
64
46
14.
LSI.I
PE
EPI.ISL.415595
EPI.ISL.407987
EPI.ISL.412978
EPI.ISL.413863
EPI.ISL.415627
EPI.ISL.416497
EPI.ISL.413490
EPI.ISL.
413648
EPI.ISL.407313
EPI.ISL.415619
EPI.ISL.406595
EPI.IS
L.4060
31
EPI.ISL.413996
EPI.ISL.408668
EPI.ISL.408485
EPI.ISL.408976
12
54
14.
LSI
.IP
E
MT188341
EPI.ISL.415516
87
48
04.
LSI
.IP
E
EPI.ISL.408977
MT066176EPI.ISL.410717
EPI.ISL.416519
EPI.IS
L.4165
07
EPI.ISL.416402
EPI.ISL.415156
EPI.ISL.416333
EPI.ISL.413020
EPI.ISL.415607
EPI.ISL.415158
MN994467
EPI.ISL.414536
NMDC60013002-09
EPI.ISL.414597
EPI.ISL.414467
MT012098
EPI.ISL.408486
EPI.ISL.416458
EPI.ISL.416361
EPI.ISL.414625
EPI.ISL.415603
EPI.ISL.412975
MT159720
78
44
14.
LSI.I
PE
EPI.ISL.415646
EPI.ISL.415153
26
93
04.
LSI
.IP
E
LC522974
EPI.ISL.416412
EPI.ISL.415485
EPI.ISL.415597
EPI.ISL.414470
EPI.ISL.415654
EPI.ISL.414634
EPI.ISL.415650
EPI.ISL.415614
EPI.ISL.415486
EPI.ISL.413860
EPI.ISL.414635
MT123293
EPI.ISL.4140
19
EPI.ISL.415467
EPI.ISL.415598
EPI.ISL.415612
EPI.ISL.415581
EPI.ISL.
416493
EPI.ISL.403936
EPI.ISL.415501
EPI.ISL.411950
EPI.ISL.415129
EPI.ISL.413595
EPI.ISL.415658
EPI.ISL.
414525
EPI.ISL.415491
EPI.ISL.414618
82
93
14.
LSI.I
PE
EPI.ISL.414600
EPI.ISL.408488
EPI.ISL.416431
97
36
14.
LSI
.IP
E
02
64
14.
LSI.I
PE
EPI.ISL.416377
EPI.IS
L.4157
01
EPI.ISL.414601
EPI.ISL.412870
EPI.ISL.415698
LC521925
EPI.ISL.414626
EPI.ISL.415534
EPI.ISL.411952
EPI.ISL.415476
52
35
89
NM
EPI.ISL.414663
EPI.ISL.414526
EPI.ISL.416363
EPI.IS
L.4139
99
EPI.ISL.415525
EPI.ISL.415488
MT126808
EPI.ISL.413600
EPI.ISL.412912
EPI.ISL.416317
EPI.ISL.413651
EPI.ISL.414688
EPI.ISL.412026
MN996531
EPI.ISL.414595
EPI.ISL.407193
EPI.ISL.414578
MN938384
EPI.ISL.416035
EPI.ISL.416425
EPI.ISL.414691
EPI.ISL.414633
EPI.ISL.415157
EPI.ISL.413014
MT159707
EPI.ISL.412030
EPI.ISL.415473
EPI.ISL.416498
EPI.ISL.416465
MT019532
EPI.ISL.416353
EPI.ISL.416436
EPI.ISL.415620
EPI.ISL.416389
EPI.ISL.415497
EPI.ISL.414621
EPI.ISL.
414006
23
12
04.
LSI
.IP
E
45
06
01
TM
EPI.ISL.414938
EPI.ISL.408479
EPI.ISL.41
5704
EPI.ISL.416513
EPI.IS
L.4154
55
EPI.ISL.415468
LC522975
EPI.ISL.416473
EPI.ISL.413604
EPI.ISL.406535
EPI.ISL.416479
EPI.ISL.410714
EPI.ISL.413580
EPI.ISL.411
219
EPI.ISL.413857
EPI.ISL.415613
EPI.ISL.416455
MT044258
EPI.ISL.411926
EPI.ISL.414522
NMDC60013002-08
MT135041
EPI.ISL.409067
EPI.ISL.4164
89
EPI.ISL.416024
EPI.ISL.408430
EPI.ISL.413456
EPI.ISL.406592
EPI.ISL.415592
EPI.ISL.416398
EPI.ISL.414586
EPI.ISL.411902
MT159716
EPI.ISL.415615
EPI.ISL.414580
EPI.ISL.414627
MT019533
EPI.ISL.416512
EPI.ISL.415457
EPI.ISL.413022
EPI.ISL.412982
EPI.ISL.416403
EPI.ISL.416514
EPI.ISL.410719
MN908947
EPI.ISL.416320
EPI.ISL.414428
EPI.ISL.413854
EPI.IS
L.4144
45
EPI.ISL.416342
EPI.ISL.412973
EPI.ISL.
414631
EPI.ISL.415458
MT184908
EPI.ISL.414579
MT007544
EPI.ISL.416397
EPI.ISL.413653
EPI.ISL.410716
EPI.ISL.41
6426
EPI.ISL.41
4451
EPI.ISL.414539
EPI.ISL.416434
EPI.ISL.414023
02
95
14.
LSI.I
PE
EPI.ISL.416477
EPI.IS
L.4140
22EPI
.ISL.4
13997
EPI.ISL.413650
EPI.ISL.415481
EPI.ISL.4154
92
EPI.ISL.415601
52
93
14.
LSI.I
PE
EPI.ISL.415583
EPI.ISL.415151
MT159717
EPI.ISL.416330
EPI.ISL.416390
EPI.ISL.416452
EPI.ISL.416334
EPI.ISL.414636
EPI.ISL.415703
EPI.ISL.415532
EPI.ISL.413592
EPI.ISL.413563
EPI.ISL.413559
EPI.ISL.415539
06
46
14.
LSI.I
PE
EPI.ISL.413560
EPI.ISL.415743
CNA0007332
EPI.ISL.416468
EPI.ISL.
415435
EPI.ISL.416441
EPI.ISL.416510
MT039873
MT163719
EPI.ISL.416406
EPI.ISL.410715
EPI.ISL.414637
EPI.IS
L.4145
27
EPI.ISL.416322
MT039887
EPI.ISL.416430
EPI.ISL.415604
EPI.ISL.412979
EPI.ISL.416494
EPI.ISL.414555
EPI.IS
L.4145
69
EPI.ISL.414623
EPI.ISL.412972
MT1637
16
EPI.ISL.414593
EPI.ISL.414641
MT093631
EPI.ISL.414545
EPI.ISL.416323
EPI.ISL.
415630
EPI.ISL.414940
EPI.ISL.
416031
EPI.ISL.408484
EPI.ISL.416367
EPI.ISL.410713
EPI.ISL.411218
EPI.ISL.414587
EPI.ISL.416362
EPI.ISL.416433
MT159705
EPI.ISL.410536
EPI.ISL.413572
EPI.ISL.413017
EPI.ISL.414523
EPI.ISL.410535
EPI.IS
L.4136
02
EPI.ISL.414468
EPI.ISL.416469
EPI.ISL.416429
EPI.ISL.415611
MT135043
EPI.ISL.413213
EPI.ISL.410718
EPI.ISL.413489
MT123290
CNA0007335
EPI.ISL.416454
MT121215
EPI.ISL.416471
EPI.ISL.415487
EPI.IS
L.4163
25
MT188339
EPI.IS
L.4129
74
EPI.ISL.416502
EPI.ISL.414559
EPI.ISL.414476
MT066175
EPI.ISL.415622
EPI.ISL.
416499
26
25
79
NM
EPI.ISL.415503
EPI.ISL.416046
EPI.ISL.410537
EPI.ISL.414687
EPI.ISL.412028
MT027064
EPI.ISL.416491
EPI.ISL.416444
EPI.ISL.406534
EPI.ISL.413647
EPI.ISL.414562
72
24
04.
LSI
.IP
E
MT184913
MN996527
EPI.ISL.413589
EPI.ISL.415624
EPI.ISL.416495
EPI.IS
L.4146
46
EPI.ISL.414692
EPI.ISL.416438
EPI.ISL.41
5706
EPI.ISL.414500
EPI.ISL.414551
EPI.ISL.4156
31
MT106052
CNA0007334
NMDC60013002-06
EPI.ISL.407894
EPI.IS
L.4144
57
EPI.ISL.407976
EPI.ISL.413558
MT184912
EPI.ISL.416439
EPI.ISL.41
6032
EPI.ISL.
413603
EPI.ISL.406536
EPI.IS
L.4112
20
EPI.ISL.
415652
EPI.ISL.416340
EPI.ISL.413513
EPI.ISL.414941
EPI.ISL.416355
EPI.ISL.416443
EPI.ISL.414642
EPI.ISL.408481
EPI.ISL.416324
EPI.ISL.416370
EPI.ISL.416335
LC528233
EPI.ISL.416496
39
56
04.
LSI
.IP
E
EPI.ISL.414549
EPI.ISL.412983
MT163717
EPI.ISL.407071
EPI.ISL.416365
EPI.ISL.415648
EPI.ISL.
412116
EPI.ISL.416044
99
44
14.
LSI
.IP
E
EPI.ISL.416511
EPI.ISL.410984
88
43
14.
LSI
.IP
E
EPI.ISL.416028
88
46
14.
LSI.I
PE
EPI.ISL.416435
MT0661
56
EPI.ISL.416141
MT159715
EPI.ISL.408482
EPI.ISL.415500
48
55
14.
LSI.I
PE
EPI.ISL.416467
EPI.ISL.410720
EPI.ISL.413023
EPI.ISL.415702
EPI.ISL.413594
EPI.ISL.415535
MT184910
EPI.ISL.415454
EPI.ISL.403932
EPI.IS
L.4156
42
EPI.ISL.416033
LC522973
EPI.ISL.415616
MT044257
06
011
4.L
SI.I
PE
EPI.ISL.407073
EPI.ISL.415660
MN996529
EPI.ISL.415649
EPI.ISL.413485
EPI.ISL.416366
EPI.ISL.408515
EPI.ISL.406596
EPI.ISL.413515
MN996530
MT019531
EPI.ISL.416437
MT093571
EPI.ISL.416337
EPI.ISL.416472
MT027062
EPI.ISL.414689
EPI.ISL.415617
EPI.ISL.
414007
EPI.ISL.406597
EPI.ISL.416432
EPI.ISL.406594
EPI.ISL.413021
EPI.ISL.416457
95
42
14.
LSI
.IP
E
EPI.ISL.41
3019
EPI.ISL.413584
EPI.ISL.414591
EPI.ISL.414628
EPI.ISL.413566
24
06
14.
LSI
.IP
E
EPI.ISL.41
5705
EPI.ISL.416332
EPI.IS
L.4145
28
EPI.ISL.416382
EPI.ISL.416339
EPI.ISL.416440
16
46
14.
LSI.I
PE
MT123292
LC522972
EPI.ISL.413593
EPI.ISL.416326
EPI.ISL.416445
MT159708
EPI.ISL.414012
EPI.ISL.415479
MT019530
MN988668
EPI.ISL.416318
MT159719
MT159722
EPI.IS
L.4156
43
EPI.ISL.416476
EPI.IS
L.4130
24
EPI.ISL.416140
EPI.ISL.416428
90
54
14.
LSI.I
PE
EPI.ISL.415155
EPI.ISL.415527
EPI.ISL.416506
EPI.ISL.4155
12
EPI.ISL.415594
MT188340
EPI.ISL.415105
EPI.ISL.412898
EPI.ISL.415591
EPI.ISL.415505
EPI.ISL.4164
81
EPI.ISL.
414009
EPI.ISL.404228
EPI.ISL.414592
EPI.ISL.
415651
EPI.IS
L.4120
29
EPI.ISL.415475
EPI.ISL.414629
EPI.IS
L.4164
80
EPI.ISL.411066
EPI.ISL.413596
MT049951
EPI.ISL.416405
50
54
14.
LSI.I
PE
EPI.ISL.416316
EPI.ISL.416372
EPI.ISL.416034
EPI.IS
L.4145
17
EPI.ISL.414684
EPI.IS
L.4165
03
EPI.ISL.415462
EPI.ISL.408480
EPI.ISL.414577
EPI.ISL.416475
EPI.ISL.415600
EPI.ISL.413597
EPI.IS
L.4146
48
EPI.ISL.415541
MT123291
NMDC60013002-10
EPI.ISL.415460
MT159706
EPI.ISL.415506
EPI.ISL.414552
EPI.ISL.413579
EPI.IS
L.4156
55
MT184911
EPI.ISL.
415478
EPI.ISL.407988
66
46
14.
LSI.I
PE
EPI.ISL.414558
EPI.ISL.416047
EPI.ISL.415461
EPI.ISL.415741
EPI.ISL.406862
EPI.ISL.415629
EPI.ISL.408431
EPI.ISL.406531
MT050493
EPI.ISL.415510
EPI.ISL.412980
MN997409
EPI.ISL.416470
EPI.ISL.416478
EPI.ISL.414616
EPI.ISL.412869
EPI.ISL.403934
EPI.ISL.416409
13
93
14.
LSI.I
PE
EPI.ISL.416319
EPI.ISL.416321
EPI.ISL.411927
EPI.ISL.416029
EPI.ISL.414520
EPI.ISL.414008
EPI.ISL.416474
EPI.ISL.415482
EPI.ISL.416509
EPI.ISL.416447
EPI.ISL.415459
EPI.ISL.412871
75
65
14.
LSI.I
PE
EPI.ISL.416505
EPI.ISL.406538
LC528232
EPI.ISL.414435
EPI.ISL.416401
EPI.ISL.415708
EPI.ISL.416387
EPI.ISL.416341
EPI.ISL.413858
EPI.ISL.416399
EPI.ISL.415472
59
45
14.
LSI.I
PE
EPI.ISL.414643
MT159709
EPI.ISL.414452
MT118835
MT163720
EPI.IS
L.4140
21
EPI.ISL.41
3016
EPI.ISL.416407
EPI.ISL.415159
MN908947_2019/12/30_China/Hubei/Wuhan
MN938384_2020/01/10_China/Guangdong/Shenzhen ne
hzn
eh
S/g
no
dg
na
uG/
ani
hC
_11
/1
0/0
20
2_
26
25
79
NM
ytn
uo
Chsi
mo
ho
nS/
not
gni
hs
aW/
set
atS
deti
nU
_9
1/1
0/0
20
2_
52
35
89
NM
MN988713_2020/01/21_UnitedStates/Illinois/Chicago
MN994467_2020/01/23_UnitedStates/California/LosAngeles
MN994468_2020/01/22_UnitedStates/California/OrangeCounty
MN997409_2020/01/22_UnitedStates/Arizona/Phoenix
MN988668_2020/01/02_China/Hubei/Wuhan
CNA0007332_2019/12/26_China/Hubei/Wuhan
CNA0007334_2020/01
/01_China/Hubei/Wu
han
CNA0007335_2020/01/05_China/Hubei/Wuhan
MT007544_2020/01/25_Australia/Victoria/Clayton
NMDC60013002-06_2019/12/30_C
hina/Hubei/Wuhan
na
hu
W/ie
bu
H/a
nih
C_
70/
10/
02
02
_7
0-2
00
31
00
6C
DM
N
NMDC60013002-08_2019/12/30_China/Hubei/Wuhan
NMDC60013002-09_2020/01/01_China/Hubei/Wuhan
NMDC60013002-10_2019/12/30_China/Hubei/WuhanLC521925_2020/01/25_Japan/Aichi
MT027062_2020/01/29_UnitedStates/California
MT027064_2020/01/29_UnitedStates/California
MT066175_2020/02/01_China/Taiwan/Taipei
oyk
oT/
na
paJ
_9
2/1
0/0
20
2_
37
92
25
CLLC522974_2020/01/31_Japan/Tokyo
LC522975_2020/01/31_Japan/Tokyo
LC522972_2020/01/29_Japan/Kyoto
MT039887_2020/01/31_UnitedStates/Wisconsin
MT044258_2020/01/27_UnitedStates/California
MT044257_2020/01/28_UnitedStates/Illinois
MT049951_2020/01/17_China/Yunnan
MT066176_2020/02/05_China/Taiwan/Taipei
MT072688_2020/01/13_Nepal/Kathmandu
MT093631_2020/01/08_China
MT093571_2020/02/07_Sweden
MT106052_2020/02/06_UnitedStates/California
MT106053_2020/02/10_UnitedStates
/California
sa
xe
T/s
etat
Sd
eti
nU
_11
/2
0/0
20
2_
45
06
01
TM
MT118835_2020/02/23_UnitedStates/California/Solano
MT123291_2020/01/29_China/Guangdong/Guangzhou
MT123290_2020/02/05_China/Guangdong/Guangzhou
LC528233_2020/02/10_Japan/KanagawaLC528232_2020/02/10_Japan/Kanagawa
MT152824_2020/02/24_UnitedStates/Washington/SnohomishCounty
MT126808_2
020/02/28_
Brazil
MT1637
16_202
0/02/2
7_Unit
edStat
es/Was
hingto
n
MT135041_2020/01/26_China/Beijing
MT135043_2020/01/28_China/Beijing
MT163717_2020/02/28_UnitedStates/Washington
MT163718_2020/02/29_UnitedStates/Washington
MT163719_2020/03/01_UnitedStates/Washington
MT163720_2020/03/01_UnitedStates/Washington
MT050493_2020/01/31_India/KeralaState
MT012098_2020/01/27_India/KeralaState
MT159717_2020/02/17_UnitedStates
MT159718_2020/02/18_UnitedStates
MT159719_2020/02/18_UnitedStates
MT159720_2020/02/21_UnitedStates
MT159722_2020/02/21_UnitedStates
MT159705_2020/02/17_UnitedStates
MT159706_2020/02/17_UnitedStates
MT159707_2020/02/17_UnitedStates
MT159708_2020/02/17_UnitedStates
MT159709_2020/02/20_UnitedStates
MT159712_2020/02/25_UnitedStates
MT159715_2020/02/24_UnitedStates
MT159716_2020/02/24_UnitedStates
MT123292_2020/01/27_China/Guangdong/Guangzhou
MT123293_2020/01/29_China/Guangdong/Guangzhou
MT121215_2020/02/02_China/Shanghai
MT0661
56_202
0/01/3
0_Ital
y
MT184908_2020/02/21_UnitedStates
MT184910_2020/02/18_UnitedStates
MT184911_2020/02/17_UnitedStatesMT184912_2020/02/17_UnitedStates
MT184913_2020/02/24_UnitedStates
MT188339_2020/03/09_UnitedStates/MN
MT188340_2020/03/07_UnitedStates/MN
MT188341_2020/03/05_UnitedStates/MN
MN996527_2019/12/30_Ch
ina/Hubei/Wuhan
MN996529_2019/12/30_China/Hubei/Wuhan
na
hu
W/ie
bu
H/a
nih
C_
03/
21/
91
02
_0
35
69
9N
M
MN996531_2019/12/30_China/Hubei/Wuhan
MT019529_2019/12/23_China/Hubei/Wuhan
MT019530_2019/12/30_China/Hubei/Wuhan
MT019531_2019/12/30_China/Hubei/Wuhan
MT019532_2019/12/30_China/Hubei/Wuhan
MT019533_2020/01/01_China/Hubei/Wuhan
uo
hz
gn
aH/
gn
aije
hZ/
ani
hC
_0
2/1
0/0
20
2_
37
89
30
TM
na
hu
W/ie
bu
H/a
nih
C/ai
sA
_0
3/2
1/9
10
2_
23
12
04
_L
SI_I
PE
EPI_ISL_403932_2020/01/14_Asia/China/Guandong/Shenzhen
EPI_ISL_403934_202
0/01/15_Asia/China
/Guandong/Shenzhen
EPI_ISL_403936_2020/01/17_Asia/China/Guangdong/Zhuhai
iru
ba
htn
oN/
dn
alia
hT/
ais
A_
80/
10/
02
02
_2
69
30
4_
LSI
_IP
E
EPI_ISL_403963_2020/01/13_Asia/Thailand/Nonthaburi
EPI_ISL_404227_2020/01/16_Asia/China/Zhejiang
EPI_ISL_404228_2020/01/17_Asia/China/Zhejiang
EPI_IS
L_4060
31_202
0/01/2
3_Asia
/Taiwa
n/Kaoh
siung
EPI_ISL_406531_2020/01/22_Asia/China/Guangdong/Zhuhai
EPI_ISL_406533_2020/01/22_Asia/China/Guangdong/Guangzhou
EPI_ISL_406534_2020/01/22_Asia/China/Guangdong/Foshan
EPI_ISL_406535_2020/01/22_Asia/China/Guangdong/Foshan
EPI_ISL_406536_2020/01/22_Asia/China/Guangdong/Foshan
EPI_ISL_406538_2020/01/23_Asia/China/Guangdong
EPI_ISL_406592_2020/01/13_Asia/China/Guandong/Shenzhen
ne
hzn
eh
S/g
no
dn
au
G/a
nih
C/ai
sA
_3
1/1
0/0
20
2_
39
56
04
_L
SI_I
PE
EPI_ISL_406594_2020/01/16_Asia/China/Guandong/Shenzhen
EPI_ISL_406595_2020/
01/16_Asia/China/Gua
ndong/Shenzhen
EPI_ISL_406596_2020/01/23_Europe/France/Ile-de-France/Paris
EPI_ISL_406597_2020/01/23_Europe/France/Ile-de-France/Paris
EPI_ISL_406862_2020/01/28_Europe/Germany/Bavaria/Munich
EPI_ISL_406973_2020/01/23_Asia/Singapore
EPI_ISL_407071_2020/01/29_Europe/UnitedKingdom/England
EPI_ISL_407073_2020/01/29_Europe/UnitedKingdom/England
EPI_ISL_407193_2020/01/25_Asia/SouthKorea/Gyeonggi-do
EPI_ISL_407313_2020/01/19_Asia/China/Zhejiang/Hangzhou
EPI_ISL_407893_2020/01/24_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_407894_2020/01/28_Oceania/Australia/Queensland/GoldCoast
EPI_ISL_407896_2020/01/30_Oceania/Australia/Queensland/GoldCoast
EPI_ISL_407976_2020/02/03_Europe/Belgium/Leuven
EPI_ISL_407987_2020/01/25_Asia/Singapore
EPI_ISL_407988_2020/02/01_Asia/Singapore
EPI_ISL_408430_2020/01/29_Europe/France/Ile-de-France/Paris
EPI_ISL_408431_2020/01/29_Europe/France/Ile-de-France/Paris
na
uh
cg
no
Y/q
niq
gn
oh
C/a
nih
C/ai
sA
_1
2/1
0/0
20
2_
87
48
04
_L
SI_I
PE
EPI_ISL_408479_2020/01/23_Asia/China/Chongqing/Zhongxian
EPI_ISL_408480_2020/01/17_Asia/China/Yunnan/Kunming
EPI_ISL_408481_2020/01/18_Asia/China/Chongqing
EPI_ISL_408482_2020/01/19_Asia/China/Shandong/Qingdao
EPI_ISL_408484_2020/01/15_Asia/China/Sichuan/Chengdu
EPI_ISL_408485_2020/01/18_Asia/China/Beijing
EPI_ISL_408486_2020/01/11_Asia/China/J
iangxi/Pingxiang
EPI_ISL_408488_2020/01/19_Asia/China/Jiangsu/Huaian
EPI_ISL_408514_2020/01/01_Asia/China/Hubei/Wuhan
EPI_ISL_408515_2020/01/01_Asia/China/Hubei/Wuhan
EPI_ISL_408668_2020/01/24_Asia/Vietnam/ThanhHoa
EPI_ISL_408976_2020/01/22_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_408977_2020/01/25_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_409067_2020/01/29_NorthAmerica/USA/Massachusetts
EPI_ISL_410531_2020/01/25_Asia/Japan/Nara
EPI_ISL_410535_2020/02/03_Asia/Singapore
EPI_ISL_410536_2020/02/06_Asia/Singapore
EPI_ISL_410537_2020/02/09_Asia/Singapore
EPI_IS
L_4105
46_202
0/01/3
1_Euro
pe/Ita
ly/Rom
e
EPI_ISL_410713_2020/01/27_Asia/Singapore
EPI_ISL_410714_2020/02/03_Asia/Singapore
EPI_ISL_410715_2020/02/04_Asia/Singapore
EPI_ISL_410716_2020/02/04_Asia/Singapore
EPI_ISL_410717_2020/02/05_Oceania/Australia/Queensland/GoldCoast
EPI_ISL_410718_2020/02/05_Oceania/Australia/Queensland/GoldCoast
EPI_ISL_410719_2020/02/02_Asia/Singapore
EPI_ISL_410720_2020/01/23_Europe/France/Ile-de-France/Paris
EPI_ISL_410984_2020/01/29_Europe/France/Ile-de-France/Paris
naij
uF/
ani
hC/
ais
A_
12/
10/
02
02
_0
60
114
_L
SI_I
PE
EPI_ISL_411066_2020/01/22_Asia/China/Fujian
EPI_ISL_411218_2020/02/02_Europe/France/Ile-de-France/Paris
EPI_ISL_411
219_2020/01/28_Europe/France/Ile-de-France/Paris
EPI_IS
L_4112
20_202
0/01/2
8_Euro
pe/Fra
nce/Il
e-de-F
rance/
Paris
EPI_ISL_411902_2020/01/27_Asia/Cambodia/Sihanoukville
EPI_ISL_411915_2020/01/25_Asia/Taiwan/Taoyuan
EPI_ISL_411926_2020/01/24_Asia/Taiwan/Taipei
EPI_ISL_411927_2020/01/28_Asia/Taiwan/Taipei
EPI_ISL_411950_2020/01/23_Asia/China/Jiangsu
EPI_ISL_411952_2020/01/24_Asia/China/Jiangsu
EPI_ISL_412026_2020/02/23_Asia/China/Anhui/Hefei
EPI_ISL_412028_2020/01/22_Asia/HongKong
EPI_IS
L_4120
29_202
0/01/3
0_Asia
/HongK
ong
EPI_ISL_412030_2020/02/01_Asia/HongKong
EPI_ISL_
412116_2
020/02/0
9_Europe
/UnitedK
ingdom/E
ngland
uo
hzg
niJ/i
eb
uH/
ani
hC/
ais
A_
80/
10/
02
02
_9
54
21
4_
LSI
_IP
E
EPI_ISL_412869_2020/01/30_Asia/SouthKorea/Seoul
EPI_ISL_412870_2020/01/30_Asia/SouthKorea/Seoul
EPI_ISL_412871_2020/01/31_Asia/SouthKorea/Seoul
EPI_ISL_412872_2020/02/01_Asia/SouthKorea/Gyeonggi-do
EPI_ISL_412873_2020/02/06_Asia/SouthKorea/Chungcheongnam-do
EPI_ISL_412898_2019/12/3
0_Asia/China/Hubei/Wuhan
EPI_ISL_412912_2020/02/25_Europe/Germany/Baden-Wuerttemberg
EPI_ISL_412972_2020/02/27_NorthAmerica/Mexico/MexicoCity
EPI_ISL_412973_2
020/02/20_Europe
/Italy/Lombardy
EPI_IS
L_4129
74_202
0/01/2
9_Euro
pe/Ita
ly/Rom
e
EPI_ISL_412975_2020/02/28_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_412978_2020/01/17_Asia/China/Hubei/Wuhan
EPI_ISL_412979_2020/01/18_Asia/China/Hubei/Wuhan
EPI_ISL_412980_2020/01/18_Asia/China/Hubei/Wuhan
EPI_ISL_412981_2020/01/18_Asia/China/Hubei/Wuhan
EPI_ISL_412982_2020/02/07_Asia/China/Hubei/Wuhan
EPI_ISL_412983_2020/02/08_Asia/China/Hubei/Tianmen
EPI_ISL_413014_2020/01/25_NorthAmerica/Canada/Ontario
EPI_ISL_413015_2020/01/23_NorthAmerica/Canada/Ontario
EPI_ISL_41
3016_2020/
02/28_Sout
hAmerica/B
razil/SaoP
aulo/SaoPa
ulo
EPI_ISL_413017_2020/02/06_Asia/SouthKorea
EPI_ISL_413018_2020/02/06_Asia/SouthKorea
EPI_ISL_41
3019_2020/
02/26_Euro
pe/Switzer
land/Zuric
h
EPI_ISL_413020_2020/02/27_Europe/Switzerland/Zurich
EPI_ISL_413021_2020/02/29_Europe/Switzerland/Zurich
EPI_ISL_413022_2020/02/29_Europe/Switzerland/Zurich
EPI_ISL_413023_2020/02/29_Europe/Switzerland/Zurich
EPI_IS
L_4130
24_202
0/02/2
9_Euro
pe/Swi
tzerla
nd/Zur
ich
EPI_ISL_413213_2020/02/29_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_413214_2020/02/29_Oceania/Australia/NewSouthWales/Sydney
EPI_ISL_413456_2020/02/20_NorthAmerica/USA/Washington/KingCounty
EPI_ISL_413459_2020/02/20_Asia/Japan/Tokyo
EPI_ISL_413485_2020/01/24_Asia/China/Anhui/Suzhou
tcir
tsi
Dgr
ebs
nie
H/ail
ah
pts
eW
eni
hR
htr
oN/
yn
amr
eG/
ep
oru
E_
82/
20/
02
02
_8
84
31
4_
LSI
_IP
E
EPI_ISL_413489_202
0/03/03_Europe/Ita
ly/Lombardy/Milan
EPI_ISL_413490_2020/02/27_Oceania/NewZealand/Auckland
EPI_ISL_413513_2020/02/27_Asia/SouthKorea
EPI_ISL_413515_2020/02/27_Asia/SouthKorea
EPI_ISL_413555_2020/02/27_Europe/UnitedKingdom/Wales
EPI_ISL_413558_2020/02/27_NorthAmerica/USA/California/SolanoCounty
EPI_ISL_413559_2020/02/27_NorthAmerica/USA/California/SolanoCounty
EPI_ISL_413560_2020/02/28_NorthAmerica/USA/Washington
EPI_ISL_413563_2020/03/03_NorthAmerica/USA/Washington
EPI_ISL_413566_2020/03/02_Europe/Netherlands/Blaricum
EPI_ISL_413572_2020/03/01_Europe/Netherlands/Haarlem
EPI_ISL_413573_2020/03/02_Europe/Netherlands/HardinxveldGiessendam
EPI_ISL_413577_2020/03/02_Europe/Netherlands/Naarden
EPI_ISL_413579_2020/03/03_Europe/Netherlands/Nootdorp
EPI_ISL_413580_2020/03/02_Europe/Netherlands/Oisterwijk
EPI_ISL_413582_2020/03/01_Europe/Netherlands/Rotterdam
EPI_ISL_413584_2020/03/03_Europe/Netherlands/Rotterdam
EPI_ISL_413589_2020/03/01_Europe/Netherlands/Utrecht
EPI_ISL_413592_2020/03/02_
Asia/Taiwan/Taipei
EPI_ISL_413593_2020/02/29_Europe/Luxembourg
EPI_ISL_413594_2020/02/28_Oceania/Australia/NSW/Sydney
EPI_ISL_413595_2020/02/28_Oceania/Australia/NSW/Sydney
EPI_ISL_413596_2020/02/28_Oceania/Australia/NSW/Sydney
EPI_ISL_413597_2020/03/02_Oceania/Australia/NSW/Sydney
EPI_ISL_413599_2020/03/04_Oceania/Australia/NSW/Sydney
EPI_ISL_413600_2020/03/03_Oceania/Australia/NSW/Sydney
EPI_IS
L_4136
02_202
0/03/0
3_Euro
pe/Fin
land/H
elsink
i
EPI_ISL_
413603_2
020/03/0
3_Europe
/Finland
/Helsink
i
EPI_ISL_413604_2020/03/03_Europe/Finland/Helsinki
EPI_ISL_413647_2020/03/01_Europe/Portugal
EPI_ISL_
413648_2
020/03/0
1_Europe
/Portuga
l
EPI_ISL_413650_2020/03/05_NorthAmerica/USA/Washington
EPI_ISL_413651_2020/03/05_NorthAmerica/USA/Washington
EPI_ISL_413653_2020/03/05_NorthAmerica/USA/Washington
EPI_ISL_413853_2020/01/30_Asia/China/Guangdong
EPI_ISL_413854_2020/01/30_Asia/China/Guangdong
EPI_ISL_413857_2020/01/31_Asia/China/GuangdongEPI_ISL_413858_2020/01/30_Asia/China/Guangdong
EPI_ISL_413860_2020/01/30_Asia/China/Guangdong
EPI_ISL_413861_2020/02/01_Asia/China/Guangdong
EPI_ISL_413863_2020/02/01_Asia/China/Guangdong
EPI_ISL_413864_2020/02/09_Asia/China/Guangdong
ocsic
nar
Fn
aS/
ainr
ofila
C/A
SU/
acir
em
Ahtr
oN
_5
0/3
0/0
20
2_
52
93
14
_L
SI_I
PE
oc
sic
nar
Fn
aS/
ainr
ofila
C/A
SU/
acire
mA
htro
N_
50/
30/
02
02
_8
29
31
4_
LSI
_IP
Eoc
sic
nar
Fn
aS/
ainr
ofila
C/A
SU/
acire
mA
htro
N_
50/
30/
02
02
_1
39
31
4_
LSI
_IP
E
EPI_ISL_413996_2
020/02/24_Europe
/Switzerland/Tes
sin
EPI_IS
L_4139
97_202
0/02/2
6_Euro
pe/Swi
tzerla
nd/Gen
eva
EPI_IS
L_4139
99_202
0/02/2
7_Euro
pe/Swi
tzerla
nd/Arg
ovie
EPI_ISL_
414005_2
020/02/2
5_Europe
/UnitedK
ingdom/E
ngland
EPI_ISL_
414006_2
020/02/2
8_Europe
/UnitedK
ingdom/E
ngland
EPI_ISL_
414007_2
020/02/2
8_Europe
/UnitedK
ingdom/E
ngland
EPI_ISL_414008_2020/02/27_Europe/UnitedKingdom/England
EPI_ISL_
414009_2
020/02/2
5_Europe
/UnitedK
ingdom/E
ngland
EPI_ISL_41
4011_2020/
02/26_Euro
pe/UnitedK
ingdom/Eng
land
EPI_ISL_414012_2020/02/27_Europe/UnitedKingdom/England
EPI_ISL_4140
19_2020/02/2
7_Europe/Swi
tzerland/Gen
eva
EPI_ISL_4140
20_2020/02/2
7_Europe/Swi
tzerland/Gen
eva
EPI_IS
L_4140
21_202
0/02/2
7_Euro
pe/Swi
tzerla
nd/Bas
el
EPI_IS
L_4140
22_202
0/02/2
7_Euro
pe/Swi
tzerla
nd/Gen
eva
EPI_ISL_414023_2020/03/01_Europe/Switzerland/Vaud
EPI_IS
L_4140
45_202
0/03/0
4_Sout
hAmeri
ca/Bra
zil/Ri
odeJan
eiro/R
iodeJa
neiro
EPI_ISL_414428_2020/03/02_Europe/Netherlands/NoordBrabant
EPI_ISL_414429_2020/03/02_Europe/Netherlands/NoordBrabant
EPI_ISL_414435_2020/03/03_Europe/Netherlands/Utrecht
EPI_IS
L_4144
45_202
0/03/0
3_Euro
pe/Net
herlan
ds/Zui
dHolla
nd
EPI_ISL_41
4451_2020/
03/06_Euro
pe/Netherl
ands/Noord
Brabant
EPI_ISL_414452_2020/03/06_Europe/Netherlands/NoordBrabant
EPI_IS
L_4144
57_202
0/03/0
6_Euro
pe/Net
herlan
ds/Noo
rdBrab
ant
EPI_ISL_414467_2020/03/05_Europe/Netherlands/ZuidHolland
EPI_ISL_414468
_2020/03/06_Eu
rope/Netherlan
ds/ZuidHolland
EPI_ISL_414470_2020/03/06_Europe/Netherlands/ZuidHolland
EPI_ISL_414476_2020/02/29_NorthAmerica/USA/NewYork
EPI_ISL_414487_2020/03/04_Europe/Ireland/Cork t
cirtsiD
gre
bsni
eH/
aila
hpts
eW
eni
hR
htro
N/yn
amr
eG/
ep
oru
E_
52/
20/
02
02
_7
94
41
4_
LSI
_IP
E tcir
tsi
Dgr
eb
sni
eH/
aila
hpt
se
We
nih
Rht
ro
N/y
na
mre
G/e
por
uE
_6
2/2
0/0
20
2_
99
44
14
_L
SI_I
PE
EPI_ISL_414500_2020/03/04_Europe/UnitedKingdom/England/Yorkshire/Sheffield
tcirtsiD
gre
bsni
eH/
aila
hpts
eW
eni
hR
htro
N/yn
amr
eG/
ep
oru
E_
72/
20/
02
02
_5
05
41
4_
LSI
_IP
Etcirt
siD
gre
bsni
eH/
aila
hpts
eW
eni
hR
htro
N/yn
amr
eG/
ep
oru
E_
82/
20/
02
02
_9
05
41
4_
LSI
_IP
E
EPI_IS
L_4145
17_202
0/02/2
5_Asia
/HongK
ong
EPI_ISL_414520_2020/03/02_Europe/Germany/Munich
hci
nu
M/y
na
mre
G/e
por
uE
_2
0/3
0/0
20
2_
12
54
14
_L
SI_I
PE
EPI_ISL_414522_2020/02/28_Europe/UnitedKingdom/England
EPI_ISL_414523_2020/02/27_Europe/UnitedKingdom/England
EPI_IS
L_4145
24_202
0/03/0
1_Euro
pe/Uni
tedKin
gdom/E
ngland
EPI_ISL_
414525_2
020/03/0
2_Europe
/UnitedK
ingdom/E
ngland
EPI_ISL_414526_2020/03/03_Europe/UnitedKingdom/England
EPI_IS
L_4145
27_202
0/02/0
9_Asia
/HongK
ong
EPI_IS
L_4145
28_202
0/02/1
0_Asia
/HongK
ong
EPI_ISL_414536_2020/03/08_Europe/Netherlands/NoordBrabant
EPI_ISL_414537_2020/03/08_Europe/Netherlands/NoordBrabant
EPI_ISL_414539_2020/03/08_Europe/Netherlands/NoordBrabant
EPI_ISL_414544_2020/03/09_Europe/Netherlands/NoordBrabant
EPI_ISL_414545_2020/03/03_Europe/Netherlands/NoordBrabant
EPI_ISL_414549_2020/03/03_Europe/Netherlands/NoordHolland
EPI_ISL_414551_2020/03/07_Europe/Netherlands/Utrecht
EPI_ISL_414552_2020/03/07_Europe/Netherlands/Utrecht
EPI_ISL_414554_2020/03/08_Europe/Netherlands/Utrecht
EPI_ISL_414555_2020/03/08_Europe/Netherlands/Utrecht
EPI_ISL_414558_2020/03/06_Europe/Netherlands/ZuidHolland
EPI_ISL_414559_2020/03/07_Europe/Netherlands/ZuidHolland
EPI_ISL_414562_2020/03/03_Europe/Netherlands/ZuidHolland
EPI_ISL_414563_2020/03/03_Europe/Netherlands/ZuidHolland
EPI_IS
L_4145
69_202
0/02/1
2_Asia
/HongK
ong
EPI_ISL_414571_2020/02/22_Asia/HongKong
EPI_ISL_414577_2020/03/02_SouthAmerica/Chile/Talca
EPI_ISL_414578_2020/03/04_SouthAmerica/Chile/Talca
EPI_ISL_414579_2020/03/03_SouthAmerica/Chile/Santiago
EPI_ISL_414580_2020/03/05_SouthAmerica/Chile/Santiago
EPI_ISL_414586_2020/03/03_Europe/Ireland/Limerick
EPI_ISL_414587_2020/03/03_Europe/Ireland/Limerick
EPI_ISL_414591_2020/03/06_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414592_2020/03/06_NorthAmerica/USA/Washington/Tacoma
EPI_ISL_414593_2020/03/05_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414595_2020/03/05_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414597_2020/03/04_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414600_2020/02/26_Europe/France/Grand-Est/Strasbourg
EPI_ISL_414601_2020/
02/27_Europe/France/
Normandie/Rouen
EPI_ISL_414616_2020/03/08_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414617_2020/03/08_NorthAmerica/USAEPI_ISL_414618_2020/03/08_NorthAmerica/USA
EPI_ISL_414620_2020/03/08_NorthAmerica/USA
EPI_ISL_414621_2020/03/08_NorthAmerica/USA/Washington/Kirkland
EPI_ISL_414623_2020/02/25_Europe/France/Grand-Est/Strasbourg
EPI_ISL_414624_2020/02
/26_Europe/France/Norm
andie/Rouen
EPI_ISL_414625_2020/02/26_Europe/France/PaysdelaLoire/Nantes
EPI_ISL_414626_2020/02/29_Europe
/France/HautsdeFrance/CrpyenValo
is
EPI_ISL_414627_2020/03/02_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414628_2020/03/02_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414629_2020/03/03_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414630_2020/03/03_Europe/France/HautsdeFrance/Compigne
EPI_ISL_
414631_2
020/03/0
4_Europe
/France/
Grand-Es
t/Reims
EPI_ISL_414632_2020/03/04_Europe/France/Grand-Est/Reims
EPI_ISL_414633_2020/03/04_Europe/France/Ile-de-France/Pontoise
EPI_ISL_414634_2020/03/04_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414635_2020/03/04_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414636_2020/03/04_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414637_2020/03/04_Europe/France/HautsdeFrance/Compigne
EPI_ISL_414638_2020/03/04_Europe/Franc
e/HautsdeFrance/Compigne
EPI_ISL_414641_2020/03/05_Europe/Finland
EPI_ISL_414642_2020/03/08_Europe/Finland
EPI_ISL_414643_2020/03/07_Europe/Finland
EPI_IS
L_4146
46_202
0/03/0
4_Euro
pe/Fin
land
EPI_IS
L_4146
48_202
0/03/1
1_Nort
hAmeri
ca/USA
/Calif
ornia/
SanDie
goCoun
ty
EPI_ISL_414663_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414684_2020/02/27_Asia/China/Guangdong/Guangzhou
EPI_ISL_414687_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414688_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414689_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414691_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414692_2020/02/25_Asia/China/Guangdong/Guangzhou
EPI_ISL_414938_2020/01/24_Asia/China/Shandong
EPI_ISL_414940_2020/01/25_Asia/China/Shandong
EPI_ISL_414941_2020/01/30_Asia/China/Shandong
EPI_ISL_415105_2020/03/04_SouthAmerica/Brazil/Bahia/FeiradeSantana
EPI_ISL_
415128_2
020/02/2
9_SouthA
merica/B
razil/Es
piritoSa
nto/Vila
Velha
EPI_ISL_415129_2020/02/29_Europe/UnitedKingdom/England
EPI_ISL_415151_2020/03/04_NorthAmerica/USA/NewYork
EPI_ISL_415153_2020/03/03_Europe/Belgium/Huldenberg
EPI_ISL_415154_2020/03/01_Europe/Belgium/Kraainem
EPI_ISL_415155_2020/03/01_Europe/Belgium/Huldenberg
EPI_ISL_415156_2020/03/01_Europe/Belgium/Huldenberg
EPI_ISL_415157_2020/03/01_Europe/Belgium/Sint-Niklaas
EPI_ISL_415158_2020/03/01_Europe/Belgium/Brussels
EPI_ISL_415159_2020/02/29_Europe/Belgium/Leuven
EPI_ISL_
415435_2
020/03/0
6_Europe
/UnitedK
ingdom/W
ales
EPI_ISL_415454_2020/02/28_Europe/Switzerland
EPI_IS
L_4154
55_202
0/02/2
8_Euro
pe/Swi
tzerla
nd
EPI_IS
L_4154
56_202
0/02/2
9_Euro
pe/Swi
tzerla
nd
EPI_ISL_415457_2020/02/29_Europe/Switzerland
EPI_ISL_415458_2020/03/01_Europe/Switzerland
EPI_ISL_415459_2020/02/29_Europe/Switzerland/Genve
EPI_ISL_415460_2020/03/09_Europe/Netherlands/Flevoland
EPI_ISL_415461_2020/03/10_Europe/Netherlands/Gelderland
EPI_ISL_415462_2020/03/09_Europe/Netherlands/Gelderland
EPI_ISL_415467_2020/03/10_Europe/Netherlands
EPI_ISL_415468_2020/03/10_Europe/Netherlands
EPI_ISL_415471_2020/03/11_Europe/Netherlands
EPI_ISL_415472_2020/03/11_Europe/Netherlands
EPI_ISL_415473_2020/03/09_Europe/Netherlands
EPI_ISL_415475_2020/03/12_Europe/Netherlands
EPI_ISL_415476_2020/03/10_Europe/Netherlands
EPI_ISL_
415478_2
020/03/0
8_Europe
/Netherl
ands
EPI_ISL_415479_2020/03/08_Europe/Netherlands
EPI_ISL_415481_2020/03/08_Europe/Netherlands
EPI_ISL_415482_2020/03/09_Europe/Netherlands
EPI_ISL_415485_2020/03/12_Europe/Netherlands
EPI_ISL_415486_2
020/03/13_Europe
/Netherlands
EPI_ISL_415487_2020/03/13_Europe/Netherlands
EPI_ISL_415488_2020/03/13_Europe/Netherlands
EPI_ISL_415491
_2020/03/07_Eu
rope/Netherlan
ds
EPI_ISL_4154
92_2020/03/1
0_Europe/Net
herlands
sd
nalr
eht
eN/
ep
oru
E_
01/
30/
02
02
_5
94
51
4_
LSI
_IP
E
EPI_ISL_415497_2020/03/09_Europe/Netherlands
EPI_ISL_415500_2020/03/10_Europe/Netherlands/NoordBrabant
EPI_ISL_415501_2020/03/11_Europe/Netherlands/NoordBrabant
EPI_ISL_415503_2020/03/11_Europe/Netherlands/NoordBrabant
EPI_ISL_415505_2020/03/11_Europe/Netherlands/NoordBrabant
EPI_ISL_415506_2020/03/11_Europe/Netherlands/NoordBrabant
EPI_ISL_415510_2020/03/09_Europe/Netherlands/NoordBrabant
EPI_ISL_4155
12_2020/03/0
9_Europe/Net
herlands/Noo
rdBrabant
EPI_ISL_415516_2020/03/11_Europe/Netherlands/NoordBrabant
EPI_ISL_415525_2020/03/12_Europe/Netherlands/NoordHolland
EPI_ISL_415527_2
020/03/12_Europe
/Netherlands/Utr
echt
dn
allo
Hdi
uZ/s
dn
alre
hte
N/e
por
uE
_9
0/3
0/0
20
2_
13
55
14
_L
SI_I
PE
EPI_ISL_415532_2020/03/09_Europe/Netherlands/ZuidHolland
EPI_ISL_415534_2020/03/11_Europe/Netherlands/ZuidHolland
EPI_ISL_415535_2020/03/12_Europe/Netherlands/ZuidHolland
EPI_ISL_415539_2020/03/13_NorthAmerica/USA/Utah
EPI_ISL_415541_2020/03/13_NorthAmerica/USA/Utah
EPI_ISL_415543_2020/03/13_NorthAmerica/USA/Utah
EPI_ISL_415581_2020/03/01_NorthAmerica/Canada/BritishColumbia
EPI_ISL_415583_2020/03/03_NorthAmerica/Canada/BritishColumbia
aib
mul
oC
hsitir
B/a
da
na
C/a
cire
mA
htro
N_
20/
30/
02
02
_4
85
51
4_
LSI
_IP
E
EPI_ISL_415591_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_415592_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415594_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_415595_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415596_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415597_2020/03/10_NorthAmerica/USA/Washington/Seattle
EPI_ISL_415598_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_415600_2020/03/09_NorthAmerica/USA
EPI_ISL_415601
_2020/03/10_No
rthAmerica/USA
/Washington
EPI_ISL_415602_2020/03/10_NorthAmerica/USA/Washington/Seattle
EPI_ISL_415603_2020/03/10_NorthAmerica/USA/Washington/Seattle
EPI_ISL_415604_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_415605_2020/03/05_NorthAmerica/USA/Washington
EPI_ISL_415606_2020/03/08_NorthAmerica/USA/Washington
EPI_ISL_415607_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415608_2020/03/08_NorthAmerica/USA/Washington
EPI_ISL_415609_2020/03/08_NorthAmerica/USA/Washington
EPI_ISL_415611_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415612_2020/03/09_NorthAmerica/USA/Washington/Seattle
EPI_ISL_415613_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415614_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415615_2020/03/08_NorthAmerica/USA/Washington
EPI_ISL_415616_2020/03/08_NorthAmerica/USA/WashingtonEPI_ISL_415617_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415619_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415620_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415621_2020/03/09_NorthAmerica/USA/Washington/Seattle
EPI_ISL_415622_2020/03/09_NorthAmerica/USA/WashingtonEPI_ISL_415624_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415625_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415626_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_415627_2020/03/09_NorthAmerica/USA/Washington
EPI_ISL_415629_2020/03/04_Europe/UnitedKingdom/Scotland
EPI_ISL_
415630_2
020/03/0
9_Europe
/UnitedK
ingdom/S
cotland
EPI_ISL_4156
31_2020/03/1
0_Europe/Uni
tedKingdom/S
cotland
EPI_ISL_415640_2020/03/08_Europe/UnitedKingdom/Scotland
EPI_IS
L_4156
41_202
0/02/2
7_Asia
/Georg
ia/Tbi
lisi
EPI_IS
L_4156
42_202
0/03/1
0_Asia
/Georg
ia/Tbi
lisi
EPI_IS
L_4156
43_202
0/03/1
0_Asia
/Georg
ia/Tbi
lisi
EPI_ISL_415646_2020/03/03_Europe/Italy
EPI_ISL_415648_2020/03/03_Europe/Italy
EPI_ISL_415649_2020/03/05_Europe/France/Hauts-de-France/Crpy-en-Valois
EPI_ISL_415650_2020/03/02_Europe/France/IDF
EPI_ISL_
415651_2
020/03/0
5_Europe
/France/
Bourgogn
e-France
-Comt/Mo
ntreux-C
hateau
EPI_ISL_
415652_2
020/03/0
5_Europe
/France/
Bourgogn
e-France
-Comt/Th
ise
EPI_ISL_415654_2020/03/09_Europe/France/HautsdeFrance/Compigne
EPI_IS
L_4156
55_202
0/03/1
2_Euro
pe/Uni
tedKin
gdom/W
ales
sel
aW/
mo
dg
niK
deti
nU/
ep
oru
E_
21/
30/
02
02
_7
56
51
4_
LSI
_IP
E
EPI_ISL_415658_2020/03/06_SouthAmerica/Chile/Santiago
EPI_ISL_415660_2020/03/07_SouthAmerica/Chile/Santiago
EPI_ISL_415698_2020/02/27_Europe/Switzerland
EPI_ISL_415700_2020/02/28_Europe/Switzerland
EPI_IS
L_4157
01_202
0/03/0
2_Euro
pe/Swi
tzerla
nd
EPI_ISL_415702_2020/03/02_Europe/Switzerland
EPI_ISL_415703_2020/03/01_Europe/Switzerland
EPI_ISL_41
5704_2020/
03/04_Euro
pe/Switzer
land
EPI_ISL_41
5705_2020/
03/06_Euro
pe/Switzer
land
EPI_ISL_41
5706_2020/
03/06_Euro
pe/Switzer
land
EPI_ISL_41
5707_2020/
03/08_Euro
pe/Switzer
land
EPI_ISL_415708
_2020/03/07_Eu
rope/Switzerla
nd
EPI_IS
L_4157
09_202
0/01/2
5_Asia
/China
/Hangz
hou
EPI_ISL_415711_2020/01/22_Asia/China/Hangzhou
EPI_ISL_415741_2020/02/26_Asia/Taiwan/Taoyuan
EPI_ISL_415743_2020/02/27_Asia/Taiwan/Taoyuan
sel
aW/
mo
dg
niK
deti
nU/
ep
oru
E_
21/
30/
02
02
_0
29
51
4_
LSI
_IP
E
EPI_ISL_416024_2020/03/11_Europe/UnitedKingdom/Wales
EPI_ISL_416028_2020/03/03_SouthAmerica/Brazil/SaoPaulo/SaoPauloEP
I_ISL_416029_2020/03/04_SouthAmerica/Brazil/SaoPaulo/SaoPaulo
EPI_ISL_
416031_2
020/03/0
4_SouthA
merica/B
razil/Sa
oPaulo/S
aoPaulo
EPI_ISL_41
6032_2020/
03/04_Sout
hAmerica/B
razil/Dist
ritoFedera
l/Brasilia
olu
aP
oa
S/ol
ua
Po
aS/liz
arB/
acire
mA
htu
oS
_3
0/3
0/0
20
2_
33
06
14
_L
SI_I
PE
EPI_ISL_416034_2020/03/04_SouthAmerica/Brazil/SaoPaulo/SaoPaulo
EPI_ISL_416035_2020/03/05_SouthAmerica/Brazil/SaoPaulo/SaoPaulo
EPI_ISL_416036_2020/03/05_SouthAmerica/Brazil/SaoPaulo/SaoPaulo
uo
hzg
na
H/a
nih
C/ai
sA
_6
2/1
0/0
20
2_
24
06
14
_L
SI_I
PE
EPI_ISL_416044_2020/01/25_Asia/China/Hangzhou
EPI_ISL_416046_2020/01/24_Asia/China/Hangzhou
EPI_ISL_416047_2020/02/02_Asia/China/Hangzhou
EPI_ISL_416140_2020/03/02_Europe/Denmark/Copenhagen
EPI_ISL_416141_2020/03/03_Europe/Denmark/Copenhagen
EPI_ISL_416316_2020/01/25_Asia/China/Shanghai
EPI_ISL_416317_2020/01/25_Asia/China/Shanghai
EPI_ISL_416318_2020/01/28_Asia/China/Shanghai
EPI_ISL_416319_2020/01/28_Asia/China/Shanghai
EPI_ISL_416320_2020/01/28_Asia/China/Shanghai
EPI_ISL_416321_2020/01/28_Asia/China/Shanghai
EPI_ISL_416322_2020/01/29_Asia/China/Shanghai
EPI_ISL_416323_2020/01/29_Asia/China/Shanghai
EPI_ISL_416324_2020/01/29_Asia/China/Shanghai
EPI_IS
L_4163
25_202
0/02/0
2_Asia
/China
/Shang
hai
EPI_ISL_416326_2020/01/30_Asia/China/Shanghai
EPI_ISL_416330_2020/02/02_Asia/China/Shanghai
EPI_ISL_416332_2020/01/30_Asia/China/Shanghai
EPI_ISL_416333_2020/01/30_Asia/China/Shanghai
EPI_ISL_416334_2020/02/06_Asia/China/Shanghai
EPI_ISL_416335_2020/02/02_Asia/China/Shanghai
EPI_ISL_416336_2020/02/01_Asia/China/Shanghai
EPI_ISL_416337_2020/02/04_Asia/China/Shanghai
EPI_ISL_416338_2020/02/01_Asia/China/Shanghai
EPI_ISL_416339_2020/02/03_Asia/China/Shanghai
EPI_ISL_416340_2020/02/02_Asia/China/Shanghai
EPI_ISL_416341_2020/02/01_Asia/China/Shanghai
EPI_ISL_416342_2020/02/04_Asia/China/Shanghai
EPI_ISL_416353_2020/02/04_Asia/China/Shanghai
EPI_ISL_416355_2020/02/04_Asia/China/Shanghai
EPI_ISL_416361_2020/02/01_Asia/China/Shanghai
EPI_ISL_416362_2020/02/02_Asia/China/Shanghai
EPI_ISL_416363_2020/02/08_Asia/China/Shanghai
EPI_ISL_416365_2020/01/30_Asia/China/Shanghai
EPI_ISL_416366_2020/01/30_Asia/China/Shanghai
EPI_ISL_416367_2020/02/01_Asia/China/Shanghai
ia
hg
na
hS/
ani
hC/
ais
A_
10/
20/
02
02
_0
73
61
4_
LSI
_IP
E
EPI_ISL_416372_2020/02/01_Asia/China/Shanghai
EPI_ISL_416373_2020/02/07_Asia/China/Shanghai
EPI_ISL_416376_2020/02/05_Asia/China/Shanghai
EPI_ISL_416377_2020/02/07_Asia/China/Shanghai
ia
hg
na
hS/
ani
hC/
ais
A_
10/
20/
02
02
_9
73
61
4_
LSI
_IP
E
EPI_ISL_416382_2020/02/08_Asia/China/Shanghai
EPI_ISL_416387_2020/02/01_Asia/China/Shanghai
EPI_ISL_416389_2020/01/21_Asia/China/Shanghai
EPI_ISL_416390_2020/01/31_Asia/China/Shanghai
EPI_ISL_416397_2020/02/02_Asia/China/Shanghai
EPI_ISL_416398_2020/02/02_Asia/China/Shanghai
EPI_ISL_416399_2020/02/13_Asia/China/Shanghai
EPI_ISL_416401_2020/02/02_Asia/China/Shanghai
EPI_ISL_416402_2020/02/11_Asia/China/Shanghai
EPI_ISL_416403_2020/02/02_Asia/China/Shanghai
EPI_ISL_416405_2020/02/02_Asia/China/Shanghai
EPI_ISL_416406_2020/02/15_Asia/China/Shanghai
EPI_ISL_416407_2020/02/15_Asia/China/Shanghai
EPI_ISL_416409_2020/02/02_Asia/China/Shanghai
EPI_ISL_416411_2020/01/25_Oceania/Australia/Victoria/Melbourne
EPI_ISL_416412_2020/03/02_Oceania/Australia/Victoria/Melbourne
EPI_ISL_416415_2020/02/08_Oceania/Australia/Victoria/Melbourne
EPI_ISL_416425_2020/02/03_Asia/China/Hangzhou
EPI_ISL_41
6426_2020/
03/17_Euro
pe/Hungary
/Budapest
EPI_ISL_416428_2020/03/07_Asia/Vietnam/Quangning
EPI_ISL_416429_2020/02/11_Asia/Vietnam/Vinhphuc
EPI_ISL_416430_2020/03/06_Asia/Vietnam/Hanoi
EPI_ISL_416431_2020/03/06_Asia/Vietnam/Hanoi
EPI_ISL_416432_2020/03/07_Asia/SaudiArabia/Riyadh
EPI_ISL_416433_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416434_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416435_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416436_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416437_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416438_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416439_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416440_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416441_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416442_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416443_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416444_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416445_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416446_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416447_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416448_2020/03/11_NorthAmerica/USA/Washington
EPI_ISL_416451_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416452_2020/03/10_NorthAmerica/USA/Washington
EPI_ISL_416454_2020/03/06_NorthAmerica/USA/Washington
EPI_ISL_416455_2020/03/07_NorthAmerica/USA/Washington
EPI_ISL_416457_2020/03/18_NorthAmerica/USA/California/SanDiegoCounty
EPI_ISL_416458_2020/03/02_Asia/Kuwait/Hawali
ytn
uo
Cg
niK/
not
gni
hsa
W/A
SU/
acire
mA
htro
N_
92/
20/
02
02
_0
64
61
4_
LSI
_IP
Eyt
nu
oC
gni
K/n
otg
nih
sa
W/A
SU/
acir
em
Ahtr
oN
_9
2/2
0/0
20
2_
16
46
14
_L
SI_I
PE
EPI_ISL_416465_2020/02/29_NorthAmerica/USA/Washington/KingCounty
ytn
uo
Cg
niK/
not
gni
hsa
W/A
SU/
acire
mA
htro
N_
30/
30/
02
02
_6
64
61
4_
LSI
_IP
E
EPI_ISL_416467_2020/03/02_Europe/Belgium/Holsbeek
EPI_ISL_416468_2020/03/02_Europe/Belgium/Holsbeek
EPI_ISL_416469_2020/03/03_Europe/Belgium/Kessel-Lo
EPI_ISL_416470_2020/03/02_Europe/Belgium/CouthuinEPI_ISL_416471_2020/03/02_Europe/Belgium/Kessel-Lo
EPI_ISL_416472_2020/03/02_Europe/Belgium/Kessel-Lo
EPI_ISL_416473_2020/01/26_Asia/China/Hangzhou
EPI_ISL_416474_2020/01/28_Asia/China/Hangzhou
EPI_ISL_416475_2020/03/03_Europe/Belgium/Leuven
EPI_ISL_416476_2020/03/02_Europe/Belgium/Holsbeek
EPI_ISL_416477_2020/03/08_Asia/Georgia/Tbilisi
EPI_ISL_416478_2020/03/14_Asia/Georgia/Tbilisi
EPI_ISL_416479_2020/03/05_Asia/Georgia/Tbilisi
EPI_IS
L_4164
80_202
0/03/1
1_Asia
/Georg
ia/Tbi
lisi
EPI_ISL_4164
81_2020/03/1
6_Asia/Georg
ia/Tbilisi
EPI_IS
L_4164
82_202
0/03/1
3_Asia
/Georg
ia/Tbi
lisi
eiksro
go
nol
eiZ/
dn
alo
P/e
por
uE
_3
0/3
0/0
20
2_
88
46
14
_L
SI_I
PE
EPI_ISL_4164
89_2020/03/1
5_NorthAmeri
can/USA/WI/D
anecounty
EPI_ISL_416491_2020/03/15_NorthAmerica/USA/WI/Danecounty
EPI_ISL_
416493_2
020/03/0
8_Europe
/France/
HautsdeF
rance/Ch
teau-Thi
erry
EPI_ISL_416494_202
0/03/04_Europe/Fra
nce/Normandie/Roue
n
EPI_ISL_416495_2020/03/10_Europe/France/HautsdeFrance/Compigne
EPI_ISL_416496_2020/03/10_Europe/France/HautsdeFrance/Compigne
EPI_ISL_416497_2020/03/10_Europe/France/Hautsd
eFrance/Compigne
EPI_ISL_416498_2020/03/11_Europe/France/IledeFrance/Garches
EPI_ISL_
416499_2
020/03/1
1_Europe
/France/
IledeFra
nce/Long
jumeau
EPI_ISL_416502_2020/02/26_Europe/France/Bretagne/Rennes
EPI_IS
L_4165
03_202
0/03/0
1_Euro
pe/Fra
nce/Br
etagne
/Renne
s
EPI_ISL_416505_2020/03/02_Europe/France/Bretagne/Rennes
EPI_ISL_416506_2020/03/03_Europe/France/Bretagne/Rennes
EPI_IS
L_4165
07_202
0/03/0
5_Euro
pe/Fra
nce/Br
etagne
/Renne
s
EPI_ISL_416509_2020/03/06_Europe/France/Bretagne/Rennes
EPI_ISL_416510_2020/03/06_Europe/France/Bretagne/Rennes
EPI_ISL_416511_2020/03/07_Europe/France/Bretagne/Rennes
EPI_ISL_416512_2020/03/07_Europe/France/Bretagne/Rennes
EPI_ISL_416513_2020/03/07_Europe/France/Bretagne/Rennes
EPI_ISL_416514_2020/03/15_Oceania/Australia/Victoria/Melbourne
EPI_ISL_416519_2020/03/02_Oceania/NewZealand/Auckland
EPI_ISL_416521_2020/03/10_As
ia/SaudiArabia
Tree scale: 0.0001
Inner Color: Continents
Asia
Europe
NorthAmerica
Oceania
SouthAmerica
Outer Clolors: Countries
Australia
Belgium
Brazil
Cambodia
Canada
Chile
China
Denmark
Finland
France
Georgia
Germany
HongKong_China
Hungary
India
Ireland
Italy
Japan
Kuwait
Luxembourg
Mexico
Nepal
Netherlands
NewZealand
Poland
Portugal
SaudiArabia
Singapore
SouthKorea
Sweden
Switzerland
Taiwan_China
Thailand
UnitedKingdom
UnitedStates
Vietnam
Shape Plot: Haplotype
H1
H2
H3
H4
H5
H6
Other
Cluster 2 Cluster 1
Cluster 3Cluster 2
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
1 2 3 4 5 6 7 8 9
9999
9898
99
9898
Block 1 (27 kb)1 2 3 4 5 6 7 8 9
9799
9799
9999
9999
9998
9999
9997
9797
9898
9999
9898
98
98
98
9998
99
Block 1 (27 kb)
.423
.288
.148
.122
.423
.288
.148
.122
C C C C C A C A T
T T C T C A C G T
C C T C C A C A C
C C T C T G T A C
.454
.324
.123
.063
.454
.324
.123
.063
A BC
241T
C30
37T
C87
82T
C14
408T
C17
747T
A178
58G
C18
060T
A234
03G
T281
44C
C24
1T
C30
37T
C87
82T
C14
408T
C17
747T
A178
58G
C18
060T
A234
03G
T281
44C
1 2 3 4 5 6 7 8 91 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 91 2 3 4 5 6 7 8 9
T T C T C A C G T
C C C C C A C A T
C C T C T G T A C
C C T C C A C A C
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
18060T 23403G 28144C
14408T 17747T 17858G
241T 3037T 8782T
0.00
0.25
0.50
0.75
1.00
0.00
0.25
0.50
0.75
1.00
0.00
0.25
0.50
0.75
1.00
Freq
uenc
y of
eac
h sp
ecifi
c si
te
TTCTCACGT(H1) CCCCCACAT(H2) CCTCTGTAC(H3) CCTCCACAC(H4)
CCCA TTTG CAC TGT
0.00
0.25
0.50
0.75
1.00
0.00
0.25
0.50
0.75
1.00
Freq
uenc
y of
major
hap
loty
pes
of th
e 4,
3 an
d 9
spec
ific
site
s
countrySouthKoreaChinaJapanUSACanadaAustraliaSpainUKGermanyNetherlandAustriaPortugalBelgiumItalyFranceSwitzerlandBrazil
Asia
NorthAmerica Ocieania
Europe
SouthAmerica
A
B
countrySouthKoreaChinaJapanUSACanadaAustraliaSpainUKGermanyNetherlandAustriaPortugalBelgiumItalyFranceSwitzerlandBrazil
Asia
NorthAmerica Ocieania
Europe
SouthAmerica
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
Sout
hKor
eaC
hina
Japa
nU
SAC
anad
aAu
stra
liaSp
ain
UK
Ger
man
yN
ethe
rland
Aust
riaPo
rtuga
lBe
lgiu
mIta
lyFr
ance
Switz
erla
ndBr
azil
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
0.0
0.1
0.2
0.3
0.4
0.5
0 10000 20000 30000
H1
COL
1
2
0.0
0.1
0.2
0.3
0.4
0.5
0 10000 20000 30000
H2
COL
1
2
0.0
0.1
0.2
0.3
0.4
0.5
0 10000 20000 30000
H3
COL
1
2
0.0
0.1
0.2
0.3
0.4
0.5
0 10000 20000 30000Pos
H4
COL
1
2
A B COL other mutation sitesthe 9 specific sites
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
Australia
Austria
Belgium
Brazil
Canada
China
France
Germany
Italy
Japan
Netherlands
Portugal
South_Korea
Spain
Switzerland
USA
United_Kingdom
H1
H5
H2
H4
H3
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.46 , p = 0.06451
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.18 , p = 0.4799
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.33 , p = 0.1906
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.47 , p = 0.055
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.18 , p = 0.4962
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.45 , p = 0.06913
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.32 , p = 0.2105
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.18 , p = 0.4807
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.18 , p = 0.4802
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.44 , p = 0.08004
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.46 , p = 0.06523
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
R = 0.45 , p = 0.07226
28144T TTTG CAC TTCTCACGT(H1)
17747C 17858A 18060C 23403G
241T 3037T 8782C 14408T
0.00 0.25 0.50 0.75 1.000.00 0.25 0.50 0.75 1.000.00 0.25 0.50 0.75 1.000.00 0.25 0.50 0.75 1.00
0.04
0.08
0.12
0.16
0.04
0.08
0.12
0.16
0.04
0.08
0.12
0.16
Frequency of 9 specific sites and haplotypes
Est
imat
ed d
eath
rat
e
.CC-BY-NC-ND 4.0 International license(which was not certified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprintthis version posted June 30, 2020. . https://doi.org/10.1101/2020.04.24.058933doi: bioRxiv preprint
Ka (mean)
10-3
s.e. (Ka)
10-3
Ks (mean)
10-3
s.e. (Ks)
10-3
Ka/Ks
(mean)
s.e.
(Ka/Ks) Hypothesis
P-value*
622 genomes
P-value*
3624 genomes
SARS-CoV 0.985 0.018 1.310 0.049 0.760 0.038 Ka/Ks ( SARS-CoV < SARS-CoV-2) 0.139 0.084
MERS-CoV 1.319 0.040 4.887 0.096 0.260 0.003 Ka/Ks ( MERS-CoV < SARS-CoV-2) 0.000 0.000
SARS-CoV-2 622 genomes 0.231 0.004 0.265 0.005 1.008 0.020
3624 genomes 0.287 0.002 0.298 0.002 1.094 0.010
Note: *One-sided Mann-Whitney U-test for the means of two independent samples.
.C
C-B
Y-N
C-N
D 4.0 International license
(which w
as not certified by peer review) is the author/funder. It is m
ade available under aT
he copyright holder for this preprintthis version posted June 30, 2020.
. https://doi.org/10.1101/2020.04.24.058933
doi: bioR
xiv preprint
substitution rate (10-3) 95% CI (10-3) tMRCA 95% CI references
SARS-CoV 1.050 0.489, 1.654 2002/6/28 2002/1/19, 2002/11/3 this study
- 0.800, 2.380 spring of 2002 - 17
MERS-CoV 1.516 1.392, 1.632 2012/6/7 2012/6/4, 2012/6/9 this study
1.120 0.876, 1.370 2012/3 2011/12, 2012/6 18
SARS-CoV-2 1.601 1.418, 1.796 2019/9/27 2019/8/28, 2019/10/26 this study
17. Zhao Z, Li H, Wu X, Zhong Y, Zhang K, Zhang YP, et al. Moderate mutation rate in the SARS coronavirus genome and its
implications. BMC Evol Biol. 2004;4:21.
18. Cotten M, Watson SJ, Zumla AI, Makhdoom HQ, Palser AL, Ong SH, et al. Spread, circulation, and evolution of the
Middle East respiratory syndrome coronavirus. mBio. 2014;5(1).
.C
C-B
Y-N
C-N
D 4.0 International license
(which w
as not certified by peer review) is the author/funder. It is m
ade available under aT
he copyright holder for this preprintthis version posted June 30, 2020.
. https://doi.org/10.1101/2020.04.24.058933
doi: bioR
xiv preprint
Pos Ref Alt
Cluster 1 Cluster 2 Cluster 3
Fst Gene
region
Mutation
type
Protein
changed
codon
changed Impact
pi Major
allele frequency pi
Major
allele frequency pi
Major
allele frequency
241 C T 0.0000 C 1.0000 0.0109 C 0.9945 0.0000 T 1.0000 0.9912 5'UTR upstream NA NA MODIFIER
3037 C T 0.0000 C 1.0000 0.0109 C 0.9945 0.0000 T 1.0000 0.9912 gene-orf1ab synonymous 924F 2772ttC>ttT LOW
8782 C T 0.0000 T 1.0000 0.3863 C 0.7390 0.0000 C 1.0000 0.5821 gene-orf1ab synonymous 2839S 8517agC>agT LOW
14408 C T 0.0000 C 1.0000 0.0055 C 0.9973 0.0000 T 1.0000 0.9956 gene-orf1ab missense 4715P>L 14144cCt>cTt MODERATE
17747 C T 0.0735 T 0.9620 0.0000 C 1.0000 0.0000 C 1.0000 0.9752 gene-orf1ab missense 5828P>L 17483cCt>cTt MODERATE
17858 A G 0.0497 G 0.9747 0.0000 A 1.0000 0.0000 A 1.0000 0.9836 gene-orf1ab missense 5865Y>C 17594tAt>tGt MODERATE
18060 C T 0.0000 T 1.0000 0.0164 C 0.9918 0.0000 C 1.0000 0.9761 gene-orf1ab synonymous 5932L 17796ctC>ctT LOW
23403 A G 0.0000 A 1.0000 0.0164 A 0.9918 0.0000 G 1.0000 0.9869 gene-S missense 614D>G 1841gAt>gGt MODERATE
28144 T C 0.0000 C 1.0000 0.3915 T 0.7335 0.0000 T 1.0000 0.5785 gene-ORF8 missense 84L>S 251tTa>tCa MODERATE
.C
C-B
Y-N
C-N
D 4.0 International license
(which w
as not certified by peer review) is the author/funder. It is m
ade available under aT
he copyright holder for this preprintthis version posted June 30, 2020.
. https://doi.org/10.1101/2020.04.24.058933
doi: bioR
xiv preprint
Ka (mean) 10-3 s.e. (Ka)10-3 Ks (mean) 10-3 s.e. (Ks)10-3 Ka/Ks (mean) s.e. (Ka/Ks) Hypothesis P-value*
3624 genomes
H1 0.510 0.057 0.498 0.056 1.099 0.010 Ka/Ks ( H3 < H1) 0.000
H2 0.347 0.042 0.334 0.044 1.268 0.023 Ka/Ks ( H3 < H2) 0.000
H3 0.298 0.007 0.397 0.009 0.795 0.010 - -
H4 0.334 0.023 0.414 0.019 0.796 0.019 - -
16373 genomes
H1 0.416 0.011 0.384 0.011 1.178 0.005 Ka/Ks ( H3 < H1) 0.000
H2 0.332 0.014 0.316 0.014 1.224 0.012 Ka/Ks ( H3 < H2) 0.000
H3 0.305 0.004 0.401 0.005 0.809 0.007 - -
H4 0.354 0.010 0.477 0.009 0.749 0.010 - -
Note: *One-sided Mann-Whitney U-test for the means of two independent samples.
.C
C-B
Y-N
C-N
D 4.0 International license
(which w
as not certified by peer review) is the author/funder. It is m
ade available under aT
he copyright holder for this preprintthis version posted June 30, 2020.
. https://doi.org/10.1101/2020.04.24.058933
doi: bioR
xiv preprint