dating the origin of the genus flavivirus in the light of

14
Dating the origin of the genus Flavivirus in the light of Beringian biogeography John H.-O. Pettersson and Omar Fiz-Palacios Correspondence John H.-O. Pettersson [email protected] or [email protected] Received 28 February 2014 Accepted 4 June 2014 Department of Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden The genus Flavivirus includes some of the most important human viral pathogens, and its members are found in all parts of the populated world. The temporal origin of diversification of the genus has long been debated due to the inherent problems with dating deep RNA virus evolution. A generally accepted hypothesis suggests that Flavivirus emerged within the last 10 000 years. However, it has been argued that the tick-borne Powassan flavivirus was introduced into North America some time between the opening and closing of the Beringian land bridge that connected Asia and North America 15 000–11 000 years ago, indicating an even older origin for Flavivirus. To determine the temporal origin of Flavivirus, we performed Bayesian relaxed molecular clock dating on a dataset with high coverage of the presently available Flavivirus diversity by combining tip date calibrations and internal node calibration, based on the Powassan virus and Beringian land bridge biogeographical event. Our analysis suggested that Flavivirus originated ~85 000 (64 000–110 000) or 120 000 (87 000–159 000) years ago, depending on the circumscription of the genus. This is significantly older than estimated previously. In light of our results, we propose that it is likely that modern humans came in contact with several members of the genus Flavivirus much earlier than suggested previously, and that it is possible that the spread of several flaviviruses coincided with, and was facilitated by, the migration and population expansion of modern humans out of Africa. INTRODUCTION The genus Flavivirus, within the family Flaviviridae, currently consists of .70 virus species distributed all over the globe (Gould et al., 2003; Gubler et al., 2007; Lindenbach et al., 2007). It includes numerous viruses of major human health concern, such as Dengue virus (DENV), Japanese enceph- alitis virus, West Nile virus and the type species for flaviviruses, yellow fever virus (flavus: ‘yellow’), giving the family and the genus its name (Gould & Solomon, 2008; Mackenzie et al., 2004). The genome of flaviviruses consists of a positive-sense ssRNA molecule of ~11 kbp. One single ORF encodes three structural proteins (capsid, pre-mem- brane and envelope) and seven non-structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B and NS5) flanked by untranslated regions (Chambers et al., 1990; Lindenbach & Rice, 2003). The genus Flavivirus has been divided into four main groups based on ecological characteristics, molecular phylo- genetic analyses, vector specificity and virus performance in host cells (Gaunt et al., 2001; Gould et al., 2001; Kuno, 2007). Most of the recognized flaviviruses belong to the mosquito-borne flaviviruses (MBFs) that are commonly, but not exclusively, vectored by either Aedes or Culex mosqui- toes. The second group, tick-borne flaviviruses (TBFs), the only monophyletic group of Flavivirus, are vectored by ixodid ticks, and are further subdivided into a mammal and a seabird group based on their host specificity (Grard et al., 2007; Gritsun et al., 2003). The no-known vector flaviviruses (NKVFs) are associated with either bats or rodents and infect vertebrates without an apparent arthropod vector transmitting the viruses (Porterfield, 1980). The majority of the known flaviviruses are zoonotic, i.e. pathogenic viruses that can be transmitted between humans and other animals. However, the fourth group, the insect-specific flaviviruses (ISFs; flaviviruses that are only capable of replicating in insect cells) (Kuno, 2007), is most likely an undersampled and very diverse group (Cook et al., 2012). The idea of a molecular clock has been used to address many hypotheses in the study of emerging viral diseases, especially for diseases caused by RNA viruses (Bromham & Penny, 2003), e.g. to reject the hypothesis of a potential spread of human immunodeficiency virus (HIV) in the 1950s through a contaminated polio vaccine (Korber et al., 2000). However, dating the origin of viruses is a complex and challenging task. For viruses with high rates of evolu- tion, the original phylogenetic signal is difficult to deduce even with complex evolutionary models because the signal Three supplementary figures and three supplementary tables are available with the online version of this paper. Journal of General Virology (2014), 95, 1969–1982 DOI 10.1099/vir.0.065227-0 065227 G 2014 The Authors Printed in Great Britain 1969

Upload: others

Post on 23-Mar-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Dating the origin of the genus Flavivirus in the lightof Beringian biogeography

John H.-O. Pettersson and Omar Fiz-Palacios

Correspondence

John H.-O. Pettersson

[email protected] or

[email protected]

Received 28 February 2014

Accepted 4 June 2014

Department of Systematic Biology, Evolutionary Biology Centre, Uppsala University,Uppsala, Sweden

The genus Flavivirus includes some of the most important human viral pathogens, and its

members are found in all parts of the populated world. The temporal origin of diversification of the

genus has long been debated due to the inherent problems with dating deep RNA virus evolution.

A generally accepted hypothesis suggests that Flavivirus emerged within the last 10 000 years.

However, it has been argued that the tick-borne Powassan flavivirus was introduced into North

America some time between the opening and closing of the Beringian land bridge that connected

Asia and North America 15 000–11 000 years ago, indicating an even older origin for Flavivirus.

To determine the temporal origin of Flavivirus, we performed Bayesian relaxed molecular clock

dating on a dataset with high coverage of the presently available Flavivirus diversity by combining

tip date calibrations and internal node calibration, based on the Powassan virus and Beringian

land bridge biogeographical event. Our analysis suggested that Flavivirus originated ~85 000

(64 000–110 000) or 120 000 (87 000–159 000) years ago, depending on the circumscription

of the genus. This is significantly older than estimated previously. In light of our results, we

propose that it is likely that modern humans came in contact with several members of the genus

Flavivirus much earlier than suggested previously, and that it is possible that the spread of several

flaviviruses coincided with, and was facilitated by, the migration and population expansion of

modern humans out of Africa.

INTRODUCTION

The genus Flavivirus, within the family Flaviviridae, currentlyconsists of .70 virus species distributed all over the globe(Gould et al., 2003; Gubler et al., 2007; Lindenbach et al.,2007). It includes numerous viruses of major human healthconcern, such as Dengue virus (DENV), Japanese enceph-alitis virus, West Nile virus and the type species forflaviviruses, yellow fever virus (flavus: ‘yellow’), giving thefamily and the genus its name (Gould & Solomon, 2008;Mackenzie et al., 2004). The genome of flaviviruses consistsof a positive-sense ssRNA molecule of ~11 kbp. One singleORF encodes three structural proteins (capsid, pre-mem-brane and envelope) and seven non-structural proteins (NS1,NS2A, NS2B, NS3, NS4A, NS4B and NS5) flanked byuntranslated regions (Chambers et al., 1990; Lindenbach &Rice, 2003).

The genus Flavivirus has been divided into four maingroups based on ecological characteristics, molecular phylo-genetic analyses, vector specificity and virus performance inhost cells (Gaunt et al., 2001; Gould et al., 2001; Kuno,2007). Most of the recognized flaviviruses belong to themosquito-borne flaviviruses (MBFs) that are commonly, but

not exclusively, vectored by either Aedes or Culex mosqui-toes. The second group, tick-borne flaviviruses (TBFs), theonly monophyletic group of Flavivirus, are vectored byixodid ticks, and are further subdivided into a mammal anda seabird group based on their host specificity (Grard et al.,2007; Gritsun et al., 2003). The no-known vector flaviviruses(NKVFs) are associated with either bats or rodents andinfect vertebrates without an apparent arthropod vectortransmitting the viruses (Porterfield, 1980). The majority ofthe known flaviviruses are zoonotic, i.e. pathogenic virusesthat can be transmitted between humans and other animals.However, the fourth group, the insect-specific flaviviruses(ISFs; flaviviruses that are only capable of replicating ininsect cells) (Kuno, 2007), is most likely an undersampledand very diverse group (Cook et al., 2012).

The idea of a molecular clock has been used to addressmany hypotheses in the study of emerging viral diseases,especially for diseases caused by RNA viruses (Bromham &Penny, 2003), e.g. to reject the hypothesis of a potentialspread of human immunodeficiency virus (HIV) in the1950s through a contaminated polio vaccine (Korber et al.,2000). However, dating the origin of viruses is a complexand challenging task. For viruses with high rates of evolu-tion, the original phylogenetic signal is difficult to deduceeven with complex evolutionary models because the signal

Three supplementary figures and three supplementary tables areavailable with the online version of this paper.

Journal of General Virology (2014), 95, 1969–1982 DOI 10.1099/vir.0.065227-0

065227 G 2014 The Authors Printed in Great Britain 1969

diminishes with time due to repeated substitutions at thesame site (Holmes, 2003a). The extremely high substitutionrate observed for RNA viruses is mostly due to their error-prone replication and repair machinery (Bromham &Penny, 2003; Duffy et al., 2008). Unequal substitution ratesamong lineages of the same genus further adds to thecomplexity of accurately estimating the time to the mostrecent common ancestor (tMRCA) (Duffy et al., 2008;Holmes, 2003a; Sanjuan, 2012).

Reconstruction of divergence times requires a temporalreference to convert branch lengths of a phylogenetic treeinto time. This temporal reference is usually in the form ofa fossil or biogeographical event (i.e. internal node calibra-tion), as in most eukaryote studies, or isolation dates (i.e.tip date calibration), as with bacteria and virus studies.From the biogeographical events, fossils or isolation dates,the ages of internal nodes can be estimated. The matchbetween virus and host phylogenies has led to the sugges-tion that some virus lineages originated millions of yearsago. However, this has been strongly rejected by molecularclock studies indicating a virus origin of only a few thousandyears ago (Holmes, 2003a; Worobey et al., 2010). Thiscontroversy has sparked a debate regarding the validity andutility of molecular clock reconstructions using tip datesversus internal (deep) calibration to study and infer thetemporal scale of virus evolution (Sharp & Simmonds,2011).

Previous studies based on molecular clocks have suggestedthat the Flavivirus clade originated ~10 000 years ago inAfrica (Gould et al., 2001) from a non-vectored mam-malian virus ancestor (Gould et al., 2003), followed by theradiation of TBFs and MBFs during the last 5000 and 3000years, respectively (Zanotto et al., 1996), and ISFs ~3000years ago (Crochu et al., 2004). Other studies have alsodated small clades of individual flaviviruses using tip datesas calibration points. These estimates are broadly congruentwith the notion of a Flavivirus origin within the last 10 000years (e.g. Dunham & Holmes 2007; May et al., 2011; Panet al., 2011; Twiddy et al., 2003). However, recent analyseson TBFs, based on complete genomes analysed with arelaxed molecular clock and Bayesian methods, have esti-mated the age for the TBF clade to be at least 16 000 years(Heinze et al., 2012), indicating that the age for theFlavivirus genus as a whole should be significantly olderthan suggested previously.

Within the TBFs, a close relationship between the bio-geography of the Beringian land bridge, the historicallandmass that connected north-east Siberia with Alaska andnorth-west Canada (Hulten, 1937), and the evolution of thePowassan virus (POWV) clade, the only TBFs present inNorth America, has been pointed out recently (Ebel, 2010;Heinze et al., 2012). Geological studies estimate that the landbridge was open for mammal land migration before the endof the last glaciation between 15 000 and 11 000 years ago(Elias, 2001; Elias et al., 1996; Kelly, 2003; Mandryk et al.,2001). The absence of tick-borne encephalitis virus (TBEV)

and Russian POWV lineages in North America is consistentwith the idea that the colonization of POWV into NorthAmerica happened under a single event by a land route, alsosupported by the lack of evidence for continuous movementof POWV and TBEV between Russia and North America byeither seabirds or mosquitoes (Heinze et al., 2012).Therefore, it is unlikely that POWV emerged in NorthAmerica before the land bridge became accessible again afterthe last glacial maximum. Consequentially, it is improbablethat POWV emerged after the Bering Strait was formed. Themost probable explanation, given the current knowledge andpresent distribution of the POWV lineages, is that during thecourse of TBF evolution POWV diverged from other TBFlineages in a single introduction event by the land routeduring the presence of the Beringian land bridge 15 000–11 000 years ago (Heinze et al., 2012).

POWV could have been introduced into North Americaby either humans or other mammals that colonized theAmericas during the existence of the Beringian landbridge via one of two routes: the interior route or coastalroute. (i) The interior route became accessible when theprocess of deglaciation created a corridor between theLaurentide and Cordilleran glaciers. The corridor did notexist during the last glacial maximum. The corridorstarted to open ~15 000 years ago (Dixon, 2013; Dyke,2004), and became habitable for humans and othermammals ~13 500 years ago (Dixon, 2013). (ii) Towardsthe end of the last glacial maximum, the glaciers border-ing to the south-west coast of Alaska through westernCanada started to melt. Around 16 000 years ago, theglaciers had receded to the extent that the coastal habitatscould support human populations (Dixon, 2013 andreferences therein). The coastal route is also likely to bethe route by which humans first entered the Americas(Achilli et al., 2013; Bodner et al., 2012; Fagundes et al.,2008; Goebel et al., 2008; Schurr, 2004).

Presently, there are coding nucleotide genomes available from.70 unique strains of the genus Flavivirus, including virusesrecognized by the International Committee on Taxonomy ofViruses (http://www.ictvonline.org/virusTaxonomy.asp?version=2013). Here, we combine this information together withrelaxed molecular clock methods implementing a Bayesianapproach, again to estimate and shed some light on diver-gence times of the genus Flavivirus and groups within. Aidedby biogeographical calibration, we report the first study, tothe best of our knowledge, estimating the age and rates ofsubstitution of the genus Flavivirus as a whole, including itsmajor groups, using complete coding nucleotide genomes.We explore multiple internal calibration points usingBayesian methods in combination with a relaxed molecularclock to reconstruct divergence times. We compare ourresults with previous studies reconstructing divergencetimes for different clades within the genus Flavivirus. Wealso discuss the incongruence between virus-dating studiesusing molecular clocks and RNA virus evolution. Finally, wepropose a temporal and biogeographical scenario for theevolution of the genus Flavivirus.

J. H.-O Pettersson and O. Fiz-Palacios

1970 Journal of General Virology 95

RESULTS AND DISCUSSION

Phylogenetic reconstruction of the genusFlavivirus

The phylogeny of Flavivirus and its major groups wasinferred using a genus-wide sampling approach includ-ing 86 complete coding flavivirus genomes with knownisolation dates (Table 1) analysed by both Bayesian andmaximum-likelihood methodologies. The overall treetopology from all of the different analyses was in agree-ment with previous published phylogenies based on theNS3 gene (Billoir et al., 2000; Cook & Holmes, 2006;Grard et al., 2007), multiple genes (Medeiros et al., 2007)and complete genomes (Cook & Holmes, 2006; Cook et al.,2012; Grard et al., 2007, 2010; Kuno & Chang, 2006; Loboet al., 2009). Our results were consistent with what iscommonly referred to as an NS3-like topology (Fig. 1).

Two datasets were analysed, a nucleotide and an aminoacid sequence alignment, using Bayesian and maximum-likelihood approaches. In our Bayesian phylogenetic anal-ysis, the nucleotide and amino acid 50 % majority ruleconsensus trees from MrBayes (Figs S1 and S2, available inthe online Supplementary Material) and the maximumclade credibility tree from BEAST (Figs 1 and S3), all rootedwith bovine viral diarrhea virus 1 (BVDV-1), producedtopologies with strong support for the focal nodes (A–O).The Bayesian trees were also supported by the topology ofthe maximum-likelihood trees, although the sister rela-tionships between Tamana bat virus (TABV) and all othergroups were unresolved in the maximum-likelihood trees(data not shown, available upon request).

There was maximum posterior probability (1.0 pp) for thefour major clades roughly corresponding to the conven-tional grouping of Flavivirus (Gould et al., 2001, 2003);the TBF clade (node M), the NKVF clade (NKVFa, nodeN), a MBF-dominated group (MBFdom, node F) and theISF clade (ISFa, node E), sister to the other three groups(Fig. 1). The MBFdom group (node F) includes (i) theAedes MBF clade (MBFAedes, node H), (ii) a NKVF clade(NKVFb), sister to the MBFAedes clade, (iii) a grade oflikely insect-specific flaviviruses consisting of Chaoyangvirus, Donggang virus, Lammi virus and Nounane virus(nodes I and J) (Huhtamo et al., 2009; Kolodziejek et al.,2013; Lee et al., 2013) and (iv) the Culex MBF clade(MBFCulex, node K). The present study supports the factthat the majority of the members of the genus Flavivirusshow a very strong pattern of host–vector association(Gould et al., 2001, 2003), as all groups showed distinctclustering depending on vector association. It is possiblethat the positioning of the NKVFb clade is a consequenceof secondary loss of vector capability (Gould et al., 2001,2003). Likewise, it is possible that the grade of ISFs (nodesI and J) within the MBFdom group is also due to secondaryloss of vector capability.

The main disagreement between our findings and otherstudies relates to the position of TABV. TABV has

previously been suggested to be the sister group to theISFa clade (node E) based on complete coding amino acidsequence analysis (Cook et al., 2012). However, in thepresent study, TABV appears to have diverged before theISFa clade (node B, 1.0 pp, based on Bayesian nucleotideand amino acid phylogenetic analyses), which instead issister to the TBFs, NKVFs and MBFs (Figs 1, S1 and S2).Our results are consistent with the topology of previouscomplete genome and gene-based phylogenetic studies (deLamballerie et al., 2002; Hoshino et al., 2009; Lobo et al.,2009).

Divergence times of the genus Flavivirus

Viruses, and RNA viruses in particular, provide an excel-lent opportunity to study evolutionary change because oftheir relatively high rate of substitution that allows forevolution to be observed within human timescales (Bromham& Penny, 2003; Drummond et al., 2003; Duffy et al., 2008;Holmes, 2003a). Many studies have used molecular clocks todate recent divergences (hundred-year scale; see: Kumar et al.,2010; Mehla et al., 2009; Mohammed et al., 2011; Patil et al.,2011; Ramırez et al., 2010; Weidmann et al., 2013), but herewe show the utility of using internal calibration that allows forthe reconstruction of deeper divergence times.

The divergence times under different calibration schemesare summarized in Tables 2 and S1. All BEAST runs wereperformed with an uncorrelated log-normal relaxedmolecular clock, as the null hypothesis of a strict globalsubstitution rate was rejected by the maximum-likelihoodmolecular clock test (data not shown).

Divergence times of the genus Flavivirus:calibration schemes compared

To examine the temporal origin of Flavivirus, we performedseveral analyses in BEAST as outlined in Methods. Using onlytip date calibration resulted in 12–14 % older mean diver-gence times for nodes A–O than using the internal Beringiancalibration (node O calibrated to 15 000–11 000 years ago)together with tip dates. For example, the age of the root(node A) varied between 265 000 [95 % highest posteriordensity (HPD): 25 600–2 768 000] and 230 500 (156 100–322 700) years ago, respectively (Fig. 1, Table 2). When usingTBF internal calibration (node M, calibrated to 16 100–44 929 years ago) based on the results from Heinze et al.(2012) using only tip dates as calibration, 17–18 % youngermean divergence times were recovered for nodes A–O com-pared with using tip dates only (Table 2). Both Beringianand TBF calibration recovered similar mean divergencetimes, although TBF calibration recovered 4–5 % youngermean estimates for nodes A–O than Beringian calibration(Table 2). Also, allowing for a wider range of the Beringiancalibration, i.e. 16 000–10 000 years, following Dixon (2013),TBF recovered similar ages for the split of POWV and itssister (node O) at 12 500 (10 000–15 600) years ago and forthe root (node A) at 228 500 (142 900–332 500) years ago as

Origin of the genus Flavivirus

http://vir.sgmjournals.org 1971

Table 1. Flaviviruses used in this study

Vector groups include insect-specific (IS), mosquito-borne (MB), no-known vector (NKV) and tick-borne (TB) flaviviruses.

Virus Virus abbreviation Strain Source of isolate Vector

group

Year GenBank

accession no.

Aedes virus AEFV AEFV_SPFLD_MO_2011_MP6 Aedes albopictus IS 2011 KC181923.1

Alfuy virus ALFV MRM3929 Centropus phasianius MB 1966 AY898809.1

Alkhurma virus AHFV 1176 Homo sapiens TB 1995 AF331718.1

Apoi virus APOIV ApMAR-Kitaoka Apodemus spp. NKV 1954 AF160193.1

Bagaza virus BAGV Spain H/2010 Alectoris rufa MB 2010 HQ644143.1

Bagaza virus BAGV DakAr-B209 Mixed Culex MB 1966 AY632545.2

Baiyangdian virus BYDV BYD-1 Duck MB 2010 JF312912.1

Banzi virus BANV SAH-336 Homo sapiens, Culex, Mansonia

africana

MB 1956 DQ859056.1

Bouboui virus BOUV DAK-AR-B490 Anopheles paludis MB 1967 DQ859057.1

Bovine viral diarrhoea virus 1 BVDV-1 NADL 1963 NC_001461.1

Bussuquara virus BSQV BeAn-4073 Alouatta beelzebul MB 1956 NC_009026.2

Cell fusing agent virus CFAV Aedes spp., Aedes egypti IS 1975 NC_001564.1

Chaoyang virus CHAOV ROK144 Aedes vexans nipponii IS 2003 JQ068102.1

Culex flavivirus CxFV Tokyo Culex pipiens IS 2003 AB262759.2

Deer tick virus DTV CTB30-CT95 Ixodes dammini TB 1995 NC_003218.1

Dengue virus type 1 DENV-1 45AZ5 Homo sapiens MB 1974 U88536.1

Dengue virus type 2 DENV-2 New-Guinea-C Homo sapiens MB 1944 AF038403.1

Dengue virus type 3 DENV-3 H87 Homo sapiens MB 1956 M93130.1

Dengue virus type 4 DENV-4 H241 Homo sapiens MB 1956 M14931.2

Donggang virus DGV DG0909 Aedes spp. IS 2009 NC_016997.1

Edge Hill virus EHV YMP-48 Aedes vigilax MB 1969 DQ859060.1

Entebbe bat virus ENTV UgIL-30 Tadarida limbata NKV 1957 DQ837641.1

Gadgets Gully virus GGYV CSIRO122 Ixodes uriae TB 1975 DQ235145.1

Greek goat encephalitis virus GGEV Vergina Capra aegagrus hircus TB 1969 DQ235153.1

Hanko virus HANKV Mosquitoes IS 2005 JQ268258.1

Iguape virus IGUV SPAn-71686 Sentinel mouse MB 1979 AY632538.4

Ilheus virus ILHV Original Aedes spp. and Psorophora spp. MB 1944 AY632539.4

Japanese encephalitis virus JEV JaOArS982 Mosquito? MB 1982 M18370.1

Jugra virus JUGV P9-314 Aedes spp. MB 1968 DQ859066.1

Kadam virus KADV AMP6640 Rhipicephalus pravus TB 1967 DQ235146.1

Kamiti River virus KRV SR-82 Aedes macintoshi IS 1999 NC_005064.1

Karshi virus KSIV LEIV_2247 Ornithodoros papillipes TB 1972 NC_006947.1

Kedougou virus KEDV DakAar-D1470 Aedes minutus MB 1972 AY632540.2

Kokobera virus KOKV AusMRM-32 Culex annulirostris MB 1960 NC_009029.2

Koutango virus KOUV DakAr-D-5443 Tatera kempi MB 1968 EU082200.1

Kunjin virus KUNV MRM61C Culex annulirostris MB 1960 D00246.1

Kyasanur forest disease virus KFDV G11338 Haemaphysalis spinigera TB 1957 JF416959.1

Lammi virus LAMV Mosquitoes IS 2004 FJ606789.1

J.H.-O

Pettersson

andO

.Fiz-P

alacios

19

72

Journal

of

General

Viro

logy

95

Table 1. cont.

Virus Virus abbreviation Strain Source of isolate Vector

group

Year GenBank

accession no.

Langat virus LGTV TP21 Ixodes granulatus Supino TB 1956 AF253419.1

Louping ill virus LIV 369/T2 Ixodes ricinus TB 1963 NC_001809.1

Meaban virus MEAV T70 Ornithodoros maritimus TB 1981 DQ235144.1

Modoc virus MODV M544 Peromyscus maniculatus NKV 1958 AJ242984.1

Montana myotis leukoencephalitis virus MMLV ATCC-VR-537 Myotis lucifugus NKV 1958 AJ299445.1

Murray Valley encephalitis MVEV MVE-1-51 Homo sapiens MB 1956 AF161266.1

Nakiwogo virus NAKV Uganda08 Mansonia africana nigerrima H4A1 IS 2008 GQ165809.2

New Mapoon Virus NMV CY1014 Culex annulirostris MB 1998 KC788512.1

Nounane virus NOUV Uranotaenia mashonaensis IS 2004 FJ711167.1

Ntaya virus NTAV IPDIA Mosquitoes MB 1966 JX236040.1

Omsk hemorrhagic fever virus OHFV Kubrin Homo sapiens TB 1947 AY438626.1

Omsk hemorrhagic fever virus OHFV Guriev Homo sapiens TB 1947 AB507800.1

Palm Creek Virus PCV 56 Coquillettidia xanthogaster IS 2010 KC505248.1

Potiskum virus POTV IBAN-10069 Cricetomys gambianus MB 1989 DQ859067.1

Powassan virus POWV LB Homo sapiens TB 1958 L06436.1

Quang Binh virus QBV VN180 Culex tritaeniorhynchus IS 2002 NC_012671.1

Rio Bravo virus RBV RiMAR Tadarida brasiliensis mexicana NKV 1954 AF144692.1

Rocio virus ROCV SPH-34675 Homo sapiens MB 1975 AY632542.4

Royal Farm virus RFV Eg-Art-371 Argas hermanni TB 1968 DQ235149.1

Saboya virus SABV DakAR-D4600 Tatera kempi MB 1968 DQ859062.1

Saumarez Reef virus SREV CSIRO-4 Ornithodoros capensis TB 1974 DQ235150.1

Sepik virus SEPV MK7148 Mansonia septempunctata MB 1966 DQ859063.1

Sitiawan virus STWV Broiler chicken MB 2000 JX477686.1

Spanish sheep encephalitis virus SSEV 87/2617 TBF 1987 DQ235152.1

Spondweni virus SPOV SM-6V-1 Mansonia spp.? MB 1955 DQ859064.1

St. Louis encephalitis virus SLEV MSI-7 MB 1975 DQ359217.1

Tamana bat virus TABV Tr127154 Pteronotus parnellii NKV 1973 NC_3996.1

Tick-borne encephalitis virus TBEV-Sib Vasilchenko Homo sapiens TB 1969 AF069066.1

Tick-borne encephalitis virus TBEV-FE Sofjin-HO Homo sapiens TB 1937 AB062064.1

Tick-borne encephalitis virus TBEV-Eu Salem Macaca sylvanus TB 2006 FJ572210.1

Tick-borne encephalitis virus TBEV-FE Oshima-5-10 Dog TB 1995 AB062063.2

Tick-borne encephalitis virus TBEV-Eu Neudoerfl Ixodes ricinus TB 1971 TEU27495

Tick-borne encephalitis virus TBEV-Sib Est54 Ixodes persulcatus TB 2000 GU183384.1

Tick-borne encephalitis virus TBEV 886-84 Clethrionomys rufocanus TB 1984 EF469662.1

Tick-borne encephalitis virus TBEV 178-79 Ixodes persulcatus TB 1979 EF469661.1

Tembusu virus TMUV ZJ GH-2 Goose MB 2010 JQ314465.1

Turkish sheep encephalitis virus TSEV TTE80 TB 1969 DQ235151.1

Tyuleniy virus TYUV 6017 TB 1971 DQ235148.1

Uganda S virus UGSV ORIGINAL Aedes spp. MB 1947 DQ859065.1

Usutu virus USUV SAAR-1776 Culex neavei MB 1958 AY453412.1

Orig

inof

theg

enusF

lavivirus

http://vir.sgm

journals.org1

97

3

compared with the more narrow Beringian calibration(Table S1).

The use of the Yang96 model with tip dates as calibrationrecovered older ages and broader intervals (95 % HPD)compared with using the SRD06 model (Table S1).However, both SRD06 and Yang96 model analyses underBeringian calibration recovered similar ranges for the 95 %HPD intervals (Table S1). The difference in estimates seenbetween SRD06, allowing for the third codon position tovary independently of the linked first and second codonpositions, and Yang96, allowing for all three codonpositions to vary, is an indication that divergence timeestimates become biased towards the present, with highersubstitution rates per nucleotide (Worobey et al., 2010).

The differences seen between the calibration schemes areexplained by the fact that estimating the tMRCA for entirevirus families or genera requires a broad sampling schemein order to cover as much of the variation as possible fromthe group in question. However, the wider the sampling, thehigher the variation in substitution rates among lineages.This will consequently lead to an increasing departure froma constant molecular clock (Bromham & Penny, 2003; Duffyet al., 2008; Holmes, 2003a). A high variation in substitutionrates will lead to uncertainty in the reconstructed divergencetimes unless calibration points restrict the node ages.Therefore, using tip dates alone results in a high level ofuncertainty for the deeper nodes and thus internalcalibration becomes crucial (Ho & Phillips, 2009). As theBeringian calibration constraint (15 000–11 000 years) isnarrower than the TBF calibration constraint (16 100–42 300years; Heinze et al., 2012), the resulting 95 % HPD intervalsfrom the Beringian analysis are also narrower.

Our study shows how incorporation of internal calibrationin large-scale virus phylogenies can help reconstruct moreprecise divergences times, independent of the substitutionmodel, given a robust and reliable calibration by narrowingdown the 95 % HPD intervals. Furthermore, as the meantMRCA estimates are generally congruent between thecalibration schemes applied (Table S1), we will hereafteronly report and discuss the results from the combined tipdates and the internal node calibration based on the Beringian-POWV biogeographical event, i.e. the Beringian calibration,with the most narrow 95 % HPD intervals, unless otherwisestated. This is because the incorporation of biogeographicalinformation is essential to date virus origins, especially forancient events (Katzourakis et al., 2009; Wertheim &Kosakovsky Pond, 2011), as is the case with Flavivirus.

Divergence times of the genus Flavivirus:congruences and incongruences

The genus Flavivirus is defined as sensu lato (node B) orsensu stricto (node C). Our analysis, inferred from completecoding nucleotide genomes, indicated that Flavivirus sensulato originated ~119 800 (87 100–158 900) years ago ifTABV is to be considered a part of the genus, or ~84 700T

ab

le1.

cont

.

Vir

us

Vir

us

abb

revi

atio

nS

trai

nS

ou

rce

of

iso

late

Vec

tor

gro

up

Yea

rG

enB

ank

acce

ssio

nn

o.

Wes

sels

bro

nvi

rus

WE

SS

VS

AH

-17

7-9

98

71

-2M

erin

osh

eep

MB

19

55

DQ

85

90

58

.1

Wes

tN

ile

viru

sW

NV

NY

99

-fla

min

go3

82

-99

Ph

oen

icop

teru

sch

ilen

sis

MB

19

99

AF

19

68

35

.2

Wes

tN

ile

viru

sW

NV

B9

56

Hom

osa

pie

ns

MB

19

37

NC

_1

56

3.2

Wes

tN

ile

viru

sW

NV

Rab

ensb

urg

-97

-10

3C

ule

xp

ipie

ns

MB

19

97

AY

76

52

64

.1

Yao

un

de

viru

sY

AO

VD

akA

r-Y

27

6C

ule

xn

ebu

losi

sM

B1

96

8E

U0

82

19

9.1

Yel

low

feve

rvi

rus

YF

VT

rin

idad

-79

A-7

88

37

9H

aem

ago

gus

speg

azz

ini

MB

19

79

AF

09

46

12

.1

Yo

ko

sevi

rus

YO

KV

Oit

a-3

6M

inio

pte

rus

fuli

gin

osu

sN

KV

19

71

AB

11

48

58

.1

Zik

avi

rus

ZIK

VM

R-7

66

Rh

esu

sm

on

key

MB

19

47

DQ

85

90

59

.1

J. H.-O Pettersson and O. Fiz-Palacios

1974 Journal of General Virology 95

(63 700–109 600) years ago if TABV is excluded, i.e.Flavivirus sensu stricto. Either way, this first moleculardating for the whole genus indicates a significantly olderage than the 10 000 years that was previously suggested forFlavivirus sensu stricto (Gould et al., 2001).

Our results also contrast the 3 000 years age of the MBFdom

group (node F), including MBFs, ISFs and NKVFs,estimated in Zanotto et al. (1996). In the present analysis,the MBFdom group (node F) was instead suggested to haveemerged .41 500 (32 600–51 600) years ago, with asubsequent diversification of the MBFAedes clade (nodeH) and the MBFCulex clade (node K) clade ~25 900 (19 800–32 000) and ~26 900 (21 000–33 400) years ago, respect-ively. Previously, the ISFa clade (node E) was indicated tohave emerged between 3500 and 350 000 years ago (Crochuet al., 2004). The combination of tip date calibration andinternal node calibration allows us to more preciselypinpoint the origin of the ISFa clade to have occurred~40 700 (30 800–52 300) years ago, i.e. ~44 000 years afterthe split from the last common ancestor of Flavivirus sensustricto (node C) that emerged ~84 700 (63 700–110 000)years ago. We also showed the first proposed dating of theNKVFa clade (node N), here estimated to have emerged~24 100 (18 000–30 900) years ago, ~15 000 years after thesplit from the last common ancestor of TBFs and NKVFs(node L).

The sharp contrast between the estimates in the presentstudy and previous studies can possibly be explainedbecause evolutionary rates in our study are inferred from agenus-wide sampled dataset using a codon-based substi-tution model as compared with a nucleotide-basedsubstitution model used in many other studies. It is likelythat nucleotide-based substitution models will not be ableto account for variation in selective pressure throughoutevolutionary history, where the effect of purifying selectionis likely to cause underestimation of the actual ages(Wertheim & Kosakovsky Pond, 2011). Using the SLAC, FEL,IFEL, FUBAR and MEME methods available within the HyPhypackage (Pond et al., 2005) and at the Datamonkeywebserver (Delport et al., 2010), there were none-to-weaksigns of positive selection, but strong signs of purifyingselection in the alignment used in the present study (resultsavailable upon request). Signs of strong purifying selectionwithin Flavivirus are in concordance with what has beenfound previously for members of the genus Flavivirus andfor vector-borne RNA viruses in general (Holmes, 2003b;Pybus et al., 2007; Woelk & Holmes, 2002). A codon-basedsubstitution model can to some extent compensate forpurifying selection, producing older tMRCA estimatesthan nucleotide-based substitution models (Wertheim &Kosakovsky Pond, 2011). However, and more importantly,to accurately date deep RNA virus origins, the use of othersources of evidence (such as biogeography) will eventuallybecome necessary as even codon-based models cannotaccount for the complex interactions of events and factorsthat have occurred throughout evolution (Katzourakis &Gifford, 2010; Katzourakis et al., 2009).

Our study is supported by the fact that both Beringiancalibration and only tip date calibration give similar meantMRCA estimates, although only tip date calibration givesbroader 95 % HPD intervals. In contrast to several studies(Sharp & Simmonds, 2011), we have not found any conflictbetween internal node and tip date calibrations. Our resultsare also congruent with several other studies (Table S3).The estimated tMRCA for the TBFs clade [node M, at27 500 (21 700–34 200) years ago] is supported by previousestimates for the whole clade (Heinze et al., 2012) (28 600years ago) and for the lineages within (Bertrand et al., 2012;Uzcategui et al., 2012) (Table S3). For the split of thePOWV clade and its sister (node O), our tMRCA estimatesof 14 800 years ago from tip date analysis also matchbroadly with previous molecular clock studies (12 300 yearsago; Heinze et al., 2012) and support the use of theBeringian biogeographical calibration.

Evolutionary rates of the genus Flavivirus andmolecular clocks in RNA viruses

Rates of nucleotide substitution were estimated from theBeringian calibrated BEAST analyses for the entire ORF ofthe genome (Table 3). Rates for the Beringian, TBF and tipdate calibration are summarized in Table S2.

The genomic substitution rate of positive-sense ssRNAviruses varies between 1022 and 1025 substitutions site–1

year21 (Hanada et al., 2004; Jenkins et al., 2002; Sanjuan,2012). Even though the estimated mean rates in the presentstudy for Flavivirus sensu lato (including TABV; node B),461025 (261025 to 661025) substitutions site–1 year21,are within range of those estimated for positive-sensessRNA viruses, they are at least an order of magnitudeslower than previous estimates for different lineages withinFlavivirus, i.e. 161023 and 261024 substitutions site–1

year21 as found for DENV-4 and St. Louis encephalitisvirus, respectively (Sanjuan, 2012 and references therein).Our results are not very surprising given that ratedifferences of up to 125 times have been shown for simianimmunodeficiency virus as compared with its sister, HIV(Worobey et al., 2010).

Furthermore, we did not find that the MBFdom group(node F) evolved at a significantly faster rate than the TBFclade (node M), as reported previously (Zanotto et al.,1995, 1996). Instead, our study showed that the MBFdom

group (node F) evolved at a mean rate of 3.261025

(1.761025 to 5.061025) substitutions site–1 year–1 andthat the TBF clade evolved at a mean rate of 3.961025

(2.361025 to 5.761025) substitutions site–1 year–1 (Table3). Therefore, the implications for Flavivirus evolutionderived from Zanotto et al. (1995, 1996) need to be revisedand reconsidered. The fastest rate of nucleotide substi-tution was found for the ISFa clade (node E), with a meanrate of 6.261025 (4.061025 to 8.661025) substitutionssite–1 year–1. Their fast rate of evolution could perhaps beexplained by their vertical mode of cycling transmission incombination with their specificity to insects with relatively

Origin of the genus Flavivirus

http://vir.sgmjournals.org 1975

TBF calibration

(16 100–44 929 years)

Beringian calibration

(11 000–15 000 years)

DENV-3_1956_M93130.1

JEV_1982_M18370.1

KUNV_1960_D00246.1

MMLV_1958_AJ299445.1

RBV_1954_AF144692.1

KOUV_1968_EU082200.1

UGSV_1947_DQ859065.1

YAOV_1968_EU082199.1

TBEV-Eu_1971_TEU27495

NTAV_1966_JX236040.1

HANKV_2005_JQ268258.1

GGEV_1969_DQ235153.1

DENV-1_1974_U88536.1

NOUV_2004_FJ711167.1

TBEV-FE_1995_AB062063.2

DGV_2009_NC_016997.1

DENV-4_1956_M14931.2

ZIKV_1947_DQ859059.1

BVDV-1_1963_NC_001461.1

MVEV_1956_AF161266.1

POTV_1989_DQ859067.1

SEPV_1966_DQ859063.1

NAKV_2008_GQ165809.2

CFAV_1975_NC_001564.1

SABV_1968_DQ859062.1

ENTV_1957_DQ837641.1

DTV_1995_NC_003218.1

EHV_1969_DQ859060.1

OHFV_1947_AY438626.1

QBV_2002_NC_012671.1

OHFV_1947_AB507800.1

LGTV_1956_AF253419.1

DENV-2_1944_AF038403.1

APOIV_1954_AF160193.1

WNV_1997_AY765264.1

KEDV_1972_AY632540.2

PCV_2010_KC505248.1

SSEV_1987_DQ235152.1

TSEV_1969_DQ235151.1

TBEV-Sib_1969_AF069066.1

MEAV_1981_DQ235144.1

KADV_1967_DQ235146.1

BAGV_2010_HQ644143.1

MODV_1958_AJ242984.1

ROCV_1975_AY632542.4

BAGV_1966_AY632545.2

KFDV_1957_JF416959.1

JUGV_1968_DQ859066.1

YOKV_1971_AB114858.1

NMV_1998_KC788512.1

LAMV_2004_FJ606789.1

AEFV_2011_KC181923.1

TBEV-Sib_2000_GU183384.1

RFV_1968_DQ235149.1

STWV_2000_JX477686.1

WESSV_1955_DQ859058.1

AHFV_1995_AF331718.1

SREV_1974_DQ235150.1

GGYV_1975_DQ235145.1

TBEV_1984_EF469662.1

IGUV_1979_AY632538.4

KRV_1999_NC_005064.1

SLEV_1975_DQ359217.1

SPOV_1955_DQ859064.1

YFV_1979_AF094612.1

BSQV_1956_NC_009026.2

TABV_1973_NC_003996.1

TMUV_2010_JQ314465.1

BYDV_2010_JF312912.1

KOKV_1960_NC_009029.2

ILHV_1944_AY632539.4

BOUV_1967_DQ859057.1

TBEV-Eu_2006_FJ572210.1

TBEV_1979_EF469661.1

TBEV-FE_1937_AB062064.1

USUV_1958_AY453412.1

WNV_1937_NC_001563.2

ALFV_1966_AY898809.1

CHAOV_2003_JQ068102.1

TYUV_1971_DQ235148.1

LIV_1963_NC_001809.1

BANV_1956_DQ859056.1

CxFV_2003_AB262759.2

POWV_1958_L06436.1

KSIV_1972_NC_006947.1

WNV_1999_AF196835.2

A

C

B

D

E

F

G

I

L

M

N

O

J

K

H

TBF

MBF

MBF

ISF

ISF

NKVF

NKVF

NKVF

ISF

<10 500 years

glacier

13 000–11 000 years

land bridgeglacier

16 000–14 000 years

glacier

land bridge

25 000–18 000 years

glacier

land bridge

050100150200322 Time (×103 years)

POWV migration pathPOWV migration path

Fig. 1. Chronogram based on tip date calibrated terminals and internal nodes using Beringian calibration (see text for details).All named nodes have maximum posterior probability support (1.0 pp). Terminal tips are named by virus name (abbreviations,

J. H.-O Pettersson and O. Fiz-Palacios

1976 Journal of General Virology 95

short generation times (Bolling et al., 2012; Lutomiah et al.,2007).

Estimates of substitution rates are often found to be muchhigher for tips than for deep nodes (Wertheim &Kosakovsky Pond, 2011). From our perspective, a highlyvariable non-constant molecular clock with pulses of veryhigh substitution rates for short periods of time may reflectthe difference of orders of magnitude seen when comparingevolutionary rates in short and recent times (e.g. 161023

for DENV-4 in Klungthong et al., 2004) with long and deeptimes (e.g. 461025 for Flaviviruses sensu lato in this study).

Beringia and the emergence of POWV in NorthAmerica

The Beringian land bridge was open for land migrationbetween 15 000 and 11 000 years ago until it was floodedand the Bering Strait was formed (Dixon, 2013; Elias, 2001;Elias et al., 1996; Kelly, 2003), effectively blockingmigration between Asia and North America for animalsother than humans (Fig. 1). Our molecular reconstructionsusing only tip dates are congruent with POWV beingintroduced into North America when the Beringian landbridge was accessible and supports the use of tip dates incombination with an internal Beringian calibration (Fig. 1,Table 2).

How then did POWV enter North America? Althoughhumans might have been responsible for carrying POWV,

as humans do get infected with POWV, perhaps it is morelikely that ticks and associated tick hosts jointly broughtPOWV into North America. Human infections withPOWV in present times are infrequent (Ebel, 2010),although numbers appear to be on the increase (Hintenet al., 2008). Between 15 000 and 11 000 years ago, themigrating human population who settled the Americas hadan effective population size of perhaps ,80 individuals(Hey, 2005). This, in combination with the fact thatPOWV is considered to be maintained solely in an enzooticcycle between ixodid ticks and their vertebrate hosts (Ebel,2010; Ebel et al., 2000), is a strong argument againsthumans being the likely vectors by which POWV firstentered North America. Nor is it likely that POWV enteredthe Americas by the coastal route. Even though the coastalareas did support marine mammals and had terrestrialmammals living in refugia (Heaton & Grady, 2003), thedeglaciated coastal areas had sparse vegetation that did notsupport large populations of terrestrial mammals (Dixon,2013). Also, animals were isolated and movement wasinhibited due to the glaciers, and the coastal route was, ingeneral, not considered to be a migratory route formammals other than humans (Dixon, 2013).

In view of the results from the present study, the mostlikely time period of introduction was after the deglaciationcorridor became habitable ~13 500 years ago (Catto, 1996;Dixon, 2013) and before sea levels rose by 40 m and theBering strait reached its present width before 10 500 yearsago (Elias et al., 1996). During this period, populations of

see Table 1), year of isolation and GenBank accession number. Development of the Beringia region between the last glacialmaximum until the Bering Strait was formed (25 000–10 500 years ago). Beringia development after Elias et al. (1996) andDixon (2013).

Table 2. Mean estimated tMRCA (�103 years) and 95 % HPD interval: results for the Beringian, TBF and tip date calibration analyses

Node Beringian (11 000–15 000 years) TBF (16 100–42 300 years) Tip dates only

tMRCA 95 % HPD tMRCA 95 % HPD tMRCA 95 % HPD

A: Root 230.5 156.1–322.7 219.4 110.6–387.6 265.0 25.6–2.768.1

B: TABV/ISF–MBF–TBF–NKVF 119.8 87.1–158.9 114.0 62.2–194.6 138.7 12.9–1.456.1

C: ISFa/MBFdom–TBF–NKVFa 84.7 63.7–109.6 80.6 45.0–135.1 98.4 9.8–1.024.7

D: MBFdom/TBF–NKVFa 47.2 37.3–58.7 44.9 26.4–73.9 54.0 5.5–560.2

E: ISFa 40.7 30.8–52.3 38.5 21.7–64.6 46.8 4.6–486.7

F: MBFdom 41.5 32.6–51.6 39.4 23.1–64.9 47.4 4.9–492.7

G: NKVFb/MBFAedes 33.2 25.7–42.0 31.5 18.2–52.3 37.9 4.0–390.7

H: MBFAedes 25.9 19.8–33.0 24.6 14.0–41.0 29.5 2.9–304.1

I: ISFb/NOUV–MBFCulex 33.8 26.6–42.1 32.1 18.7–52.9 38.7 3.8–401.9

J: NOUV/MBFCulex 31.0 24.3–38.6 29.5 17.2–48.7 35.5 3.8–369.4

K: MBFCulex 26.9 21.0–3343 25.5 14.8–42.1 30.7 3.3–320.0

L: TBF/NKVFa 39.3 30.7–49.2 37.5 21.9–61.4 44.9 4.2–468.5

M: TBF 27.5 21.7–34.2 26.2 16.1–42.3 31.5 3.1–329.4

N: NKVFa 24.1 18.0–30.9 22.9 12.8–38.4 27.5 2.9–286.6

O: POWV/sister clade of TBF 12.8 11.0–14.8 12.2 7.0–20.0 14.8 1.5–152.3

Origin of the genus Flavivirus

http://vir.sgmjournals.org 1977

ixodid ticks and land mammals (including humans) couldhave migrated from eastern Beringia through the thenhabitable deglaciated corridor to the south of the glaciers(Dixon, 2013; Goebel & Buvit, 2011; Shapiro et al., 2004).Ticks and tick hosts maintaining POWV might initiallyhave been restricted to eastern Beringia, north-west of theglaciers. As the deglaciated corridor became habitable andanimals could migrate southwards, and after the BeringStrait was formed preventing land migration back to Asia,POWV became established in North America. Using aninternal calibration of 15 000–11 000 or 16 000–10 000 yearsfor the split of POWV from closely related TBFs givessimilar tMRCA dates for POWV crossing into the Americas(12 800 and 12 500 years ago, respectively), which is inagreement with dates of the opening and closing of theBeringian land bridge.

Did flaviviruses and humans spread throughoutthe globe together?

Viruses, and RNA viruses in particular, are among the mostsuccessful evolving entities on the planet, having colonizedand adapted to an array of different environments andmodes of transmission within all domains of life (Wasik &Turner, 2012). The most significant event that might havehelped shape the distribution of flaviviruses is perhaps thespread of modern humans out of Africa. Modern humansarose in Africa (Cavalli-Sforza & Feldman, 2003; McDougallet al., 2005; Stringer & Andrews, 1988), and began migratingto other continents between 80 000 and 40 000 years ago,having populated all continents except Antarctica by 10 000years ago (Blome et al., 2012; Henn et al., 2012; Macaulayet al., 2005; Rasmussen et al., 2011). The migration ofhumans out of Africa has been accompanied by several otherpathogens. The pathogenic bacterium Helicobacter pyloriappears to have accompanied human migration out of Africaand is estimated to have spread from east Africa ~58 000

years ago (Falush et al., 2003; Linz et al., 2007; Moodley et al.,2012). Likewise, the Mycobacterium tuberculosis complex isestimated to have started to diverge ~70 000 years ago,consistent with an African origin linked to human expansionand migration (Comas et al., 2013; Wirth et al., 2008).Similarly, several human pathogenic viruses also showpatterns of co-divergence associated with human diversifica-tion (Holmes, 2004). Therefore, it is perhaps not unlikelythat the spread and evolution of several Flavivirus strains,especially if much of the Flavivirus diversity originated inAfrica (Gould et al., 2001, 2003), was influenced by humanmigration and expansion across the globe.

METHODS

Virus genomes, sequence alignments and models of molecular

evolution. Complete coding genomes of the genus Flavivirus,excluding the untranslated 59 and 39 flanking regions, were retrievedfrom GenBank (Table 1). Genomes were selected based on availablebackground information on the year of virus strain isolation (isolatedbetween 1937 and 2011). BVDV-1 of the genus Pestivirus within thefamily Flaviviridae was included as an outgroup based on a previouslypublished Flavivirus phylogeny (Cook et al., 2012). To construct arobust alignment without frameshifts, all sequences were translated toamino acids in SeaView 4.4.2 (Gouy et al., 2010), aligned with MAFFT

7.037b using the default G-INS-i algorithm and parameters (Katoh &Standley, 2013), and then back-translated into nucleotides. Theresulting alignments were inspected visually and edited with AliView0.8 (http://ormbunkar.se/aliview). To estimate the best-fitting modelof evolution for the sequence alignments, model tests were performedfor the nucleotide alignment using jModelTest 2.1.3 (Darriba et al.,2012) and for the amino acid alignment using ProtTest 3.2.1 (Darribaet al., 2011). To test the null hypothesis of a strict molecular clock, amaximum-likelihood clock test was performed using MEGA 5.22(Tamura et al., 2011). The best-fitting models were then specified inthe respective analyses.

Phylogenetic analysis: maximum-likelihood and Bayesian

analysis. Maximum-likelihood trees were estimated using RAxML7.6.3 (Stamatakis, 2006) with 1000 rapid bootstraps under the GTR+I+G model for the nucleotide alignment and the WAG+I+G modelfor the amino acid alignment, respectively. Bayesian phylogenetic treeswere inferred using MrBayes 3.2.1 (Ronquist et al., 2012) by executingtwo parallel runs with four Metropolis-coupled chains for 20 million and9 million Markov chain Monte Carlo (MCMC) generations, usingGTR+I+G for nucleotides and WAG+I+G for amino acids as modelof evolution, respectively, sampling every 1000 generations and run withdefault priors, discarding the first 25 % as burn-in.

Estimating the tMRCA: Bayesian analysis with BEAST. To explorethe temporal scale of the entire genus Flavivirus, evolutionary ratesand the tMRCA were estimated using a Bayesian MCMC method asimplemented in BEAST 1.7.5 (Drummond et al., 2012). All BEAST runswere performed with calibrated tip dates where the sequence with themost recent sampling date (2011) was set to represent the present.The SRD06 codon-based partition model (Shapiro et al., 2006) andHKY85+G nucleotide substitution model were used together with anon-parametric Bayesian skyline population coalescent tree priorwith a piecewise-constant skyline demographic model (Drummondet al., 2005), along with a log-normal uncorrelated relaxed molecularclock. The effect of using the Yang96 codon model (Yang, 1996) wasalso explored in a separate analysis. All analyses were run for 100million generations in triplicate, sampling every 1000 generations toensure mixing of chains and that a sufficiently effective sample size

Table 3. Mean estimated evolutionary rates (�10”5 substitu-tions site–1 year–1) and 95 % HPD intervals for the Beringiancalibration (11 000–15 000 years)

Node Mean rate 95 % HPD

B: TABV/ISF–MBF–TBF–NKVF 3.96 2.04–6.14

C: ISFa/MBFdom–TBF–NKVF 3.60 1.81–5.57

D: MBFdom/TBF–NKVFa 2.56 1.57–3.70

E: ISFa 6.24 4.00–8.65

F: MBFdom 3.24 1.73–5.00

G: NKVFb/MBFAedes 2.92 1.63–4.43

H: MBFAedes 3.11 1.70–4.72

I: ISFb/NOUV–MBFCulex 4.02 2.28–6.03

J: NOUV/MBFCulex 3.95 2.12–6.06

K: MBFCulex 4.19 2.30–6.37

L: TBF/NKVFa 4.27 2.39–6.38

M: TBF 3.89 2.32–5.73

N: NKVFa 5.18 2.32–6.15

O: POWV/sister clade of TBF 4.30 3.31–7.23

J. H.-O Pettersson and O. Fiz-Palacios

1978 Journal of General Virology 95

(.200) was reached. Convergence of chains and effective sample size

statistics were analysed with Tracer 1.5 (http://beast.bio.ed.ac.uk/

Tracer). Log and tree files were combined in LogCombiner (BEAST

package) to produce consensus files of the different runs. Finally,

maximum clade credibility trees of the three different analyses were

produced using TreeAnnotator (BEAST package). Chronograms were

viewed and annotated in FigTree 1.3.1 (http://tree.bio.ed.ac.uk/software/

figtree). Computations were performed at the Bioportal webportal at the

University of Oslo (www.bioportal.uio.no), the CIPRES webportal

(Miller et al., 2010) and at the Uppsala Multidisciplinary Center for

Advanced Computational Science (www.uppmax.uu.se).

Calibration schemes. In BEAST, three calibration schemes were

applied. (i) The first analysis was run with default uniform priors

using tip dates only (i.e. uncalibrated internal nodes) allowing the

MCMC chains to freely explore the treespace and the node height. (ii)

The second analysis was run using tip dates together with secondary

calibration data for the TBF node (split of mammalian TBFs from

seabird TBFs), previously estimated to have originated between 44 929

and 16 100 years ago (Heinze et al., 2012), hereafter referred to as the

TBF calibration. These dates were incorporated as upper and lower

bounds using a uniform distribution. (iii) The third analysis was run

using tip dates together with an internal node calibration based on the

biogeographical event for the split of the POWV and closely related

TBFs during the existence of the Beringian land bridge, estimated to

have been open for mammal land migration between 15 000 and

11 000 years ago (see Introduction). Here, the ages of the opening and

closing times of the Beringian land bridge were used to specify a

maximum bound (15 000 years) and minimum bound (11 000 years)

for the age of this split, given that the POWV clade is the only North

American TBF and that all other TBFs are either African, Eurasian or

Oceanian (Heinze et al., 2012), hereafter referred to as the Beringian

calibration. In addition to the Beringian calibration, we also tested the

effect of allowing for a wider range of the opening and closing of the

Beringia land bridge (Dixon, 2013). Here, the age for the split

between POWV and closely related TBFs was set with an internal

calibration to an upper bound of 16 000 years ago and a lower bound

of 10 000 years ago.

ACKNOWLEDGEMENTS

We are grateful to Allison Perrigo, Martin Ryberg and Petra Korall for

their constructive comments and feedback on the manuscript. We

acknowledge Allison Perrigo and Stina Weststrand for their help with

the design of the figures. We are also grateful to James E. Dixon for

valuable information and discussions on Beringian biogeography.

Finally, we wish to acknowledge all the hard work by people all over

the world making the sequences used in the present study available to

everyone. J. H.-O. P. was supported by Stiftelsen Olle Engkvist

Byggmastare.

REFERENCES

Achilli, A., Perego, U. A., Lancioni, H., Olivieri, A., Gandini, F.,Hooshiar Kashani, B., Battaglia, V., Grugni, V., Angerhofer, N. &other authors (2013). Reconciling migration models to the Americas

with the variation of North American native mitogenomes. Proc Natl

Acad Sci U S A 110, 14308–14313.

Bertrand, Y., Topel, M., Elvang, A., Melik, W. & Johansson, M. (2012).First dating of a recombination event in mammalian tick-borne

flaviviruses. PLoS ONE 7, e31981.

Billoir, F., de Chesse, R., Tolou, H., de Micco, P., Gould, E. A. & deLamballerie, X. (2000). Phylogeny of the genus flavivirus using

complete coding sequences of arthropod-borne viruses and viruses

with no known vector. J Gen Virol 81, 781–790.

Blome, M. W., Cohen, A. S., Tryon, C. A., Brooks, A. S. & Russell, J.(2012). The environmental context for the origins of modern humandiversity: a synthesis of regional variability in African climate

150,000–30,000 years ago. J Hum Evol 62, 563–592.

Bodner, M., Perego, U. A., Huber, G., Fendt, L., Rock, A. W.,Zimmermann, B., Olivieri, A., Gomez-Carballa, A., Lancioni, H. &other authors (2012). Rapid coastal spread of First Americans: novel

insights from South America’s Southern Cone mitochondrialgenomes. Genome Res 22, 811–820.

Bolling, B. G., Olea-Popelka, F. J., Eisen, L., Moore, C. G. & Blair,C. D. (2012). Transmission dynamics of an insect-specific flavivirus in a

naturally infected Culex pipiens laboratory colony and effects of co-

infection on vector competence for West Nile virus. Virology 427, 90–97.

Bromham, L. & Penny, D. (2003). The modern molecular clock. Nat

Rev Genet 4, 216–224.

Catto, N. R. (1996). Richardson Mountains, Yukon-Northwest

territories: the northern portal of the postulated ‘Ice-Free Corridor’.

Quartern Int 32, 3–19.

Cavalli-Sforza, L. L. & Feldman, M. W. (2003). The application of

molecular genetic approaches to the study of human evolution. Nat

Genet 33 (Suppl), 266–275.

Chambers, T. J., Hahn, C. S., Galler, R. & Rice, C. M. (1990). Flavivirus

genome organization, expression, and replication. Annu Rev Microbiol

44, 649–688.

Comas, I., Coscolla, M., Luo, T., Borrell, S., Holt, K. E., Kato-Maeda,M., Parkhill, J., Malla, B., Berg, S. & other authors (2013). Out-of-Africa migration and Neolithic coexpansion of Mycobacterium

tuberculosis with modern humans. Nat Genet 45, 1176–1182.

Cook, S. & Holmes, E. C. (2006). A multigene analysis of thephylogenetic relationships among the flaviviruses (Family: Flavi-

viridae) and the evolution of vector transmission. Arch Virol 151,

309–325.

Cook, S., Moureau, G., Kitchen, A., Gould, E. A., de Lamballerie, X.,Holmes, E. C. & Harbach, R. E. (2012). Molecular evolution of theinsect-specific flaviviruses. J Gen Virol 93, 223–234.

Crochu, S., Cook, S., Attoui, H., Charrel, R. N., De Chesse, R.,Belhouchet, M., Lemasson, J. J., de Micco, P. & de Lamballerie, X.(2004). Sequences of flavivirus-related RNA viruses persist in DNA

form integrated in the genome of Aedes spp. mosquitoes. J Gen Virol

85, 1971–1980.

Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. (2011). ProtTest

3: fast selection of best-fit models of protein evolution. Bioinformatics27, 1164–1165.

Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. (2012).jModelTest 2: more models, new heuristics and parallel computing.Nat Methods 9, 772.

de Lamballerie, X., Crochu, S., Billoir, F., Neyts, J., de Micco, P.,Holmes, E. C. & Gould, E. A. (2002). Genome sequence analysis of

Tamana bat virus and its relationship with the genus Flavivirus. J Gen

Virol 83, 2443–2454.

Delport, W., Poon, A. F. Y., Frost, S. D. W. & Kosakovsky Pond, S. L.(2010). Datamonkey 2010: a suite of phylogenetic analysis tools for

evolutionary biology. Bioinformatics 26, 2455–2457.

Dixon, E. J. (2013). Late Pleistocene colonization of North America

from Northeast Asia: new insights from large-scale paleogeographicreconstructions. Quartern Int 285, 57–67.

Drummond, A. J., Pybus, O. G., Rambaut, A., Forsberg, R. & Rodrigo,A. G. (2003). Measurably evolving populations. Trends Ecol Evol 18,481–488.

Origin of the genus Flavivirus

http://vir.sgmjournals.org 1979

Drummond, A. J., Rambaut, A., Shapiro, B. & Pybus, O. G. (2005).Bayesian coalescent inference of past population dynamics frommolecular sequences. Mol Biol Evol 22, 1185–1192.

Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. (2012).Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol29, 1969–1973.

Duffy, S., Shackelton, L. A. & Holmes, E. C. (2008). Rates ofevolutionary change in viruses: patterns and determinants. Nat RevGenet 9, 267–276.

Dunham, E. J. & Holmes, E. C. (2007). Inferring the timescale ofdengue virus evolution under realistic models of DNA substitution.J Mol Evol 64, 656–661.

Dyke, A. S. (2004). An outline of the deglaciation of North Americawith emphasis on central and northern Canada. In QuaternaryGlaciations, Extent and Chronology Part II: North America, pp. 373–424. Edited by J. Ehlers & P. L. Gibbard. Amsterdam: Elsevier.

Ebel, G. D. (2010). Update on Powassan virus: emergence of aNorth American tick-borne flavivirus. Annu Rev Entomol 55, 95–110.

Ebel, G. D., Campbell, E. N., Goethert, H. K., Spielman, A. & Telford,S. R., III (2000). Enzootic transmission of deer tick virus in NewEngland and Wisconsin sites. Am J Trop Med Hyg 63, 36–42.

Elias, S. A. (2001). Beringian paleoecology: results from the 1997workshop. Quatern Sci Rev 20, 7–13.

Elias, S. A., Short, S. K., Nelson, C. H. & Birks, H. H. (1996). Life andtimes of the Bering land bridge. Nature 382, 60–63.

Fagundes, N. J. R., Kanitz, R., Eckert, R., Valls, A. C. S., Bogo, M. R.,Salzano, F. M., Smith, D. G., Silva, W. A., Jr, Zago, M. A. & otherauthors (2008). Mitochondrial population genomics supports asingle pre-Clovis origin with a coastal route for the peopling of theAmericas. Am J Hum Genet 82, 583–592.

Falush, D., Wirth, T., Linz, B., Pritchard, J. K., Stephens, M., Kidd, M.,Blaser, M. J., Graham, D. Y., Vacher, S. & other authors (2003).Traces of human migrations in Helicobacter pylori populations.Science 299, 1582–1585.

Gaunt, M. W., Sall, A. A., de Lamballerie, X., Falconar, A. K.,Dzhivanian, T. I. & Gould, E. A. (2001). Phylogenetic relationships offlaviviruses correlate with their epidemiology, disease association andbiogeography. J Gen Virol 82, 1867–1876.

Goebel, T. & Buvit, I. (editors) (2011). From the Yenisei to the Yukon:Interpreting Lithic Assemblage Variability in Late Pleistocene/EarlyHolocene Beringia. College Station, TX: Texas A&M University Press.

Goebel, T., Waters, M. R. & O’Rourke, D. H. (2008). The latePleistocene dispersal of modern humans in the Americas. Science 319,1497–1502.

Gould, E. A. & Solomon, T. (2008). Pathogenic flaviviruses. Lancet371, 500–509.

Gould, E. A., de Lamballerie, X., Zanotto, P. M. & Holmes, E. C.(2001). Evolution, epidemiology, and dispersal of flaviviruses revealedby molecular phylogenies. Adv Virus Res 57, 71–103.

Gould, E. A., de Lamballerie, X., Zanotto, P. M. & Holmes, E. C.(2003). Origins, evolution, and vector/host coadaptations within thegenus Flavivirus. Adv Virus Res 59, 277–314.

Gouy, M., Guindon, S. & Gascuel, O. (2010). SeaView version 4: Amultiplatform graphical user interface for sequence alignment andphylogenetic tree building. Mol Biol Evol 27, 221–224.

Grard, G., Moureau, G., Charrel, R. N., Lemasson, J.-J., Gonzalez,J.-P., Gallian, P., Gritsun, T. S., Holmes, E. C., Gould, E. A. & deLamballerie, X. (2007). Genetic characterization of tick-borneflaviviruses: new insights into evolution, pathogenetic determinantsand taxonomy. Virology 361, 80–92.

Grard, G., Moureau, G., Charrel, R. N., Holmes, E. C., Gould, E. A. &de Lamballerie, X. (2010). Genomics and evolution of Aedes-borneflaviviruses. J Gen Virol 91, 87–94.

Gritsun, T. S., Nuttall, P. A. & Gould, E. A. (2003). Tick-borneflaviviruses. Adv Virus Res 61, 317–371.

Gubler, D., Kuno, G. & Markoff, L. (2007). Flaviviruses. In FieldsVirology, 5th edn, vol. 1, pp. 1153–1253. Edited by D. Knipe,P. Howley, D. Griffin, R. Lamb, M. Martin, B. Roizman & S. Straus.Philadelphia, PA: Lippincott-Raven.

Hanada, K., Suzuki, Y. & Gojobori, T. (2004). A large variation in therates of synonymous substitution for RNA viruses and its relationshipto a diversity of viral infection and transmission modes. Mol Biol Evol21, 1074–1080.

Heaton, T. H. & Grady, F. (2003). The late Wisconsin vertebrate historyof Prince of Wales Island, Southeast Alaska. In Ice Age Cave Faunas ofNorth America, pp. 17–53. Edited by B. W. Schubert, J. I. Mead &R. W. Graham. Bloomington, IN: Indiana University Press.

Heinze, D. M., Gould, E. A. & Forrester, N. L. (2012). Revisiting theclinal concept of evolution and dispersal for the tick-borne flavivirusesby using phylogenetic and biogeographic analyses. J Virol 86, 8663–8671.

Henn, B. M., Cavalli-Sforza, L. L. & Feldman, M. W. (2012). The greathuman expansion. Proc Natl Acad Sci U S A 109, 17758–17764.

Hey, J. (2005). On the number of New World founders: a populationgenetic portrait of the peopling of the Americas. PLoS Biol 3, e193.

Hinten, S. R., Beckett, G. A., Gensheimer, K. F., Pritchard, E.,Courtney, T. M., Sears, S. D., Woytowicz, J. M., Preston, D. G., Smith,R. P., Jr & other authors (2008). Increased recognition of Powassanencephalitis in the United States, 1999–2005. Vector Borne ZoonoticDis 8, 733–740.

Ho, S. Y. W. & Phillips, M. J. (2009). Accounting for calibrationuncertainty in phylogenetic estimation of evolutionary divergencetimes. Syst Biol 58, 367–380.

Holmes, E. C. (2003a). Molecular clocks and the puzzle of RNA virusorigins. J Virol 77, 3893–3897.

Holmes, E. C. (2003b). Patterns of intra- and interhost nonsynon-ymous variation reveal strong purifying selection in dengue virus.J Virol 77, 11296–11298.

Holmes, E. C. (2004). The phylogeography of human viruses. MolEcol 13, 745–756.

Hoshino, K., Isawa, H., Tsuda, Y., Sawabe, K. & Kobayashi, M.(2009). Isolation and characterization of a new insect flavivirus fromAedes albopictus and Aedes flavopictus mosquitoes in Japan. Virology391, 119–129.

Huhtamo, E., Putkuri, N., Kurkela, S., Manni, T., Vaheri, A., Vapalahti,O. & Uzcategui, N. Y. (2009). Characterization of a novel flavivirusfrom mosquitoes in northern europe that is related to mosquito-borne flaviviruses of the tropics. J Virol 83, 9532–9540.

Hulten, E. (1937). Outline of the History of Arctic and Boreal Biota Duringthe Quaternary Period. Stockholm: Bokforlagsaktiebolaget Thule.

Jenkins, G. M., Rambaut, A., Pybus, O. G. & Holmes, E. C. (2002).Rates of molecular evolution in RNA viruses: a quantitativephylogenetic analysis. J Mol Evol 54, 156–165.

Katoh, K. & Standley, D. M. (2013). MAFFT multiple sequencealignment software version 7: improvements in performance andusability. Mol Biol Evol 30, 772–780.

Katzourakis, A. & Gifford, R. J. (2010). Endogenous viral elements inanimal genomes. PLoS Genet 6, e1001191.

Katzourakis, A., Gifford, R. J., Tristem, M., Gilbert, M. T. P. & Pybus,O. G. (2009). Macroevolution of complex retroviruses. Science 325,1512.

J. H.-O Pettersson and O. Fiz-Palacios

1980 Journal of General Virology 95

Kelly, R. L. (2003). Maybe we do know when people first came to

North America; and what does it mean if we do? Quartern Int 109–110, 133–145.

Klungthong, C., Zhang, C., Mammen, M. P., Jr, Ubol, S. & Holmes,E. C. (2004). The molecular epidemiology of dengue virus serotype 4

in Bangkok, Thailand. Virology 329, 168–179.

Kolodziejek, J., Pachler, K., Bin, H., Mendelson, E., Shulman, L.,Orshan, L. & Nowotny, N. (2013). Barkedji virus, a novel mosquito-

borne flavivirus identified in Culex perexiguus mosquitoes, Israel,

2011. J Gen Virol 94, 2449–2457.

Korber, B., Muldoon, M., Theiler, J., Gao, F., Gupta, R., Lapedes, A.,Hahn, B. H., Wolinsky, S. & Bhattacharya, T. (2000). Timing theancestor of the HIV-1 pandemic strains. Science 288, 1789–1796.

Kumar, S. R. P., Patil, J. A., Cecilia, D., Cherian, S. S., Barde, P. V.,Walimbe, A. M., Yadav, P. D., Yergolkar, P. N., Shah, P. S. & otherauthors (2010). Evolution, dispersal and replacement of American

genotype dengue type 2 viruses in India (1956–2005): selection

pressure and molecular clock analyses. J Gen Virol 91, 707–720.

Kuno, G. (2007). Host range specificity of flaviviruses: correlation

with in vitro replication. J Med Entomol 44, 93–101.

Kuno, G. & Chang, G.-J. J. (2006). Characterization of Sepik and

Entebbe bat viruses closely related to yellow fever virus. Am J Trop

Med Hyg 75, 1165–1170.

Lee, J. S., Grubaugh, N. D., Kondig, J. P., Turell, M. J., Kim, H.-C.,Klein, T. A. & O’Guinn, M. L. (2013). Isolation and genomic

characterization of Chaoyang virus strain ROK144 from Aedes vexansnipponii from the Republic of Korea. Virology 435, 220–224.

Lindenbach, B. D. & Rice, C. M. (2003). Molecular biology offlaviviruses. Adv Virus Res 59, 23–61.

Lindenbach, B. D., Thiel, H.-J. & Rice, C. M. (2007). Flaviviridae: the

viruses and their replication. In Fields Virology, 5th edn, vol. 1, pp.1101–1152. Edited by D. Knipe, P. Howley, D. Griffin, R. Lamb,

M. Martin, B. Roizman & S. Straus. Philadelphia, PA: Lippincott-Raven.

Linz, B., Balloux, F., Moodley, Y., Manica, A., Liu, H., Roumagnac, P.,Falush, D., Stamer, C., Prugnolle, F. & other authors (2007). An

African origin for the intimate association between humans andHelicobacter pylori. Nature 445, 915–918.

Lobo, F. P., Mota, B. E. F., Pena, S. D. J., Azevedo, V., Macedo, A. M.,Tauch, A., Machado, C. R. & Franco, G. R. (2009). Virus–hostcoevolution: common patterns of nucleotide motif usage in

Flaviviridae and their hosts. PLoS ONE 4, e6282.

Lutomiah, J. J. L., Mwandawiro, C., Magambo, J. & Sang, R. C. (2007).Infection and vertical transmission of Kamiti river virus in laboratory

bred Aedes aegypti mosquitoes. J Insect Sci 7, 1–7.

Macaulay, V., Hill, C., Achilli, A., Rengo, C., Clarke, D., Meehan, W.,Blackburn, J., Semino, O., Scozzari, R. & other authors (2005).Single, rapid coastal settlement of Asia revealed by analysis ofcomplete mitochondrial genomes. Science 308, 1034–1036.

Mackenzie, J. S., Gubler, D. J. & Petersen, L. R. (2004). Emergingflaviviruses: the spread and resurgence of Japanese encephalitis, West

Nile and dengue viruses. Nat Med 10 (Suppl), S98–S109.

Mandryk, C. A., Josenhans, H., Fedje, D. W. & Mathewes, R. W.(2001). Late Quaternary paleoenvironments of Northwestern North

America: implications for inland versus coastal migration routes.

Quat Sci Rev 20, 301–314.

May, F. J., Davis, C. T., Tesh, R. B. & Barrett, A. D. T. (2011).Phylogeography of West Nile virus: from the cradle of evolution inAfrica to Eurasia, Australia, and the Americas. J Virol 85, 2964–2974.

McDougall, I., Brown, F. H. & Fleagle, J. G. (2005). Stratigraphic

placement and age of modern humans from Kibish, Ethiopia. Nature433, 733–736.

Medeiros, D. B. A., Nunes, M. R. T., Vasconcelos, P. F. C., Chang,G.-J. J. & Kuno, G. (2007). Complete genome characterization ofRocio virus (Flavivirus: Flaviviridae), a Brazilian flavivirus isolatedfrom a fatal case of encephalitis during an epidemic in Sao Paulostate. J Gen Virol 88, 2237–2246.

Mehla, R., Kumar, S. R. P., Yadav, P., Barde, P. V., Yergolkar, P. N.,Erickson, B. R., Carroll, S. A., Mishra, A. C., Nichol, S. T. & Mourya,D. T. (2009). Recent ancestry of Kyasanur Forest disease virus. EmergInfect Dis 15, 1431–1437.

Miller, M. A., Pfeiffer, W. & Schwartz, T. (2010). Creating the CIPRESScience Gateway for inference of large phylogenetic trees. Presented atthe Gateway Computing Environments Workshop, 2010.

Mohammed, M. A. F., Galbraith, S. E., Radford, A. D., Dove, W.,Takasaki, T., Kurane, I. & Solomon, T. (2011). Molecular phylogeneticand evolutionary analyses of Muar strain of Japanese encephalitisvirus reveal it is the missing fifth genotype. Infect Genet Evol 11, 855–862.

Moodley, Y., Linz, B., Bond, R. P., Nieuwoudt, M., Soodyall, H.,Schlebusch, C. M., Bernhoft, S., Hale, J., Suerbaum, S. & otherauthors (2012). Age of the association between Helicobacter pyloriand man. PLoS Pathog 8, e1002693.

Pan, X.-L., Liu, H., Wang, H.-Y., Fu, S.-H., Liu, H.-Z., Zhang, H.-L., Li,M.-H., Gao, X.-Y., Wang, J.-L. & other authors (2011). Emergence ofgenotype I of Japanese encephalitis virus as the dominant genotype inAsia. J Virol 85, 9847–9853.

Patil, J. A., Cherian, S., Walimbe, A. M., Patil, B. R., Sathe, P. S., Shah,P. S. & Cecilia, D. (2011). Evolutionary dynamics of the AmericanAfrican genotype of dengue type 1 virus in India (1962–2005). InfectGenet Evol 11, 1443–1448.

Pond, S. L. K., Frost, S. D. W. & Muse, S. V. (2005). HyPhy: hypothesistesting using phylogenies. Bioinformatics 21, 676–679.

Porterfield, J. S. (1980). Antigenic characteristics and classification ofTogaviridae. In The Togaviruses: Biology, Structure, Replication, pp.13–46. Edited by R. W. Schlesinger. New York: Academic Press.

Pybus, O. G., Rambaut, A., Belshaw, R., Freckleton, R. P.,Drummond, A. J. & Holmes, E. C. (2007). Phylogenetic evidence fordeleterious mutation load in RNA viruses and its contribution to viralevolution. Mol Biol Evol 24, 845–852.

Ramırez, A., Fajardo, A., Moros, Z., Gerder, M., Caraballo, G.,Camacho, D., Comach, G., Alarcon, V., Zambrano, J. & other authors(2010). Evolution of dengue virus type 3 genotype III in Venezuela:diversification, rates and population dynamics. Virol J 7, 329.

Rasmussen, M., Guo, X., Wang, Y., Lohmueller, K. E., Rasmussen, S.,Albrechtsen, A., Skotte, L., Lindgreen, S., Metspalu, M. & otherauthors (2011). An Aboriginal Australian genome reveals separatehuman dispersals into Asia. Science 334, 94–98.

Ronquist, F., Teslenko, M., van der Mark, P., Ayres, D. L., Darling, A.,Hohna, S., Larget, B., Liu, L., Suchard, M. A. & Huelsenbeck, J. P.(2012). MrBayes 3.2: efficient Bayesian phylogenetic inference andmodel choice across a large model space. Syst Biol 61, 539–542.

Sanjuan, R. (2012). From molecular genetics to phylodynamics:evolutionary relevance of mutation rates across viruses. PLoS Pathog8, e1002685.

Schurr, T. G. (2004). The peopling of the new world: perspectivesfrom molecular anthropology. Annu Rev Anthropol 33, 551–583.

Shapiro, B., Drummond, A. J., Rambaut, A., Wilson, M. C., Matheus,P. E., Sher, A. V., Pybus, O. G., Gilbert, M. T. P., Barnes, I. & otherauthors (2004). Rise and fall of the Beringian steppe bison. Science306, 1561–1565.

Shapiro, B., Rambaut, A. & Drummond, A. J. (2006). Choosingappropriate substitution models for the phylogenetic analysis ofprotein-coding sequences. Mol Biol Evol 23, 7–9.

Origin of the genus Flavivirus

http://vir.sgmjournals.org 1981

Sharp, P. M. & Simmonds, P. (2011). Evaluating the evidence forvirus/host co-evolution. Curr Opin Virol 1, 436–441.

Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-basedphylogenetic analyses with thousands of taxa and mixed models.Bioinformatics 22, 2688–2690.

Stringer, C. B. & Andrews, P. (1988). Genetic and fossil evidence forthe origin of modern humans. Science 239, 1263–1268.

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M. & Kumar,S. (2011). MEGA5: molecular evolutionary genetics analysis usingmaximum likelihood, evolutionary distance, and maximum par-simony methods. Mol Biol Evol 28, 2731–2739.

Twiddy, S. S., Holmes, E. C. & Rambaut, A. (2003). Inferring the rateand time-scale of dengue virus evolution. Mol Biol Evol 20, 122–129.

Uzcategui, N. Y., Sironen, T., Golovljova, I., Jaaskelainen, A. E.,Valimaa, H., Lundkvist, A., Plyusnin, A., Vaheri, A. & Vapalahti, O.(2012). Rate of evolution and molecular epidemiology of tick-borneencephalitis virus in Europe, including two isolations from the samefocus 44 years apart. J Gen Virol 93, 786–796.

Wasik, B. R. & Turner, P. E. (2013). On the biological success ofviruses. Annu Rev Microbiol 67, 519–541.

Weidmann, M., Frey, S., Freire, C. C. M., Essbauer, S., Ruzek, D.,Klempa, B., Zubrikova, D., Vogerl, M., Pfeffer, M. & other authors

(2013). Molecular phylogeography of tick-borne encephalitis virus incentral Europe. J Gen Virol 94, 2129–2139.

Wertheim, J. O. & Kosakovsky Pond, S. L. (2011). Purifying selectioncan obscure the ancient age of viral lineages. Mol Biol Evol 28, 3355–3365.

Wirth, T., Hildebrand, F., Allix-Beguec, C., Wolbeling, F., Kubica, T.,Kremer, K., van Soolingen, D., Rusch-Gerdes, S., Locht, C. & otherauthors (2008). Origin, spread and demography of the Mycobac-terium tuberculosis complex. PLoS Pathog 4, e1000160.

Woelk, C. H. & Holmes, E. C. (2002). Reduced positive selection invector-borne RNA viruses. Mol Biol Evol 19, 2333–2336.

Worobey, M., Telfer, P., Souquiere, S., Hunter, M., Coleman, C. A.,Metzger, M. J., Reed, P., Makuwa, M., Hearn, G. & other authors(2010). Island biogeography reveals the deep history of SIV. Science329, 1487.

Yang, Z. (1996). Maximum-likelihood models for combined analysesof multiple sequence data. J Mol Evol 42, 587–596.

Zanotto, P. M., Gao, G. F., Gritsun, T., Marin, M. S., Jiang, W. R.,Venugopal, K., Reid, H. W. & Gould, E. A. (1995). An arbovirus clineacross the northern hemisphere. Virology 210, 152–159.

Zanotto, P. M., Gould, E. A., Gao, G. F., Harvey, P. H. & Holmes, E. C.(1996). Population dynamics of flaviviruses revealed by molecularphylogenies. Proc Natl Acad Sci U S A 93, 548–553.

J. H.-O Pettersson and O. Fiz-Palacios

1982 Journal of General Virology 95