![Page 1: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/1.jpg)
Mapping Influenza A Virus Transmission Networks with
Whole Genome Comparisons(Methods)
Adrienne Breland
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 2: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/2.jpg)
Goal
- to characterize global Influenza A Virus
transmission as a complex network
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 3: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/3.jpg)
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Russell (2008) The global circulation of seasonal influenza A (H3N2) viruses
Proposed global H3N2 circulation
![Page 4: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/4.jpg)
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 5: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/5.jpg)
• Motivation
• Major Questions
• Data
• Genome Comparison Method
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Outline
![Page 6: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/6.jpg)
• Motivation
• Major Questions
• Data
• Genome Comparison Method
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Outline
![Page 7: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/7.jpg)
Motivation
• Delineating real disease networks is difficult– Infection tracing: Detecting exact
transmission links– Contact tracing: All potential
transmission contacts– Diary Based: Subject records all contacts
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 8: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/8.jpg)
Motivation
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Infection tracing Contact tracing Diary Based
Keeling M & K Eames (2005) Networks and epidemic models. J. R. Soc. Interface 2:295-307
![Page 9: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/9.jpg)
Motivation
• Delineating real disease networks is very useful
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 10: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/10.jpg)
Motivation
• Delineating real disease networks is very useful
-targeting an attack
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 11: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/11.jpg)
Motivation
• Delineating real disease networks is very useful
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGTError and attack tolerance of complex networks. Réka Albert, Hawoong Jeong and Albert-László Barabási
![Page 12: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/12.jpg)
Motivation
• Delineating real disease networks is very useful
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGThttp://prblog.typepad.com/strategic_public_relation/images/2007/06/22/simple_social_network.png
![Page 13: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/13.jpg)
Motivation
• Delineating real disease networks is very useful
-correlation coefficients
ji,ijji
ii
ABD [DB] BD typeof pairs
D [D] D typeof singles
correlatednot B and D if 1
[A][B]n
N[DB]CDB
![Page 14: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/14.jpg)
Motivation
• Delineating real disease networks is very useful
-detecting more probable global routes
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 15: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/15.jpg)
Motivation
• Global routes
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 16: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/16.jpg)
Motivation
• Global routes
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Breland A, S Nasser, K Schlauch, M Nicolescu, F Harris (2008) Efficient Influenza A Virus Origin Detection. Journal of Electronics and Computer Science, 10;1-12
![Page 17: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/17.jpg)
Motivation
• Delineating real disease networks is very useful
-examine with other spatial data
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 18: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/18.jpg)
Motivation
• Spatial data
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 19: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/19.jpg)
Motivation
• Spatial data
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
VEGETATION
![Page 20: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/20.jpg)
Motivation
• Spatial data
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
POPULATION
![Page 21: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/21.jpg)
Motivation
• Spatial data
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
CLIMATE CHANGE
![Page 22: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/22.jpg)
• Motivation
• Major Questions
• Data
• Genome Comparison Method
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Outline
![Page 23: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/23.jpg)
Major questions
• Location and degree of host jumping
• Underlying structure (small world, power law..)
• Subtype independence
• Re-assortment
• Geographic routes
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 24: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/24.jpg)
• Motivation
• Major Questions
• Data
• Genome Comparison Method
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Outline
![Page 25: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/25.jpg)
Data
• http://www.ncbi.nlm.nih.gov/genomes/FLU/FLU.html
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 26: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/26.jpg)
Data
• ≈ 4000 sequences• 1999-2009• Global regions (i.e. China, U.S., Africa, India...)• All subtypes (i.e. H5N1, H1N1, ..)• All hosts species (Domestic Avian, Wild Avian, etc..)
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 27: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/27.jpg)
Data
• ≈ 374 per year
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 28: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/28.jpg)
Data
• Multiple host types
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 29: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/29.jpg)
Data
• Multiple sub types
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 30: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/30.jpg)
• Motivation
• Major Questions
• Data
• Genome Comparison Method
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
Outline
![Page 31: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/31.jpg)
Genome Comparisons
• Similarity matrix, N sequences:
N(N-1)/2 comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
-0.40.10.970.10.82N
--0.30.60.70.9.
---0.30.50.02.
----0.020.01.
-----0.932
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
![Page 32: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/32.jpg)
Romanova,J (2006) The fight against new types of influenza virus. Biotechnology J,1:1381-1392
![Page 33: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/33.jpg)
Genome Comparisons
• 8 segments
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
- 1 N
-- 1.
--- .
---- .
-----12
------1
N. .. 21
HA ≈ 1750bp
NS ≈ 900bp
M ≈ 1000bp NA ≈ 1300bp NP ≈ 1500bp
PA ≈ 2100bp PB1 ≈ 2200bp PB2 ≈ 2300bp
![Page 34: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/34.jpg)
Genome Comparisons
• 8 segments
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
![Page 35: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/35.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
• Alignment, O(n2), n = max sequence length
.....AAAACTTGAACC.....
.....GGACTTGACCT.....
![Page 36: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/36.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
AAGAACCTTTATGACAAGGTTCGACTACA GCTTAGGGATAATGCAAAGGAGCTGGT
• Alignment-free k-mers, O(n)
∑ = {A,C,G,T/U}
4k possible k-mers, k≥0
TT
TG
.
.
.
AG
AC
AA
frequencyk-word
![Page 37: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/37.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
• Feature Frequency Profiles (FFP)
Ck = <c1,...,c4k>
Fk = <c1/∑,...,c4k/∑> = <f1,...,f4k>
Sims GE, Jun SR, Wu GA, Kim SH (2009) Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions. Proc Natl Acad Sci U S A. ,106(8):2677-82 .
![Page 38: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/38.jpg)
Genome Comparisons
• Jensen-Shannon Divergence (JS)
compare(s1,s2)
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
Pk = FFP(s1), Qk = FFP(s2), Mk = (Pk + Mk)/2
JS(Pk,Mk) = 1/2KL(Pk,Mk) + 1/2KL(Qk,Mk)
KL =
k
i ik
ikik m
pp
4
1 ,
,2,
log
![Page 39: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/39.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
• k=?
![Page 40: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/40.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
• k=?
k s.t. N(k) ≥ N(k+1)
k ≈ 4
![Page 41: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/41.jpg)
Genome Comparisons
TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA
• Actual & Predicted times
![Page 42: Mapping Influenza A Virus Transmission Networks with Whole Genome Comparisons (Methods) Adrienne Breland TTGTGGATTCTTGATCGTCTTTTCTTCAAATGTAT TTATCGTCGCCTTAAATACGGA](https://reader031.vdocuments.site/reader031/viewer/2022032800/56649d425503460f94a1d98f/html5/thumbnails/42.jpg)
• Questions/Comments?
• Thanks