viewing the data: ucsc genome browser and its possibilities
TRANSCRIPT
![Page 1: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/1.jpg)
.Robert Kuhn
UC Santa Cruz.
Variant Prediction Training CourseJohor, Malaysia
.
August 27-30, 2018
Viewing the data: UCSC Genome Browser and its possibilities
@GenomeBrowser
![Page 2: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/2.jpg)
Disclosures
Royalties from Browser licensesBioinformatics contract, Regeneron, Inc
![Page 3: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/3.jpg)
funding:
National Human Genome Research Institute (NHGRI)
California Institute for Regenerative Medicine (CIRM)
QB3 (UCBerkeley, UCSF, UCSC)
Chan Zuckerberg Initiative
Howard Hughes Medical Institute
![Page 4: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/4.jpg)
UCSC Browser team
– David Haussler – co-PI
– Jim Kent – Browser Concept, BLAT, Team Leader, PI
EngineeringAngie HinrichsKate RosenbloomHiram ClawsonGalt BarberBrian RaneyMax HaeusslerJonathan CasperChristopher Lee
QA, Docs, SupportBrian LeeMatt Speir Jairo Navarro Chris VillarrealLou NassarDaniel SmelterConner Powell
KiloKluster, Sys-adminJorge GarciaErich WeilerHaifang Talc
ManagementAnn Zweig
![Page 5: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/5.jpg)
UCSC = UC Santa Cruz!= USC, USCS, UCSF, UCSD….
![Page 6: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/6.jpg)
![Page 7: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/7.jpg)
How do you visualize your Next-Gen Sequencing data?
![Page 8: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/8.jpg)
120 assemblies
30,000 genes54,000,000 SNPs
71,000,000 mRNAs
3,000,000,000 nucleotides
80 organisms
DECIPHER
OMIM
CNVs
array probesets
Microarray expression
7,000,000,000 people1000 Genomes Project
![Page 9: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/9.jpg)
http:// genome-asia.ucsc.edu
http:// bit.ly/ucscMalaysia2018
>>>>>
![Page 10: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/10.jpg)
UCSC Genome Browser
Display engine for genomic annotations.Consistent interface across genomes.
A tool for inquiry-driven discovery.
![Page 11: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/11.jpg)
![Page 12: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/12.jpg)
YouTube training channel: bit.ly/ucscVideos
On-site workshops (training link on main page):http:// genome.ucsc.edu/training/
![Page 13: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/13.jpg)
VariationWatson: 2.06 million SNPsVenter: 3.32 million SNPs
1.19 million in common
In 20 kb myoglobin region on chr22, Watson and Venter share 20 SNPs. Watson has 9 unique SNPs, Venter 6.
![Page 14: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/14.jpg)
VariationWatson: 2.06 million SNPsVenter: 3.32 million SNPs
1.19 million in common
![Page 15: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/15.jpg)
VariationWatson: 2.06 million SNPsVenter: 3.32 million SNPs
1.19 million in common
![Page 16: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/16.jpg)
support for (some) HGVS:
NM_198056.2:c.1654G>T
NP_002993.1:p.Asp92Glu
NP_002993.1:p.D92E
BRCA1 Ala744Cys / BRCA1 p744http://genome.soe.ucsc.edu/goldenPath/help/query.html#HGVS
= bit.ly/ucscHGVS
![Page 17: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/17.jpg)
And now: HGVS output from Variation Annotation Integrator:
![Page 18: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/18.jpg)
Wiggle track
BAM track
bamToBigWig
bedGraphToBigWig
bedGraph track
Data pipeline
slide modified from:
Tim Hubbard, King’s College, London
100K genomes (rare disease or cancer)
bamToBedGraph
![Page 19: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/19.jpg)
Data pipeline
slide modified from:
Tim Hubbard, King’s College, London
100K genomes (rare disease or cancer)
Variant Annotation Integrator
BAM trackVCF track
![Page 20: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/20.jpg)
How do you visualize your Next-Gen Sequencing data? -- BAM filemillions of short reads
files too large to upload (timeout)
mismatches to referenceare in red
Custom
Track
Raw Reads aligner / SAMtools SAM
BAM
.bai
index / SAMtools
Custom Track:Make BAM, .bai files available to the web (http:, https: or ftp:)
Upload only the location of the data.
track name=trackName type=bam bigDataUrl=http://path/file.bam
The Browser fetches only tiny portion of the file
http://samtools.sourceforge.net/
sample ChIP-seq data courtesy Charles Nicolet, UC Davis
drag / zoom
![Page 21: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/21.jpg)
zoom to base level
view alignment details
possible heterozygotes lower quality scores
shown in lighter color
SNP track
homozygous mismatch to reference is same as a known non-synonymous SNP
UCSC genes
![Page 22: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/22.jpg)
Display
Read alignments: BAM, CRAM
Coverage: (BAM), wiggle
Variants: pgSNP, VCF, (HGVS, rs#)
Predictive
Variant Annotation Integrator
SIFT, PolyPhen, Mutation{Taster,Assessor} …
Data tracks
Benign, Pathogenic
CNVs, SNPs
![Page 23: Viewing the data: UCSC Genome Browser and its possibilities](https://reader034.vdocuments.site/reader034/viewer/2022050306/626f2866c4c3845a2c75fb4c/html5/thumbnails/23.jpg)
live demo:
http:// genome-asia.ucsc.edu
FGF2 FGFR1 FGFR2 FGFR3 FGFR4FGF2 FGFR1 FGFR2 FGFR3 FGFR4