browsing the genome

42
Browsing the Genome Browsing the Genome Using Genome Browsers to Using Genome Browsers to Visualize and Mine Data Visualize and Mine Data

Upload: doannhi

Post on 27-Jan-2017

232 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Browsing the Genome

Browsing the GenomeBrowsing the Genome

Using Genome Browsers to Using Genome Browsers to Visualize and Mine DataVisualize and Mine Data

Page 2: Browsing the Genome

Genome BrowsersGenome Browsers Software designed to enable a user Software designed to enable a user

to access and display sequence data to access and display sequence data Provide a visual correlation for Provide a visual correlation for

different types of informationdifferent types of information Organize large amounts of genome Organize large amounts of genome

sequence datasequence data

Page 3: Browsing the Genome

Several Different Genome Several Different Genome BrowsersBrowsers

Common features:Common features: Coordinate system is based on the buildCoordinate system is based on the build Zoom in and outZoom in and out Gene features aligned to genomeGene features aligned to genome

Major Differences:Major Differences: Each Browser has a very different look and feelEach Browser has a very different look and feel Navigating through the informationNavigating through the information

Page 4: Browsing the Genome

Main Genome Browser Main Genome Browser RepositoriesRepositories

EnsemblEnsembl NCBI (Entrez) - BLASTNCBI (Entrez) - BLAST UCSC - BLATUCSC - BLAT

Ensembl, NCBI, and UCSC use the Ensembl, NCBI, and UCSC use the same human genome assembly that same human genome assembly that is generated by NCBI but release is generated by NCBI but release timing is different between sitestiming is different between sites

Page 5: Browsing the Genome

UCSCUCSC Vertebrates, Deuterostomes, Insects, Vertebrates, Deuterostomes, Insects,

Nematodes, YeastNematodes, Yeast Entry into genome sequence via BLATEntry into genome sequence via BLAT Table BrowserTable Browser Creation of PDFCreation of PDF Provides access to all the data produced by Provides access to all the data produced by

the project, and to the software used to the project, and to the software used to analyze and present itanalyze and present it

Site produces and maintains annotation Site produces and maintains annotation trackstracks

Page 6: Browsing the Genome

Aligned Annotation TracksAligned Annotation Tracks Genomic data: known genes, predicted Genomic data: known genes, predicted

genes, ESTs, mRNAs, CpG islands, genes, ESTs, mRNAs, CpG islands, assembly gaps and coverage, chromosomal assembly gaps and coverage, chromosomal bands, mouse homologies, and morebands, mouse homologies, and more

Annotation tracks are both computed at Annotation tracks are both computed at UCSC from publicly available sequence data UCSC from publicly available sequence data and provided by collaboratorsand provided by collaborators

Users can also add their own custom tracks Users can also add their own custom tracks to the browser to the browser

Page 7: Browsing the Genome

UCSC OutlineUCSC Outline NavigatingNavigating Configuring BrowserConfiguring Browser Extracting dataExtracting data

Page 8: Browsing the Genome

Home PageHome Page

Page 9: Browsing the Genome

BLATBLAT

Page 10: Browsing the Genome

BLAT ResultsBLAT Results

Page 11: Browsing the Genome

Standard QueryStandard Query

Page 12: Browsing the Genome

Query ResultsQuery Results

Page 13: Browsing the Genome

Graphical InterfaceGraphical Interface

Page 14: Browsing the Genome

Configuring DisplayConfiguring Display

Page 15: Browsing the Genome

ComponentsComponents

Page 16: Browsing the Genome

Get DNAGet DNA

Page 17: Browsing the Genome

Configuring DNAConfiguring DNA

Page 18: Browsing the Genome

DNA STS HighlightedDNA STS Highlighted

Page 19: Browsing the Genome

TracksTracks

Page 20: Browsing the Genome

Track DisplayTrack Display

Page 21: Browsing the Genome

Human SDAD1Human SDAD1

Page 22: Browsing the Genome

ConvertConvert

Page 23: Browsing the Genome

Mouse Sdad1Mouse Sdad1

Page 24: Browsing the Genome

EST TrackEST Track

Page 25: Browsing the Genome

Entry DataEntry Data

Page 26: Browsing the Genome

Viewing ExonsViewing Exons

Page 27: Browsing the Genome

Integrate Specific DataIntegrate Specific Data

Page 28: Browsing the Genome

Custom TracksCustom Tracks User provided annotation data User provided annotation data Can be in standard GFF format or in a Can be in standard GFF format or in a

format designed specifically for UCSC format designed specifically for UCSC Genome Browser, including GTF, PSL, Genome Browser, including GTF, PSL, BED, WIG, and microarray (BED15) BED, WIG, and microarray (BED15)

Page 29: Browsing the Genome

Add Custom TracksAdd Custom Tracks

Page 30: Browsing the Genome

Sample Custom TracksSample Custom Tracks GFFGFFchr5 EST exon 92719127 92719406 . + 0 BEchr5 EST exon 92719127 92719406 . + 0 BEchr5 EST exon 92731587 92731784 . + 0 BEchr5 EST exon 92731587 92731784 . + 0 BE

BedBedchr5 92715320 92715326 miR-194 1 -chr5 92715320 92715326 miR-194 1 -chr5 92715467 92715474 miR-124.1 3 -chr5 92715467 92715474 miR-124.1 3 -chr5 92715467 92715473 miR-124/506 1 -chr5 92715467 92715473 miR-124/506 1 -

Page 31: Browsing the Genome

Display of Custom TracksDisplay of Custom Tracks

Page 32: Browsing the Genome

Configure Track DisplayConfigure Track Display

Page 33: Browsing the Genome

Save PDFSave PDF

Page 34: Browsing the Genome

Table BrowserTable Browser

Page 35: Browsing the Genome

Sample Table DataSample Table Data

Page 36: Browsing the Genome

Proteome BrowserProteome Browser

Page 37: Browsing the Genome

Protein SequenceProtein Sequence

Page 38: Browsing the Genome

Protein CharacteristicsProtein Characteristics

Page 39: Browsing the Genome

Structure InformationStructure Information

Page 40: Browsing the Genome

Summary of UCSCSummary of UCSC Different ways of querying genomeDifferent ways of querying genome Control over graphical displayControl over graphical display Vast amount of genomic dataVast amount of genomic data Ability to collect that dataAbility to collect that data

Page 41: Browsing the Genome

HoweverHowever UCSC does not include my genomeUCSC does not include my genome

Actually no genome browser Actually no genome browser supports my genomesupports my genome

Page 42: Browsing the Genome

Custom Browser SoftwareCustom Browser Software GBrowse is a combination of GBrowse is a combination of

database and interactive Web page database and interactive Web page for manipulating and displaying for manipulating and displaying annotations on genomes annotations on genomes

Annotation Browsers- Argo and Annotation Browsers- Argo and ApolloApollo