web apollo resources at the national agricultural library christopher childers nal ars usda...

23
Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Upload: elaine-bryant

Post on 16-Dec-2015

221 views

Category:

Documents


7 download

TRANSCRIPT

Page 1: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Web Apollo Resources at the National Agricultural Library

Christopher ChildersNAL ARS USDA

i5k.nal.usda.gov

Page 2: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Agenda

• Web Apollo– Web based tool for manual gene curation– Registration required for access

• Introduction to the Web Apollo interface– Viewing evidence tracks– Logging in/out

• BLAST• Example annotation

Page 3: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Annotation Overview

1. Find sequence2. Align to genome3. Examine available evidence4. Annotate based on evidence5. Compare results to related sequences

– BLAST/BLAT or multiple alignment

6. Repeat steps 1 through 5 as needed to refine model7. Add annotation details in the “Information Editor”

Page 4: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Annotator Registration

Page 5: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Some conventions

• HSP – High scoring pair in BLAST/BLAT alignments– The ‘Hits’ in an alignment result set– A subsection of a pair of sequences with sufficient score– Change based on the alignment parameters

• Five prime end and three prime end – Based on direction of transcription– Initiation site is at the five prime end– Stop codon is at the three prime end

• In the genome browser, arrowheads indicate direction3’ 5’

5’ 3’

Page 6: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Web Apollo/JBrowse

• JBrowse is a web based genome browser– Visualize features that are mapped to a genome– These features are displayed as tracks– Many different types of data may be

• Web Apollo adds editing functions to JBrowse– Manual gene curation– Changes automatically saved back to server– Edits are visible to other annotators in real-time

Page 7: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Web Apollo

1

3

2

Page 8: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Viewing evidence

1. The “Available Tracks” panel lists all tracks– Tracks are grouped hierarchically

• Study or analysis• Categories based on evidence type

2. The Navigation panel describes the viewed region– Coordinate ranges for the entire sequence on top– Navigation controls including pan, zoom and search in the middle– Coordinate ranges for the current view along the bottom

3. The Evidence panel shows the features in genomic context– Each type of data is a separate track– Tracks may be displayed in different ways

Page 9: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Viewing evidence

• Turning tracks on or off– Click a tracks checkbox to turn it on or off– Click ‘X’ by the track label also turns the track off

Page 10: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Login

• For scaffold selection view:

Page 11: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Login

• Directly from JBrowse genome browser

Page 12: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Login

• Directly from JBrowse genome browser

Page 13: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Login

• Directly from JBrowse genome browser

Page 14: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

BLAST at the i5k Workspace

• Redesigned: uses the latest version of BLAST+• Interface updates options automatically– Based on the combination of query and database– Chooses correct options (blastn, blastx, tblastn, etc)

• Results persist for one week– The results URL can be saved or shared

• HSPs are viewable in the genome browser– HSPs are conservatively chained together– Visible as connected blocks

Page 15: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

BLAST

Page 16: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

BLAST Results

Page 17: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

BLAST results in Web Apollo

Page 18: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Live Annotation Example

• Alpha-catenin in the Colorado Potato Beetle• We created special training versions of our toolsBLAST: https://i5k.nal.usda.gov/training/blast Web Apollo:https://apollo.nal.usda.gov/lepdec_training/jbrowse

Page 19: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov
Page 20: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov
Page 21: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov
Page 22: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Annotation using real data

• Logging in to the demo instance• Possible gene families to look at– Cytochrome P450– myotubularin-related proteins– COP9

Page 23: Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov

Sequence Search

• Map a gene of interest to a genomic region– BLAT: Web Apollo plugin– BLAST@NAL: separate tool

• Each has unique features– BLAT shows the hit as a highlighted region– BLAST shows hits as segments, similar to other

evidence tracks