Assignment 6: Motif Finding
genetics.wustl.edu/bio5488/files/2017/03/Assignment-6-Review-.pdf

Assignment 6: Motif Finding. Bio5488, 3/24/17!! Review :)


Page 1: Assignment 6: Motif Finding

Bio5488
3/24/17!!

Review :)

Page 2: Assignment 6: Motif finding

• Input
  • Promoter sequences
  • PWMs of DNA-binding proteins
• Goal
  • Find putative binding sites in the sequences by scanning the sequences for matches to the PWM
• Output
  • List of the locations and scores of putative binding sites
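The scanning step described on this slide can be sketched roughly as follows. This is a minimal illustration, not the assignment's actual script: it assumes the PWM is stored as a dict mapping each base to a list of per-position scores and that a window's score is the sum of those scores. The names `scan_sequence` and `example_pwm`, and the matrix values, are hypothetical.

```python
def scan_sequence(sequence, pwm):
    """Score every window of `sequence` against `pwm`.

    Assumes `pwm` maps each base (A, C, G, T) to a list of per-position
    scores and that scores are additive across positions.
    Returns a list of (start, score) tuples with 0-based start positions.
    """
    motif_length = len(pwm["A"])
    hits = []
    for start in range(len(sequence) - motif_length + 1):
        window = sequence[start:start + motif_length]
        score = sum(pwm[base][offset] for offset, base in enumerate(window))
        hits.append((start, score))
    return hits


# Hypothetical 3-bp scoring matrix, for illustration only.
example_pwm = {
    "A": [2, -1, -1],
    "C": [-1, 2, -1],
    "G": [-1, -1, 2],
    "T": [-1, -1, -1],
}

hits = scan_sequence("TACGA", example_pwm)
print(hits)  # [(0, -3), (1, 6), (2, -3)]
```

The output is the list of locations and scores the slide asks for; here the window starting at position 1 (`ACG`) scores highest.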

[Figure labels: PWM, putative binding sequence, promoter]

Page 3: Assignment TODOs

• Determine the highest affinity binding site for each PWM
  • Calculate by hand or write a script :)
• Comment the existing code
  • Comment the user-defined functions with function docstrings
• Modify the script to scan the reverse complement of the input sequence
• Modify the script to only report hits that have scores above a given threshold
• Scan promoters (n=2) to find putative binding sites for each DNA-binding protein (n=2)
• Answer follow-up questions
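Three of these TODOs are small enough to sketch in a few lines each. The code below is only one possible approach under stated assumptions: the PWM is a dict of base to per-position scores, scores are additive, and hits are `(start, score)` tuples. All function names and the example matrix are hypothetical and are not taken from the assignment's script.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def best_site(pwm):
    """Return (sequence, score) of the highest-scoring site for `pwm`.

    With additive scores, the best site takes the top-scoring base at
    each position. Ties go to the first base in A, C, G, T order.
    """
    bases = []
    total = 0
    for position in range(len(pwm["A"])):
        base = max("ACGT", key=lambda b: pwm[b][position])
        bases.append(base)
        total += pwm[base][position]
    return "".join(bases), total


def reverse_complement(sequence):
    """Return the reverse complement of an uppercase DNA string."""
    return sequence.translate(COMPLEMENT)[::-1]


def filter_hits(hits, threshold):
    """Keep only (start, score) hits whose score is above `threshold`."""
    return [hit for hit in hits if hit[1] > threshold]


# Hypothetical 3-bp scoring matrix, for illustration only.
example_pwm = {
    "A": [2, -1, -1],
    "C": [-1, 2, -1],
    "G": [-1, -1, 2],
    "T": [-1, -1, -1],
}

print(best_site(example_pwm))               # ('ACG', 6)
print(reverse_complement("AAG"))            # CTT
print(filter_hits([(0, -3), (1, 6)], 0))    # [(1, 6)]
```

"Above a given threshold" is read here as strictly greater than; check the assignment instructions if hits equal to the threshold should also be reported.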

Page 4: TF Scoring Matrix

Page 5: Indexing

• Indexing is somewhat arbitrary; however, it's important to follow conventions:
  • The start position of a feature is smaller than the stop position
  • The coordinates are relative to the forward strand
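As an illustration of these two conventions, a hit found while scanning the reverse complement has to be converted back to forward-strand coordinates. The sketch below assumes 0-based, half-open coordinates (start included, stop excluded); that choice and the helper name `to_forward_coordinates` are assumptions, so follow whatever indexing the assignment instructions specify.

```python
def reverse_complement(sequence):
    """Return the reverse complement of an uppercase DNA string."""
    return sequence.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def to_forward_coordinates(rc_start, motif_length, sequence_length):
    """Convert a hit on the reverse complement to forward-strand coordinates.

    `rc_start` is the 0-based start of the hit in the reverse-complemented
    sequence. Returns (start, stop) on the forward strand, 0-based and
    half-open, so start is always smaller than stop.
    """
    start = sequence_length - rc_start - motif_length
    stop = start + motif_length
    return start, stop


forward = "GGCTTA"
rc = reverse_complement(forward)   # "TAAGCC"
rc_start = rc.find("AAG")          # 1
start, stop = to_forward_coordinates(rc_start, 3, len(forward))
print(start, stop)                 # 2 5

# The forward-strand slice is the reverse complement of the matched site.
assert reverse_complement(forward[start:stop]) == "AAG"
```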

Page 6: Use Toy Data Sets!!!

Toy scoring matrix (rows: position; columns: base)

Position   A  C  G  T
0          1  0  0  0
1          1  0  0  0
2          0  0  1  0

Look at our examples/instructions so you give us the right answers :)
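One way to use a toy data set like the matrix on this slide is to score a short sequence by hand and check that the code gives the same numbers. The sketch below uses the toy matrix values from the slide; the test sequence `CAAGT`, the function name `score_windows`, and the additive scoring are assumptions made for illustration.

```python
# Toy matrix from the slide: position 0 favors A, 1 favors A, 2 favors G.
toy_pwm = {
    "A": [1, 1, 0],
    "C": [0, 0, 0],
    "G": [0, 0, 1],
    "T": [0, 0, 0],
}


def score_windows(sequence, pwm):
    """Return the additive PWM score of every window, left to right."""
    motif_length = len(pwm["A"])
    scores = []
    for start in range(len(sequence) - motif_length + 1):
        window = sequence[start:start + motif_length]
        scores.append(sum(pwm[base][i] for i, base in enumerate(window)))
    return scores


# Hand calculation for the hypothetical sequence CAAGT:
#   CAA -> 0 + 1 + 0 = 1
#   AAG -> 1 + 1 + 1 = 3
#   AGT -> 1 + 0 + 0 = 1
scores = score_windows("CAAGT", toy_pwm)
assert scores == [1, 3, 1]
print(scores)  # [1, 3, 1]
```

Because the toy matrix is small enough to score by hand, a mismatch points directly at a bug in the scanning code rather than in the data.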