clustered repeats and regulatory sites

63
Clustered Repeats and Regulatory Sites Abdulrahman Alazemi, Shahroze Abbas, Liam Lewis, Donald Ta, Ann Vo

Upload: minnie

Post on 22-Feb-2016

45 views

Category:

Documents


0 download

DESCRIPTION

Clustered Repeats and Regulatory Sites. Abdulrahman Alazemi , Shahroze Abbas, Liam Lewis, Donald Ta, Ann Vo. Overview. Clustered repeats Wide variety of functions History largely unknown Regulatory Sites A segment capable of altering expression of specific genes - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Clustered Repeats and Regulatory Sites

Clustered Repeats and Regulatory SitesAbdulrahman Alazemi, Shahroze Abbas, Liam Lewis, Donald Ta, Ann Vo

Page 2: Clustered Repeats and Regulatory Sites

Overview• Clustered repeats

Wide variety of functions History largely unknown

• Regulatory Sites A segment capable of altering expression of

specific genes Various classifications of regulatory sequences

Found in non-coding regions Functions at the transcriptional level

Page 3: Clustered Repeats and Regulatory Sites

Identification• Consensus sequences

Utilize PSSM How?

Determine consensus

Page 4: Clustered Repeats and Regulatory Sites

Clustered repeats and potential regulatory sequencesAbdulrahman Alazemi

Page 5: Clustered Repeats and Regulatory Sites

Background; Transcription factors?

Activators Vs Repressors.

Page 6: Clustered Repeats and Regulatory Sites

Thoughts;- My questions;

Can I apply a known method/tool with a known results to other phage and get the same/similar

result?

Page 7: Clustered Repeats and Regulatory Sites

The known case; Examine the proven

Repressor and Cro binding sites (operators) of Phage Lambda.

Bioinformatics method in the notes.

First “Name of Lambda in BioBike”.

Go to BioBike/Phantome.

Page 8: Clustered Repeats and Regulatory Sites

The known case; Second; Motifs in for the upstream sequence of phage Lambda. Labeled, DNA, Multiple-Hits-ok.

Page 9: Clustered Repeats and Regulatory Sites

The known case; Results of motifs in. More than one interesting

case. Motifs 1, 2 , 3.

Page 10: Clustered Repeats and Regulatory Sites

The known case; BioBike function; Description-analysissubmenu, Genes-

proteins menu. Now, we have anidea about whereto look.

Page 11: Clustered Repeats and Regulatory Sites

The known case; Used the function

sequence-of from Genome menu.

Go to the specific region in the genome

Page 12: Clustered Repeats and Regulatory Sites

The known case; Finding the operators. Directly Vs inversion

of.

Page 13: Clustered Repeats and Regulatory Sites

Phage Lambda map;

Page 14: Clustered Repeats and Regulatory Sites

Thoughts;- My concern/focus;

Would I find some sort of generality between operators of different phages?

Page 15: Clustered Repeats and Regulatory Sites

My experiment; Twenty one random phages of different phage families. Eight of them don't have repressors. (eliminated) Three of the 13 phages left didn't display a map because of linear

amplicon. Ten phages out of 21 went through all the steps of the

method/tool successfully and gave me back out come that I can work with.

Three out of 10 have similar results to phage Lambda.

Page 16: Clustered Repeats and Regulatory Sites

Outcome analysis;- Similar to phage lambda:

-Bacillus-phage-1

Page 17: Clustered Repeats and Regulatory Sites

Phage Bacillus-phage-1 map;

Page 18: Clustered Repeats and Regulatory Sites

Outcome analysis;-Similar to phage Lambda:

-Listeria-phage-A006

Page 19: Clustered Repeats and Regulatory Sites

Phage Listeria-phage-A006 map;

Page 20: Clustered Repeats and Regulatory Sites

Outcome analysis;-Similar to phage lambda:

-Lactobacillus-johnsonii-prophage-Lj928

Page 21: Clustered Repeats and Regulatory Sites

Phage Lactobacillus-johnsonii-prophage-Lj928 map;

Page 22: Clustered Repeats and Regulatory Sites

First conclusion;- Out of 21 or 10 phages, only 3 phages are similar to phage

Lambda.- Less than 50%.- No Generality. - Appropriate conclusion;- Phage Lambda, Bacillus-phage-1, Listeria-phage-A006 , and Lactobacillus-johnsonii-prophage-Lj928 have a similarity/generality between their operators that the repressors bind to.

Page 23: Clustered Repeats and Regulatory Sites

Inspiration; - Dead end.

- The articles !!!! - Extend my research.

- look for something interesting.

Page 24: Clustered Repeats and Regulatory Sites

First interesting case;- In Burkholderia-phage-Bcep1.- Six similar sequence in one intergenic region- another 6 similar sequences in another intergenic region.- Palindromic sequences.- 6 or 3 sequences ?- Bacillus-phage-1 is similar to Burkholderia-phage-Bcep1 somehow.

Page 25: Clustered Repeats and Regulatory Sites

First interesting case;

Page 26: Clustered Repeats and Regulatory Sites

Phage Burkholderia-phage-Bcep1 map;

Page 27: Clustered Repeats and Regulatory Sites

Second interesting case;• - In phage Clostridium-phage-39-O.• - Eight nucleotides sequence (TTACTACA) repeated 10 times in one

intergenic sequence.• - Again the same sequence repeated 8 times in another intergenic

sequence in another place on the phage.

Page 28: Clustered Repeats and Regulatory Sites

Second interesting case;

Page 29: Clustered Repeats and Regulatory Sites

Phage Clostridium-phage-39-O map;

Page 30: Clustered Repeats and Regulatory Sites

Conclusion; - Goals;

- Pick one interesting case.

- Research it.

- Make sense of it.

Page 31: Clustered Repeats and Regulatory Sites

A. Comparison of Pseudomonas putida and Azotobacter REP sequencesDonald Ta

Page 32: Clustered Repeats and Regulatory Sites

REPs• Repetitive Extragenic Palindromic Sequences

Found mainly in abundance in Enterobacteriaecae

• Can be anywhere around 20 to 40 nt long

• Clustered into structures called BIMEs (bacterial interspersed mosaic element) as two inverted tandem repeats separated by a short linker of variable length

Page 33: Clustered Repeats and Regulatory Sites

What do REPs do?• Regulate Gene Expression

• Structuring DNA

• Specific target sites for bacterial insertion sequences

• Possibly more that are undiscovered

Page 34: Clustered Repeats and Regulatory Sites

Previous Study• I. Aranda-Olmedo 2002 used BLAST (Basic local

alignment search tool) to find regions of local similarity between sequences downloaded from the National Center for Biotechnology Information (NCBI)

• Used database with all contigs of Pseudomonas putida already available in The Institute for Genome Research

• Developed their own program to screen all of the strains against the 35 nt sequence 5’-CCGGCCTCTTCGCGGGTAAGCCCGCTCCTACAGGG-3’

Page 35: Clustered Repeats and Regulatory Sites

Results of that Study

Page 36: Clustered Repeats and Regulatory Sites

Implications from that study

They suggest that the 35 bp element they found is species specific in P. putida

First time that REP sequences have been described and characterized in a group of non-enterobacteriaceae

Page 37: Clustered Repeats and Regulatory Sites

What am I doing?• Comparing REP sequence element of Pseudomonas

putida KT2440 with Azotobacter vanlandii Why? Order Pseudomonadales

• Used the REP element that is most common among Pseudomonas species “GCGGGnnnnCCCGC”

Page 38: Clustered Repeats and Regulatory Sites

Methods• Used built-in functions of BioBike to scan a

sequence for possibly loose matches of a pattern

• “****GCGGG****CCCGC****” sequence iterated over the sequence of the organism of interest and then whenever there was a match it was displayed on the output

• “*” means an unspecified amino acid

Page 39: Clustered Repeats and Regulatory Sites

Findings• 52 sequence hits in Azotobacter vanlandii that

appear to have the same conserved region found in Pseudomonas putida

• The species share similar REP elements with the same conserved central palindromic region

• “GCGGG****CCCGC”

Page 40: Clustered Repeats and Regulatory Sites

Output

Page 41: Clustered Repeats and Regulatory Sites

Significance• REP sequences mainly found abundantly in

Enterobacteriacaea

• Study by Bao Ton-Hoang 2012 suggested that transposases could’ve been responsible for the proliferation of REP sequences in the genomes of bacteria in Enterobacteriacaea

• Possibly suggest a similar origin of REP sequences/elements for Pseudomonas and Enterobacteriacaea?

Page 42: Clustered Repeats and Regulatory Sites

Problems?• Found 2 hits in E. Coli K-12 that had the REP

element Maybe suggests similar origin?

• Could be just a fluke/just by chance that these two organisms share the same REP element in abundance Past Study found 804 REP sequences with that REP

element in Pseudomonas putida I found 52 in Azotobacter vanlandii

Page 43: Clustered Repeats and Regulatory Sites

Possible plans of the future/near future?• Compare with other bacteria in the order of

Pseudomondas to see if I get similar results

• Possibly try to find a link to how REP sequences started proliferating in bacteria outside of Enterobacteriacae

Page 44: Clustered Repeats and Regulatory Sites

Positional Preference of Rho-Independent Transcriptional Terminators in E. ColiAnn Vo

Page 45: Clustered Repeats and Regulatory Sites

Transcriptional Terminators• Rho-independent• Specific activities poorly understood• Occurs in ssDNA and RNA• Unique characteristics:

T-Tract: 12-15 nt GC-rich stem: 4-18 nt

Page 46: Clustered Repeats and Regulatory Sites

Transcriptional Terminators• Available algorithms:

RNAMotif TransTermHP ARNold

• About 317 natural terminators found in E. Coli• Lai et al. (2013) found a positional preference

between other regulatory sequences

Do transcriptional terminators have a positional preference relative to the end of the gene?

Page 47: Clustered Repeats and Regulatory Sites

ARNold• Erpin

Scores input sequences Compares against 1,200 known terminators from

Bacillus subtilitis and Escherichia coli

• RNAMotif Used descriptors to find possible terminators Scores free energy of hairpin formation

Page 48: Clustered Repeats and Regulatory Sites

Matching Sequences• BioBIKE/PhAnToMe

Extracted the 50 nucleotides following every gene

• Python Compared sequences to terminators Calculated distance to terminator

ARNold 3248 possible terminatorsBioBIKE 5341 downstream sequencesPython 126 terminators

CAGGACGGTTTACCGGGGAGCCATAAACGGCTCCCTTTTCATTGTTATCA ACGGTTTACCGGGGAGCCATAAACGGCTCCCTTTTCATTGTTA

downstream sequenceterminator

Page 49: Clustered Repeats and Regulatory Sites

3 4 5 6 7 8 9 10 11 12 13 14 15 160

5

10

15

20

25

Count of Terminator Distance by Length

Terminators Distance (nt)

Num

ber o

f Ter

min

ator

s

Page 50: Clustered Repeats and Regulatory Sites

3 4 5 6 7 8 9 10 11 12 13 14 15 160

5

10

15

20

25

Count of Terminator Distance by Length

Terminators Distance (nt)

Num

ber o

f Ter

min

ator

s

Page 51: Clustered Repeats and Regulatory Sites

0 500000 1000000 1500000 2000000 2500000 3000000 3500000 4000000 45000000

2

4

6

8

10

12

14

16

18

20

Terminator Distance Relative to Genomic Position

Genome Length (nt)

Term

inat

or D

istan

ce (n

t)

Page 52: Clustered Repeats and Regulatory Sites

Conclusion• Appear to exhibit some degree of positional

preference• Reasons remain unclear

• Further studies: Length of terminator Function of operons

Page 53: Clustered Repeats and Regulatory Sites

References• Chen, Ying-Ja et al. “Characterization of 582 Natural and Synthetic Terminators and

Quantification of Their Design Constraints.” Nature methods 10.7 (2013): 659–64. Web. 20 Mar. 2014.

• Ermolaeva, M D et al. “Prediction of Transcription Terminators in Bacterial Genomes.” Journal of molecular biology 301.1 (2000): 27–33. Web. 4 Apr. 2014.

• Kingsford, Carleton L, Kunmi Ayanbule, and Steven L Salzberg. “Rapid, Accurate, Computational Discovery of Rho-Independent Transcription Terminators Illuminates Their Relationship to DNA Uptake.” Genome biology 8.2 (2007): R22. Web. 17 Apr. 2014.

• Lai, Fu-Jou et al. “Identifying Functional Transcription Factor Binding Sites in Yeast by Considering Their Positional Preference in the Promoters.” PloS one 8.12 (2013): e83791. Web. 10 Apr. 2014.

• Lau, Lester F et al. “A Potential Stem-Oop Structure and the Sequence CAAUCAA in the Transcript Are Insufficient to Signal Q-Dependent Transcription Termination at XtR1.” 12.2 (1984): 1287–1299. Print.

• Macke, T J et al. “RNAMotif, an RNA Secondary Structure Definition and Search Algorithm.” Nucleic Acids Research 29.22 (2001): 4724–35.

• Mooney, Rachel Anne, and Robert Landick. “Building a Better Stop Sign: Understanding the Signals That Terminate Transcription.” Nature Methods 10.7 (2013): 618–619. Web. 21 Mar. 2014.

• Naville, Magali et al. “ARNold: A Web Tool for the Prediction of Rho-Independent Transcription Terminators.” RNA Biology 8.1 (2011): 11–13. Web. 8 Apr. 2014.

Page 54: Clustered Repeats and Regulatory Sites

Resemblances and differences between promoter sequences in E. coli and S. entericaLiam Lewis

Page 55: Clustered Repeats and Regulatory Sites

Inspiration• Novel sequence-based method for identifying

transcription factor binding sites in prokaryotic genomes

• Results found promoters with high probability

Page 56: Clustered Repeats and Regulatory Sites

Background of Promoter sequences• Regulatory Elements • -35 and -10 consensus sequence• Sigma factor + RNA Polymerase

Page 57: Clustered Repeats and Regulatory Sites

Program used to identify promoters• PePPER• Uses PSSMs and Hidden markov Models• Algorithm is universal for prokaryotes

Page 58: Clustered Repeats and Regulatory Sites

Biobike implementation• Biobike to compare both outputs from PePPER.

Page 59: Clustered Repeats and Regulatory Sites

What’s next?• Comparison of results• Biobike algorithm to accurately predict promoters

Page 60: Clustered Repeats and Regulatory Sites

Comparison of Repressor-Operator Sequences in Lambda and other Temperate PhagesShahroze Abbas

Page 61: Clustered Repeats and Regulatory Sites

Repressor Sequences in Lambda• Two possible life cycles, dependent upon either

Lambda repressor or Cro repressor. • cI repressor maintains lysogenic state• Cro repressor initiates a switch to the lytic state• Significance of intergenic sequences and

neighboring genes to determine ‘hypothetical proteins’ in other organisms similar

Page 62: Clustered Repeats and Regulatory Sites

Comparison of Repressor Sequence• Lambda repressor sequence tested for occurrence

in other phages• Enterobacteria phage Sfl• Enterobacteria phage HK244• Enterobacteria phage HK542• Enterobacteria phage HK544• Enterobacteria phage HK 106• Enterobacteria phage CL707

Page 63: Clustered Repeats and Regulatory Sites

Still to come…• Analysis of operator sequences• Comparison of cro repressor in other phages• Trend or pattern to determine function of

neighboring proteins in other phages• Trend or pattern in sequences between phages