identification and evolution of transcription factors in stramenopiles · title: identification and...

1
High diversity among Stramenopiles Organism Size Genes Proteins Source Phytophthora infestans 237 Mb 18179 18140 BROAD Phytophthora ramorum 65 Mb 15743 16066 DOEJGI Phytophthora capsici 64 Mb 19805 15919 DOEJGI Phytophtora sojae 86 Mb 19027 19276 DOEJGI Saprolegnia. parasi7ca 53.09 Mb 20113 20088 BROAD Hyaloperonospora parasi7ca 82.05 Mb 14567 14565 BROAD Pythium ul7mum 44.91 Mb 15291 12614 BROAD Thalassiosira pseudonana 32 Mb 11397 11318 DOEJGI Phaeodactylum tricornutum 28 Mb 10489 10389 DOEJGI Rules System Search Hmmscan Proteomes 58 Families Pfam V. 25 In House Models Assign Family Outlook Deep Analysis TFs/TRs families Phylogenetic independent contrasts (PIC) qPCR Inference of gene regulatory networks Identification and Evolution of Transcription Factors in Stramenopiles Francisco Buitrago-Florez, Silvia Restrepo, Diego Riaño-Pachón E-Mail: [email protected] URL: http://bce.uniandes.edu.co/ Identification of these mechanisms? Mechanisms of diversity? Evolutionary Relationships Among Stramenopiles: A Phylogenomic Approach The Relative Importance of TFs and TRs greatly varies among species Species included in this study Phylogenetic Signal DMRP greatly acknowledges funding from Faculty of Sciences of the Universidad de los Andes through the program Funding Support for Assistant Professors. Group of Computational and Evolutionary Biology of the Universidad de los Andes - BCE Laboratory of Mycology and Phytopathology of the Universidad de los Andes – LAMFU Javier Felipe Tabima M.Sc. for his help on Phylogenetic Analysis. Acknowledegements A rules system for classification of TFs and TRs Hierarchical Clustering of Species based on TFs and TRs family size Transcription regulation model A) General transcription factors (GTFs) recruit RNA polymerase II in order to start the transcription of the gene, generally, this type of transcription factors are common among genes transcribed by RNA-Pol II, B) Regulatory transcriptions factors (TFs) interact with the gene promoter to regulate gene transcription, this transcription factors are usually specific for each gene and have been found to be linked to the generation of diversity in organisms (Komberg, 2007) C) Other transcription regulators (TRs) as the Mediator complex participate in the process of transcription, but generally do not interact with DNA. A) B) C) Rules for the classification of TFs and TRs depicted as a bipartite graph. Blue squares represent families, TFs are indicated in solid color, TRs are indicated by shaded squares. Yellow circles represent protein domains from the PFAM database (htttp:// pfam.sanger.ac.uk/), orange circles represent domains generated in-house. In house domain models were generated based on seed alignments and profile HMMs were built using HMMER 3.0 (http://hmmer.janelia.org/) Columns represent selected families of TFs and TRs, rows represent the species included in this study and bars represent the relative importance: the proportion of a family over the total number of regulators. This analysis will highlight the importance of different TFs and TRs families. We can see that some families have greater representativity in some organisms. In some species, however, there is a homogeneity among families. In addition of Stramenopiles, we also identified and classified families in other organisms from different lineages. We can see that species are clustering according to their evolutionary relationships, as in Oomycetes, Stramenopiles, Protists, Plants, Metazoan and Fungi. The only exception is O. tauri, which belongs to the lineage of plants and is grouped within the Oomycetes, which are heterotrophic, this might arise due to its small genome size, a feature shared with parasitic organisms. First, we recovered orthologous groups using Ortho-MCL (http:// www.orthomcl.org/), finding 781 genes clusters. The phylogenetic reconstruction was carried out with Maximum Likelihood analysis using RAxML (Stamatakis, 2008) with 1000 bootstraps. In the tree we can see a clear separation between autotrophic organisms (Green), and the heterotrophic organisms (Blue). A contrast between the species phylogenetic tree and the family size hierarchical clustering shows that both trees have the same topology, implying that TFs and TRs families-sizes carry a strong phylogenetic signal. We use all the available genome sequences of Stramenopiles, there are large differences on genome size, number of genes and number of predicted proteins in this group. Phytophthora infestans Phytophthora sojae Phytophthora ramorum Hyaloperonospora parasitica Pythium ultimum Saprolegnia parasitica Phytophthora capsici Phaeodactylum tricornutum Thalassiosira pseudonana Dictyostelium discoideum Arabidopsis thaliana Physcomitrella patens Drosophila melanogaster Nematostella vectensis Schistosoma mansoni Trichoplax adhaerens Caenorhabditis elegans Neurospora crassa Saccharomyces cerevisiae Aspergillus niger Puccinia graminis Oomycetes Stramenopiles Protist Plantae Metazoa Fungi Ostreococcus tauri Stramenopiles is a Protist group that range from large multicellular seaweeds to tiny unicellular species; their members are present in freshwater, marine and terrestrial habitats, and embrace many ecologically important organisms (e.g., diatoms, brown algae) and many species of economically importance such as Phytophthora infestans, an important pathogen of potato, or Saprolegina parasitica, a major fish pathogen. 781 Orthologous Genes RAxML – 1000 Bootstraps Phytophthora sojae Phytophthora infestans Phytophthora capsici Phytophthora ramorum Hyaloperonospora parasitica Pythium ultimum Saprolegnia parasitica Phaeodactylum tricornutum Thalassiosira pseudonana Kornberg, R. 2007. The molecular basis of eukaryotic transcription. PNAS. 104:12955-12961. Lang, D. et al., 2010. Genome-Wide phylogenetic comparative analysis of plant transcriptional regulation: A timeline of loss, gain, expansion, and correlation with complexity. Genome Biology and Evolution. 2:488-503. Pérez-Rodriguez, P. et al., 2010. PlnTFDB: updated content and new features of the plant transcription factor database. Nucleic Acids Research. 38: Database issue doi:10.1093/nar/gkp805. Riaño-Pachón, D. et al., 2007. PlnTFDB: an integrative plant transcription factor database. BCM Bioinformatics. 8:42. Stamatakis, A. et al., 2005. RAxML-OMP: An efficient program for phylogenetic inference on SMPs. In proceedings of 8th international conference on parallel computing technologies, 36:288-302. Bibliography Phytophthora capsici Phytophthora ramorum Saprolegnia parasitica Phaeodactylum tricornutum Pythium ultimum Phytophthora infestans Phytophthora sojae Thalassiosira pseudonana Hyaloperonospora parasitica 100 Autotrophic Heterotrophic Nature Precedings : doi:10.1038/npre.2011.6672.1 : Posted 6 Dec 2011

Upload: ngolien

Post on 03-Dec-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Identification and Evolution of Transcription Factors in Stramenopiles · Title: Identification and Evolution of Transcription Factors in Stramenopiles Author: Francisco Buitrago,

High diversity among Stramenopiles

Organism   Size   Genes   Proteins    

Source  

Phytophthora  infestans   237  Mb   18179   18140   BROAD  

Phytophthora  ramorum   65  Mb   15743   16066   DOE-­‐JGI  

Phytophthora  capsici   64  Mb    

19805   15919   DOE-­‐JGI    

Phytophtora  sojae   86  Mb   19027   19276   DOE-­‐JGI  

Saprolegnia.  parasi7ca   53.09  Mb   20113   20088   BROAD  

Hyaloperonospora  parasi7ca   82.05  Mb   14567   14565   BROAD  

Pythium  ul7mum   44.91  Mb   15291   12614   BROAD  

Thalassiosira  pseudonana   32  Mb   11397   11318   DOE-­‐JGI  

Phaeodactylum  tricornutum   28  Mb   10489   10389   DOE-­‐JGI  

Rules  System  

Search  Hmmscan!

Proteomes  

58  Families  

Pfam  V.  25   In  House  Models  

Assign  Family  

!

"!#!$!

%!

&!

'!

"( )*+,-./.0!10.!+./.!)(/2345!63.!.0!372!8927/2:!#( ;/42+.7*859.!$( <4*/50/!%( <927/0!&( =./2>*2!'( ?37@5!

=./*A*B!C./D..7!E4*380!

F7/.4G29*B!H*.I5-5.7/.!A.!-*44.92-5J7!A.!<.240*7(!!

Outlook Deep Analysis TFs/TRs families

Phylogenetic independent contrasts (PIC) qPCR

Inference of gene regulatory networks

Identification and Evolution of Transcription Factors in Stramenopiles Francisco Buitrago-Florez, Silvia Restrepo, Diego Riaño-Pachón

E-Mail: [email protected] URL: http://bce.uniandes.edu.co/

Identification of these mechanisms?

Mechanisms of diversity?

Evolutionary Relationships Among Stramenopiles: A Phylogenomic Approach

The Relative Importance of TFs and TRs greatly varies among species

Species included in this study

Phylogenetic Signal

DMRP greatly acknowledges funding from Faculty of Sciences of the Universidad de los Andes through the program Funding Support for Assistant Professors. Group of Computational and Evolutionary Biology of the Universidad de los Andes - BCE Laboratory of Mycology and Phytopathology of the Universidad de los Andes – LAMFU Javier Felipe Tabima M.Sc. for his help on Phylogenetic Analysis.

Acknowledegements

A rules system for classification of TFs and TRs

Hierarchical Clustering of Species based on TFs and TRs family size

Transcription regulation model A) General transcription factors (GTFs) recruit RNA polymerase II in order to start the transcription of the gene, generally, this type of transcription factors are common among genes transcribed by RNA-Pol II, B) Regulatory transcriptions factors (TFs) interact with the gene promoter to regulate gene transcription, this transcription factors are usually specific for each gene and have been found to be linked to the generation of diversity in organisms (Komberg, 2007) C) Other transcription regulators (TRs) as the Mediator complex participate in the process of transcription, but generally do not interact with DNA.

A)

B)

C)

Rules for the classification of TFs and TRs depicted as a bipartite graph. Blue squares represent families, TFs are indicated in solid color, TRs are indicated by shaded squares. Yellow circles represent protein domains from the PFAM database (htttp://pfam.sanger.ac.uk/), orange circles represent domains generated in-house. In house domain models were generated based on seed alignments and profile HMMs were built using HMMER 3.0 (http://hmmer.janelia.org/)

Columns represent selected families of TFs and TRs, rows represent the species included in this study and bars represent the relative importance: the proportion of a family over the total number of regulators. This analysis will highlight the importance of different TFs and TRs families. We can see that some families have greater representativity in some organisms. In some species, however, there is a homogeneity among families.

In addition of Stramenopiles, we also identified and classified families in other organisms from different lineages. We can see that species are clustering according to their evolutionary relationships, as in Oomycetes, Stramenopiles, Protists, Plants, Metazoan and Fungi. The only exception is O. tauri, which belongs to the lineage of plants and is grouped within the Oomycetes, which are heterotrophic, this might arise due to its small genome size, a feature shared with parasitic organisms.

First, we recovered orthologous groups using Ortho-MCL (http://www.orthomcl.org/), finding 781 genes clusters. The phylogenetic reconstruction was carried out with Maximum Likelihood analysis using RAxML (Stamatakis, 2008) with 1000 bootstraps. In the tree we can see a clear separation between autotrophic organisms (Green), and the heterotrophic organisms (Blue).

A contrast between the species phylogenetic tree and the family size hierarchical clustering shows that both trees have the same topology, implying that TFs and TRs families-sizes carry a strong phylogenetic signal.

We use all the available genome sequences of Stramenopiles, there are large differences on genome size, number of genes and number of predicted proteins in this group.

Phytophthora infestans Phytophthora sojae

Phytophthora ramorum Hyaloperonospora parasitica

Pythium ultimum Saprolegnia parasitica

Phytophthora capsici Phaeodactylum tricornutum Thalassiosira  pseudonana

 Dictyostelium discoideum Arabidopsis thaliana

Physcomitrella patens Drosophila melanogaster

Nematostella vectensis Schistosoma mansoni Trichoplax adhaerens

Caenorhabditis elegans Neurospora crassa

Saccharomyces cerevisiae Aspergillus niger

Puccinia graminis

Oom

ycet

es

Stra

men

opile

s P

rotis

t

Plantae

Metazoa

Fungi

Ostreococcus tauri

Stramenopiles is a Protist group that range from large multicellular seaweeds to tiny unicellular species; their members are present in freshwater, marine and terrestrial habitats, and embrace many ecologically important organisms (e.g., diatoms, brown algae) and many species of economically importance such as Phytophthora infestans, an important pathogen of potato, or Saprolegina parasitica, a major fish pathogen.

781 Orthologous Genes RAxML – 1000 Bootstraps

Phytophthora sojae

Phytophthora infestans

Phytophthora capsici

Phytophthora ramorum

Hyaloperonospora parasitica

Pythium ultimum

Saprolegnia parasitica

Phaeodactylum tricornutum

Thalassiosira  pseudonana  

Kornberg, R. 2007. The molecular basis of eukaryotic transcription. PNAS. 104:12955-12961. Lang, D. et al., 2010. Genome-Wide phylogenetic comparative analysis of plant transcriptional regulation: A timeline of loss, gain, expansion, and correlation with complexity. Genome Biology and Evolution. 2:488-503. Pérez-Rodriguez, P. et al., 2010. PlnTFDB: updated content and new features of the plant transcription factor database. Nucleic Acids Research. 38: Database issue doi:10.1093/nar/gkp805. Riaño-Pachón, D. et al., 2007. PlnTFDB: an integrative plant transcription factor database. BCM Bioinformatics. 8:42. Stamatakis, A. et al., 2005. RAxML-OMP: An efficient program for phylogenetic inference on SMPs. In proceedings of 8th international conference on parallel computing technologies, 36:288-302.

Bibliography

Phytophthora capsici

Phytophthora ramorum

Saprolegnia parasitica

Phaeodactylum tricornutum

Pythium ultimum

Phytophthora infestans

Phytophthora sojae

Thalassiosira  pseudonana  

Hyaloperonospora parasitica

100

Autotrophic

Heterotrophic

Nat

ure

Pre

cedi

ngs

: doi

:10.

1038

/npr

e.20

11.6

672.

1 : P

oste

d 6

Dec

201

1