chimeric plastid proteome in the florida “red tide...

43
and Evolution. All rights reserved. For permissions, please e-mail: [email protected] The Author 2006. Published by Oxford University Press on behalf of the Society for Molecular Biology 1 Research Article: Chimeric Plastid Proteome in the Florida “Red Tide” Dinoflagellate Karenia brevis Tetyana Nosenko, Kristy L. Lidie 1 , Frances M. Van Dolah 1 , Erika Lindquist 2 , Jan-Fang Cheng 3 , US Department of Energy - Joint Genome Institute 4 , and Debashish Bhattacharya* University of Iowa, Department of Biological Sciences and the Roy J. Carver Center for Comparative Genomics, 446 Biology Building, Iowa City, Iowa 52242, USA 1 Marine Biotoxins Program, NOAA National Ocean Service, Center for Coastal Environmental Health and Biomolecular Research, Charleston, South Carolina 2 DOE Joint Genome Institute and Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA 3 Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, MS 84-171, Berkeley, CA 94720, USA 4 Corporate author, see Supplementary Material *Corresponding author: University of Iowa, Department of Biological Sciences and the Roy J. Carver Center for Comparative Genomics, 446 Biology Building, Iowa City, Iowa 52242-1324, USA, Telephone: (319) 335-1977, Fax: (319) 335-1069, Email: [email protected] Key words: endosymbiosis, endosymbiotic gene transfer, Karenia brevis, proteome, red tide Running head: Karenia plastid proteome MBE Advance Access published July 28, 2006

Upload: others

Post on 26-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

and Evolution. All rights reserved. For permissions, please e-mail: [email protected] The Author 2006. Published by Oxford University Press on behalf of the Society for Molecular Biology

1

Research Article:

Chimeric Plastid Proteome in the Florida “Red Tide” Dinoflagellate

Karenia brevis

Tetyana Nosenko, Kristy L. Lidie1, Frances M. Van Dolah1, Erika Lindquist2, Jan-Fang Cheng3, US

Department of Energy - Joint Genome Institute4, and Debashish Bhattacharya*

University of Iowa, Department of Biological Sciences and the Roy J. Carver Center for Comparative

Genomics, 446 Biology Building, Iowa City, Iowa 52242, USA

1Marine Biotoxins Program, NOAA National Ocean Service, Center for Coastal Environmental Health

and Biomolecular Research, Charleston, South Carolina

2DOE Joint Genome Institute and Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut

Creek, CA 94598, USA

3Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, MS 84-171,

Berkeley, CA 94720, USA

4Corporate author, see Supplementary Material

*Corresponding author: University of Iowa, Department of Biological Sciences and the Roy J. Carver

Center for Comparative Genomics, 446 Biology Building, Iowa City, Iowa 52242-1324, USA,

Telephone: (319) 335-1977, Fax: (319) 335-1069, Email: [email protected]

Key words: endosymbiosis, endosymbiotic gene transfer, Karenia brevis, proteome, red tide

Running head: Karenia plastid proteome

MBE Advance Access published July 28, 2006

Page 2: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

2

Abstract

Current understanding of the plastid proteome comes almost exclusively from studies of plants and red

algae. The proteome in these taxa has a relatively simple origin via integration of proteins from a single

cyanobacterial primary endosymbiont and the host. However, the most successful algae in marine

environments are the chlorophyll c-containing chromalveolates such as diatoms and dinoflagellates that

contain a plastid of red algal origin derived via secondary or tertiary endosymbiosis. Virtually nothing is

known about the plastid proteome in these taxa. We analyzed expressed sequence tag data from the toxic

“Florida red tide” dinoflagelate Karenia brevis that has undergone a tertiary plastid endosymbiosis.

Comparative analyses identified 30 nuclear encoded plastid-targeted proteins in this chromalveolate that

originated via endosymbiotic or horizontal gene transfer from multiple different sources. We identify a

fundamental divide between plant/red algal and chromalveolate plastid proteomes that reflects a history of

mixotrophy in the latter group resulting in a highly chimeric proteome. Loss of phagocytosis in the “red”

and “green” clades effectively froze their proteomes, whereas chromalveolate lineages retain the ability to

engulf prey allowing them to continually recruit new, potentially adaptive genes through subsequent

endosymbioses and horizontal gene transfers. One of these genes is an electron transfer protein

(plastocyanin) of green algal origin in K. brevis that likely allows this species to thrive under conditions

of iron depletion.

Page 3: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

3

Introduction

Oxygenic photosynthesis, the ability to harvest solar energy to fix carbon dioxide and produce organic

material evolved in cyanobacteria 2.5-3 billion years ago (Schopf 1993; Hedges et al. 2001). The advent

of photosynthesis was arguably the most important event in the evolution of life. It changed the redox

balance permitting the development and further metabolic and structural diversification of eukaryotic life

forms (Blankenship 2001). Eukaryotes acquired the ability to photosynthesize about 1.5 billion years ago

(Hedges et al. 2004; Yoon et al. 2004; Hackett et al. 2006) through the capture and retention of a

cyanobacterium in the “host” cytoplasm. This primary endosymbiosis gave rise to the plastid (e.g.,

chloroplast) in the common ancestor of three extant primary lineages: green algae and land plants

(Viridiplantae), red algae (Rhodophyta), and Glaucophyta (Bhattacharya and Medlin 1995; Cavalier-

Smith 2004), together known as the Plantae and more recently, the Archaeplastida (Adl et al. 2005).

Thereafter, seven other groups of eukaryotes gained their plastid through secondary endosymbiosis,

whereby a non-photosynthetic protist engulfed an existing alga and retained its plastid (Bhattacharya,

Yoon, and Hackett 2004). Separate secondary endosymbioses with green algae are believed to have given

rise to the plastid in euglenoids and chlorarachniophytes. Red algae contributed the plastid to a variety of

protist lineages putatively united in the supergroup Chromalveolata that includes alveolates

(apicomplexans, ciliates, and dinoflagellates) and chromists (cryptophytes, haptophytes, and

stramenopiles) (Cavalier-Smith 1999). The number of red algal secondary endosymbiotic events that gave

rise to the chromalveolate plastid is under debate (Bhattacharya, Yoon, and Hackett 2004) although recent

data suggest a single origin (Harper and Keeling 2004; Yoon et al. 2005; Li et al. 2006; Weber, Linka,

and Bhattacharya 2006). The ancestral red algal plastid was however not maintained in all

chromalveolates with its loss in ciliates, its diminution (apicoplast) to a remnant genome in the obligate

parasites, apicomplexans, and its replacement with a plastid from a different alga on at least five separate

occasions in dinoflagellates (Saldarriaga 2001; Ishida and Green 2002; Bhattacharya, Yoon, and Hackett

2004; Hackett et al. 2006). In Lepidodinium viride the broadly distributed peridinin containing

dinoflagellate plastid of red algal origin was replaced by one of green-algal origin. In other species the

Page 4: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

4

ancestral plastid was replaced with one from a cryptophyte, stramenopile, or haptophyte through tertiary

endosymbiosis (i.e., capture of a plastid of secondary endosymbiotic origin (Hackett et al. 2004a; Yoon et

al. 2005).

The dinoflagellates that acquired their plastid from a haptophyte alga are known as fucoxanthin

dinoflagellates because of the presence of this light harvesting protein in the endosymbiont. Given this

complex series of events, and the potential for large-scale transfer of endosymbiont genes from all of its

genomes to the host nucleus (endosymbiotic gene transfer, EGT) (Martin et al. 2002), the complement of

plastid proteins (proteome) in fucoxanthin dinoflagellates could quite literally be composed of a

menagerie of proteins of red and haptophyte (and potentially other) origins. Much of what we know about

plastid proteomes comes from analysis of the green and red lineages that have a significantly simpler

evolutionary history with a single cyanobacterial endosymbiosis and no known cases of phagotrophy.

Dinoflagellates on the other hand have undergone plastid replacements and are important grazers in the

marine environment. The capacity to engulf different algae and bacteria as prey provides the opportunity

to accumulate foreign genes in the dinoflagellate nucleus. These horizontally transferred genes could

potentially add to or replace genes involved in plastid function (Archibald et al. 2003; Hackett et al.

2004b; Li et al. 2006).

To determine the origins of the plastid proteome in a tertiary endosymbiosis, we reconstructed the

evolutionary history of nuclear encoded plastid targeted proteins in the fucoxanthin “Florida red tide”

dinoflagellate Karenia brevis. The plastid-targeted proteins were identified in two extensive EST libraries

from this species. The phylogenetic approach allowed us to reconstruct the chain of endosymbiotic gene

transfer events that resulted in the complex plastid proteome in K. brevis. We also demonstrate a

significant contribution of green algal-derived genes to the chromalveolate plastid proteome. Our data

underline the great plasticity of the plastid biochemical machinery and provide insights into the molecular

mechanisms of plastid establishment.

Materials and Methods

Page 5: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

5

Generating the K. brevis ESTs

Two cDNA libraries were generated from clonal K. brevis Wilson cells cultured under 25 ± 1°C on a

16:8-hour light-dark cycle. The Dark library was constructed using log phase cells grown on seawater

enriched with f/2 medium and during the dark phase of the diel cycle. The cDNA clones were sequenced

from the 5’ end. A total of 7,001 ESTs from the Dark library are available from the NCBI expressed

sequence tag database (ESTdb and at www.marinegenomics.org), which comprise 5,280 non-redundant

gene clusters. For complete details about Dark library generation, sequencing, and processing see Lidie et

al. (2005). The non-normalized Stress library was constructed using K. brevis Wilson cells grown under

five different culture conditions: 1) nitrate-depletion, 2) phosphate-depletion, 3) in log phase under replete

conditions, harvested during the light phase 4) in the presence of oxidative metals, 5) undergoing heat

stress. Total RNA from each treatment was purified with Trireagent (Molecular Research Center,

Cincinnati, OH) and pooled to construct a cDNA library in the Lambda Zap II vector system (Strategene,

La Jolla, CA) as previously described. A total of 22,272 ESTs from the Stress library and an additional

9,984 ESTs from the Dark library were sequenced from both 5’- and 3’-termini. This work was done by

JGI.

The EST data were processed using Trace2dbest V2.1 (Parkinson and Blaxter 2004). Low

complexity regions of the DNA sequences were masked using lower case and RepeatMasker V3.15

(http://www.repeatmasker.org/). Clustering and assembly of the EST data was done with TGICL

(http://www.tigr.org/tdb/tgi/software/) (Pertea et al. 2003) and resulted in 9,784 non-redundant gene

clusters.

Identification of Plastid Proteins and Phylogenetic Analysis

Plastid-targeting proteins were identified using sequence similarity search (BLAST) against the GenBank

non-redundant database (nr). The primary structure of identified sequences has been analyzed to confirm

nuclear localization of the gene (presence of poly-A tail, 3’ untranslated region [UTR] and plastid

localization of its product [i.e., presence of a 5’ plastid targeting leader peptide]). The 5’ leader peptides

Page 6: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

6

were analyzed using the following protein prediction programs: SignalP

(http://www.cbs.dtu.dk/services/SignalP/), TargetP (plant version)

(http://www.cbs.dtu.dk/services/TargetP), PSORT (http://www.psort.org/), and PATS

(http://gecco.org.chemie.uni-frankfurt.de/pats/pats-index.php). Protein precursors in chromalveolates are

considered to be plastid targeted if they contain an N-terminal extension composed of a signal peptide

(SignalP probability ≥ 0.7) and transit peptide. Any amino acid sequence separating the signal peptide and

mature protein is considered a transit peptide. The length of the N-terminal extension predicted by the

computer programs was verified using the protein alignments of the K. brevis sequence with its homologs

from other taxa. Identified K. brevis sequences encoding plastid-targeted proteins have been deposited in

GenBank under accession numbers DQ531572 – DQ531601 that are listed in Table 1.

Protein alignments were constructed using homologous sequences identified using BLAST

searches (e- value cut-off < 10-10) against the GenBank nr and dbEST databases and public protist

databases including C. merolae (Matsuzaki et al. 2004) (http://merolae.biol.s.u-tokyo.ac.jp/), Galdieria

sulphuraria (Weber et al. 2004) (Michigan State University Galdieria Database

http://genomics.msu.edu/galdieria/sequence_data.html), Porphyra yezoensis (Asamizu et al. 2003),

(http://www.kazusa.or.jp/en/plant/porphyra/EST), Chlamydomonas reinhardtii (http://genome.jgi-

psf.org/Chlre3/Chlre3.info.html), Phaeodactylum tricornutum (Scala et al. 2002)

(http://avesthagen.sznbowler.com/), Thalassiosira pseudonana (Armbrust et al. 2004), and unpublished

Cyanophora paradoxa data (ongoing project in our lab). In addition, search by gene/protein name was

used to identify homologous sequences from the Protist EST Program database

(http://amoebidia.bcm.umontreal.ca/public/pepdb/welcome.php).

Amino acid sequences for each protein were aligned using ClustalW and manually refined under BioEdit

(http://www.mbio.ncsu.edu/BioEdit/bioedit.html). The 6-protein concatenated protein alignment was

assembled using BioEdit. For each data set, a phylogeny was reconstructed under maximum likelihood

(ML) using the PHYML V2.4.3 computer program (Guindon and Gascuel 2003; Guindon et al. 2005)

using the WAG + Γ + I evolutionary model and tree optimization. The alpha values for the gamma

Page 7: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

7

distribution were calculated using eight rate categories. To assess the stability of monophyletic groups in

the ML trees, we calculated PHYML bootstrap (100 replicates) support values (Felsenstein 1985). In

addition, we calculated bootstrap values (500 replications) using the neighbor joining (NJ) method with

JTT+ Γ distance matrices (PHYLIPV3.63, http://evolution.genetics.washington.edu/phylip.html). The NJ

analysis was done with randomized taxon addition. We calculated Bayesian posterior probabilities for

nodes in the ML tree using MrBayes V3.0b4 (Huelsenbeck and Ronquist 2001) and the WAG + Γ model.

The Metropolis-coupled Markov chain Monte Carlo from a random starting tree was run for 1,000,000

generations with trees sampled each 1,000 cycles. The initial 20,000 cycles (200 trees) were discarded as

the ‘‘burn in.’’ A consensus tree was made with the remaining 800 phylogenies to determine the posterior

probabilities at the different nodes. Manual inspection of the trees produce by the MCMCMC tree search

suggested convergence by 200,000 generations.

Results

We identified sequences encoding 30 different plastid-targeted proteins from two EST libraries of K.

brevis (see Table 1). The evolutionary origins of 22 of these genes have been resolved. The sequences

encoding K. brevis light harvesting proteins were counted as one protein and were omitted from the

phylogenetic analysis due to the presence of multiple paralogs in different photosynthetic lineages. Our

phylogenetic analyses of the remaining genes show that most of the K. brevis plastid-targeted proteins are

shared by different lineages of chromalveolates. Thirteen of them were derived by the chromalveolate

ancestor from its red algal endosymbiont, six proteins originate from green algae, and three plastid-

targeted proteins have arisen through the duplication of genes encoding cytosolic proteins in the ancestral

chromalveolate (Table 1). After the replacement of the peridinin-containing plastid with the haptophyte-

derived plastid, 12 of these genes were acquired by fucoxanthin dinoflagellates from their tertiary

haptophyte endosymbiont and one gene was derived from the genome of the dinoflagellate ancestor.

Origins of the remaining nine proteins in the tertiary plastid proteome were unresolved. Here we classify

Page 8: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

8

proteins by their origin in the secondary plastid (i.e., red algal-derived) proteome that is typical for most

chromalveolates.

Red Algal Contribution to the K. Brevis Proteome through Secondary and Tertiary

Endosymbioses

The 13 proteins acquired by chromalveolates from their secondary red algal endosymbiont trace their

origins to the primary endosymbiosis: i.e., 11 “cyanobacterial” proteins and two proteins of pro-

mitochondrial origin recruited for a plastid function before the Archaeplastida divergence. We resolved

unambiguously the phylogeny of six of the “cyanobacterial” proteins: ATP synthase gamma subunit

(encoded by atpC), oxygen-evolving-enhancer 1 protein (psbO), cytochrome b6 (petC), photosystem II

12kD extrinsic protein (psbU), phosphoglycerate mutase (PGAM), and beta-ketoacyl-ACP reductase

(fabG) (see Fig. S1 in the Supplementary Material). These genes were transferred from a cyanobacterium

to a red alga (primary endosymbiosis), from the red alga to the chromalveolate ancestor (secondary

endosymbiosis), and finally from a haptophyte alga to fucoxanthin dinoflagellates (tertiary

endosymbiosis). To robustly position K. brevis within a tree of plastid-targeted nuclear proteins, we

generated a 6-protein alignment using these proteins. The resulting tree strongly supports the monophyly

of the K. brevis and haptophyte sequences and the expected monophyly of red algae and chromalveolates

(bootstrap proportions, maximum likelihood, BPml = 100%; neighbor joining, BPnj = 100%; Bayesian

posterior probability, BPP = 1.0 for both nodes; see Fig. 1). The red algal-derived plastid clade includes

representatives of both groups of chromalveolates, the chromists represented by haptophytes and

stramenopiles and the alveolates represented by peridinin dinoflagellates. This tree of nuclear-encoded

proteins provides significant support for the origin of all chromalveolates from a single common ancestor

that all initially shared a red algal secondary endosymbiont (Bhattacharya, Yoon, and Hackett 2004; Li et

al. 2006; Weber, Linka, and Bhattacharya 2006).

The phylogeny of translation elongation factor G (EF-G encoded by fusA) is similar to the tree

shown in Fig. 1, but is a host contribution to the primary plastid proteome. EF-G catalyzes the

Page 9: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

9

translocation step of protein synthesis in prokaryotic ribosomes including plastids and mitochondria

(Breitenberger and Spremulli 1980). Analysis of organellar EF-Gs in photosynthetic algae using

immunological techniques revealed that the two proteins have distinct structures (Breitenberger and

Spremulli 1980). Surprisingly, plastid EF-G, unlike the mitochondrial protein, has a significant structural

similarity to Escherichia coli EF-G. This is believed to be a case of convergent evolution (Breitenberger

and Spremulli 1980). Our analysis provides a phylogenetic perspective on this issue. The phylogenetic

analyses support the origin of plastid fusA from alpha proteobacteria (BPml = 99%, BPnj = 100, BPP =

1.0; Fig. 2). It is believed that a Rickettsia-like proteobacterium gave rise to the mitochondrion in

eukaryotes (Gray 1998). The inclusion of mitochondrial-targeted sequences into the fusA tree shows that

they form a separate clade that is closely related to delta-proteobacteria, spirochaetes, and planctomycetes

(BPml = 100%, BPnj = 100%, BPP = 1.0). Our interpretation of these results is that a fusA sequence

derived by HGT from a non-photosynthetic bacterium was recruited for protein translation in

mitochondria and later replaced the original mitochondrial fusA. This recruitment occurred prior to the

divergence of plants, animals, and fungi (see Fig. 2). Thereafter, the original pro-mitochondrial fusA

retained in the host nuclear genome replaced its homolog from the cyanobacterial endosymbiont and gave

rise to the plastid fusA in the common ancestor of Archaeplastida. Chromalveolates acquired the gene

encoding plastid EF-G from the red-algal secondary endosymbiont. In turn, fucoxanthin dinoflagellates

gained this gene from the haptophyte tertiary endosymbiont (BPml = 94%, BPnj = 95%, BPP = 1.0).

The evolutionary origin in K. brevis of plastid fructose-1,6-bisphosphatase (FBPase) that is a core

enzyme of the Calvin cycle and glycolysis, parallels fusA. Plastid FBPase arose through the duplication of

the primary host cytosolic FBPase that occurred early in eukaryotic evolution. Similarity of the cytosolic

FBPase of eukaryotes to its proteobacterial homolog supports the pro-mitochondrial origin of this enzyme

(Martin and Schnarrenberger 1997). Here we show that plastid FBPase of chromalveolates was acquired

from the red algal secondary endosymbiont (BPml = 68%, BPnj = 69, BPP = 1.0; Fig. 3). The origin of

FBPase in tertiary endosymbiosis remains however unresolved.

Page 10: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

10

Green Algal Contribution to the Tertiary Plastid Proteome

Six K. brevis plastid-targeted proteins originated in chromalveolates from green algae. These proteins

represent components of the primary plastid proteome: four of them are of a cyanobacterial origin and

two are of host origin (Table 1, Fig. 4). These “green genes” will be dealt with separately.

Glutamate 1-semialdehyde 2,1-aminomutase (GSA-AT) is an enzyme of the class-III

pyridoxal-phosphate-dependent aminotransferase family that is involved in chlorophyll and heme

biosynthesis in the plastid stroma in plants and algae (Kannangara and Gough 1978). Phylogenetic

analysis shows this protein to have a cyanobacterial provenance (BPml = 86 %, BPP = 1.0) and has

originated in chromalveolates from a green algal lateral transfer (BPml = 64%, BPnj = 73%, BPP = 1.0,

Fig. 4A). Chromists and alveolates form a monophyletic group (BPml = 81 %, BPnj = 100%, BPP = 1.0)

suggesting that acquisition of the green gene encoding GSA-AT by chromalveolates predates the

divergence of these two lineages. The position of K. brevis relative to haptophytes and peridinin

dinoflagellates is however unresolved.

Plastid ferredoxin NADP(H) reductase (FNR encoded by petH). FNRs are flavoprotenoids that

catalyze the reversible electron transfer between NADP(H) and electron carrier proteins ferredoxin or

flavodoxin (Arakaki, Ceccarelli, and Carrillo 1997). Plants contain two tissue-specific types of plastidic

FNRs. One of these is present only in plastids from photosynthetic leaf tissue and the second isoform is

present in plastids from non-photosynthetic organs such as root, fruits, and petals (Ceccarelli et al. 2004).

The two isoforms have arisen through duplication of a cyanobacterial gene that occurred early in the

evolution of the Archaeplastida. After the divergence of the three primary photosynthetic lineages, the

ancestral gene was apparently lost in the red algae and the glaucophytes, whereas the green algae lost the

“leaf” type of petH. We find that chromalveolates have only one isoform of petH (Fig. 4B). The

phylogenetic analysis strongly supports (BPml = 100%, BPnj = 89%, BPP = 1.0) the monophyly of

chromists and the “green” clade of the “root” (green-algal) type. PetH in alveolates, apicomplexan, and

peridinin dinoflagellates belong to the “root” type. However these sequences are highly divergent and we

are unable to resolve the position of alveolates in the petH tree. The substitution of the red-algal gene with

Page 11: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

11

one of green-algal origin presumably occurred in the common ancestor of chromalveolates (Fig. 4B). The

petH tree topology supports the monophyly of fucoxanthin dinoflagellates and haptophytes implying that

fucoxanthin dinoflagellates acquired their “green” petH from the tertiary endosymbiont.

Plastocyanin (Pc encoded by petE) is a small (ca. 100 –150 aa) thylakoid lumen copper-binding

protein that is an electron carrier between Photosystems II and I (Gross 1993). Plastocyanin is believed to

be unique to plants, green algae, and some cyanobacteria (Ho and Krogmann 1984; Gross 1993). In red

algae and in chromalveolates that contain a red algal-derived plastid, electron transport from cytochrome

b6/f to Photosystem I is achieved via the iron binding analog of plastocyanin, cytochrome C6 (CytC6)

(Sandmann et al. 1983; Price et al. 1991). The complete genome sequences of Cyanidioschyzon merolae,

Galdieria sulfuraria, and Thalassiosira pseudonana confirm the absence of petE encoding plastocyanin

from the genomes of red algae and stramenopiles. Both, petE and CytC6 have a cyanobacterial origin.

Many cyanobacteria and green algae still retain both genes. In these organisms, CytC6 is the functional

backup of plastocyanin that is activated under copper deficiency (Wood 1978; Ho, Krogmann 1984).

We identified 18 clones encoding the petE sequence in the K. brevis EST libraries; i.e., two

clones from the “dark” library and 16 clones from the “stress” library (see Materials and Methods). These

ESTs are encoded by five closely related genes that have arisen through recent gene duplications. BLAST

searches against NCBI identified petE sequences from K. micrum (ABA55542.1). In addition, we found

two petE encoding ESTs in the haptophyte alga Emiliania huxleyi (CX773129 and CX773088). These

two sequences have identical 3’UTRs indicating that they are derived from a single locus. The K. brevis

and E. huxleyi petE sequences are polyadenylated, encode a complete petE ORF, have a stop codon in the

position conserved among all photosynthetic eukaryotes, and a 3’UTR. The E. huxleyi petE sequences do

not include an N-terminal extension, whereas K. micrum petE encodes an N-terminal extension that has a

structure typical for dinoflagellate plastid targeting signals; i.e., it contains a cleavable 19 aa signal

peptide that directs the protein precursor to the endoplasmic reticulum (ER) followed by a plastid

targeting transit peptide (Fig S2A). Interestingly, the N-terminal extension of all K. brevis petE sequences

differ from the canonical dinoflagellate plastid-targeting peptides (Fig S2B). The N-terminal 48 aa form a

Page 12: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

12

highly hydrophobic, putatively uncleaved signal peptide (P = 0.979 according to SignalP) followed by a

92 aa sequence. The C-terminal thylakoid-spanning motif of the K. brevis plastocyanin leader peptide

shares significant similarity with the corresponding sequence in K. micrum plastocyanin. According to

PSORT (http://www.psort.org/) and TMpred (http://www.ch.embnet.org/software/TMPRED_form.html),

a 48 aa segment can be classified as a type II signal anchor peptide that orients the protein in the N’-

cytoplasm/C’-ER lumen topology in the ER membrane. We propose that plastocyanin targeting to the

plastid involves some alternative mechanism in K. brevis. Study of the prion protein, a brain glycoprotein

involved in various neurodegenerative diseases, provides an example of an alternative protein trafficking

mechanism (Lopez et al. 1990). The prion protein precursor contains an uncleaved N-terminal segment

that functions as a membrane-anchor motif in cell-free systems. However in vivo prion protein precursors

are translocated across the ER membrane with the assistance of cytosolic factors and trafficked to the cell

surface.

The distribution of petE suggests that haptophytes and fucoxanthin dinoflagellates acquired this

gene either from a green alga or a cyanobacterium. Phylogenetic analysis of the protein provides strong

support (BPml = 89%, BPP = 1.0) for the divergence of haptophyte and dinoflagellate petE within the

green clade (Fig. 4C). Based on this tree topology, we suggest that the haptophyte ancestor acquired petE

from a green alga. Thereafter fucoxanthin dinoflagellates captured this gene from the haptophyte tertiary

endosymbiont. CytC6 has not been found in K. brevis, but it was identified in the EST library of K.

micrum (PEPdb accession number KME00008061;

http://amoebidia.bcm.umontreal.ca/public/pepdb/welcome.php), in different species of haptophytes

including E. huxleyi (GenBank accession number CX774985), and in peridinin dinoflagellates

(Alexandrium tamarense, Heterocapsa triquetra, Lingulodinium polyedrum, and Amphidinium carterae).

Thus, fucoxanthin dinoflagellates, like green algae, possess both electron carrier proteins. The results of a

recent microarray analysis show that K. brevis petE is expressed under both normal and iron-limited

culture conditions (KL Lidie and F Van Dolah unpublished data). PetE transcript accumulation is

apparently not affected by a change in iron concentration in the medium. Based on these observations, we

Page 13: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

13

hypothesize that in fucoxanthin dinoflagellates, like in green algae, copper-binding plastocyanin is used to

transfer electrons from the cytochrome b6/f to Photosystem I dominant over CytC6. The expression

pattern of CytC6 in fucoxanthin dinoflagellates remains to be determined.

Gamma-tocopherol O-methyltransferase (γ-Tmt) is the final enzyme of the γ-tocopherol (vitamin

E) synthesis pathway that is found only in photosynthetic organisms (Sattler et al. 2003). It is believed

that tocopherol protects membrane fatty acids from oxidative degradation by scavenging reactive oxygen

species produced by photosynthesis (Fryer 1993). In plants and algae, tocopherol biosynthesis occurs on

the plastid membrane. Until now, proteins involved in tocopherol biosynthesis have been reported only in

plants, green and red algae, and cyanobacteria. We found an EST encoding the entire γ-Tmt coding region

in K. brevis. Partial γ-Tmt sequences have been identified in the EST libraries of K. micrum and a

haptophyte alga Isochrysis galbana (PEPdb accession numbers KME00005944 and ISE00003655,

respectively). The bootstrap (BPml = 87%, BPnj = 91%) and Bayesian analyses (BPP = 0.99) provide

support for the monophyly of green algae, plants, haptophytes, and fucoxanthin dinoflagellates suggesting

the same scenario for the origin of γ-Tmt in these genomes as described above for plastocyanin (Fig. 4D).

Soluble inorganic pyrophosphatases (sPPases) are ubiquitous enzymes that are responsible for the

removal of inorganic pyrophosphate produced by a variety of vital biosynthetic reactions (Perez-

Castineira et al. 2001). Eukaryotic cells contain several isoforms of sPPase located in the cytosol and

mitochondrion of animals and fungi, in the plastid and mitochondrion of photosynthetic eukaryotes, and

only in the mitochondrion of heterotrophic protists. All of these isoforms, with the exception of plant and

green algal mitochondrial sPPase, arose though the duplication of the eukaryotic gene, which replaced the

bacterial homologs of the proteobacterial and cyanobacterial endosymbionts. Analysis of the plastid

sPPase distribution shows this protein to be present in both photosynthetic plastid-bearing

chromalveolates as well as non-photosynthetic apicoplast-bearing apicomplexans. We however excluded

the highly divergent apicoplast targeted proteins from the analysis (results not shown). The sPPase

phylogenetic tree strongly supports the monophyly of chromalveolates and the origin of this gene from a

Page 14: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

14

green alga (BPml = 96%, BPP = 0.96; Fig. 4E). Similar to GSA-AT and petH, “green” sPPase originated

in chromalveolates prior to the split of chromists and alveolates. The strongly supported (BPml = 100%

BPnj = 100, BPP = 1.0) position of K. brevis within the haptophyte clade indicates that this gene

originated in fucoxanthin dinoflagellates from the tertiary endosymbiont.

Serine protease IV (sppA) is a thylakoid-bound stroma-exposed protease involved in the light-

induced plastid protein degradation in plants (Lensch, Herrmann, and Sokolenko 2001). Phylogenetic

analyses show that sppA encoding plastid-targeted proteins from all photosynthetic taxa form a

monophyletic group (BPml = 100%, BPnj = 100 %, BPP = 1.0; Fig. 4F) suggesting a common ancestry.

The plastid-targeted sppA genes presumably have arisen in Archaeplastida through the duplication of

mitochondrial sppA (BPml = 95%, BPnj = 94%, BPP = 1.0). However, an unresolved phylogeny within

the mitochondrial/eubacterial clade leaves open the possibility for an origin of plastid sppA through HGT

from a non-cyanobacterium. The nested position of the K. brevis plastid sppA inside of the green

algal/plant clade (BPml = 93%, BPnj = 99%, BPP = 1.0) suggests a green algal origin of the gene.

Absence of plastid sppA homologs from other chromalveolate taxa does not allow us to infer the time of

entry of this “green” gene into the chromalveolate lineage.

The Secondary Host Contribution to the Tertiary Plastid Proteome

In chromalveolates, plastid targeted fructose-1,6-bisphosphate aldolase (FAB II) and glyceraldehyde-3-

phospahte dehydrogenase (GAPDH) arose through the duplication of the secondary host cytosolic

isoforms that replaced homologs of plastid-targeted proteins of the secondary endosymbiont (Harper and

Keeling 2003; Patron, Rogers, and Keeling 2004). The results of our phylogenetic analysis of K. brevis

and K. micrum FAB II support the haptophyte origin of this gene in fucoxanthin dinoflagellates that is

consistent with previously reported FAB II phylogenies (Patron, Waller, and Keeling 2006).

The cytosolic GAPDH that is the precursor of chromalveolate plastidic GAPDH belongs to the

most common form of these enzymes, NADH dependent GAPDH (NAD-GAPDH) that is involved in

glycolytic and gluconeogenetic pathways in all organisms (Forthergill-Gilmore and Michels 1993). In

Page 15: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

15

plants, plastid NAD-GAPDH, which originated through the duplication of the gene encoding a plant

cytosolic NAD-GAPDH, is involved in glycolysis that takes place in non-photosynthetic plastids and in

the dark in chloroplasts (Plaxton 1996). Photosynthetic CO2 assimilation in plant, green algal, and red

algal plastids is catalyzed by the cyanobacterial-derived NADPH dependent GAPDH (Martin et al. 1993).

In chromalveolate plastids that lack the NADPH dependent GAPDH, NAD-GAPDH presumably is

involved in both the Calvin cycle and glycolysis.

Several NAD-GAPDH phylogenies including fucoxanthin dinoflagellates have been previously

published (Takishita, Ishida, and Maruyama 2004; Yoon et al. 2005). These data show the dinoflagellate

nuclear genome to encode two cytosolic forms. One of them (gapC3) has been derived either from

spirochaetes or from euglenoids by horizontal gene transfer, whereas the second (gapC2) is a normal

vertically inherited isoform of NAD-GAPDH. Duplication of the gapC2 gave rise to the gene encoding

the plastid-targeting protein. The plastid-targeted NAD-GAPDH found in fucoxanthin dinoflagellates has

been derived from the haptophyte tertiary endosymbiont. Analyses of the K. brevis EST libraries allowed

us to identify four distinct isoforms of GAPDH: two cytosolic (gapC2 and gapC3) and two plastid-

targeted (gapC1-pd and gapC1-fd) isoforms (Fig. S3). One of the plastid isoforms is similar to the

described previously gapC1-fd (Yoon et al. 2005) and clusters with plastid-targeted sequences of the

haptophyte algae, whereas another isoform, gapC1-pd, is positioned within the clade that includes plastid-

targeted proteins from peridinin dinoflagellates (Fig. 5). These results demonstrate for the first time that

K. brevis retains both tertiary endosymbiont haptophyte-derived and secondary endosymbiont red algal-

derived forms of NAD-GAPDH.

Discussion

Plastid Establishment and Endosymbiotic Gene Transfer

The evolution of the fucoxanthin dinoflagellate plastid proteome was shaped by primary, secondary, and

tertiary endosymbioses (Yoon et al. 2005). Study of the green and red algal plastids that resulted from

primary endosymbiosis have until now received the greatest attention. Using these models, it was

Page 16: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

16

demonstrated that establishment of the primary plastid proteome involved the integration of the molecular

machineries of both the endosymbiont and the host (Martin et al. 2002; Richly and Leister 2004). This

process was facilitated by large-scale gene movement from the cyanobacterial endosymbiont to the host

nuclear genome, with the subsequent re-targeting of these gene products to the plastid (Martin et al.

1998). Using present estimates, ca. 900 – 1200 plastid-targeted proteins encoded by the Arabidopsis

thaliana nuclear genome and 676 proteins in the thermoacidiphilic red alga Cyanidioschyzon merolae are

of cyanobacterial origin (Reumann, Inoue, and Keegstra 2005; Sato et al. 2005). For Arabidopsis, these

numbers comprise up to 50% of its nuclear-encoded plastid-targeted proteins. The remaining proteins

(around 1200) are derived from the host nuclear or pro-mitochondrial genomes, or from various sources

via HGT. Phagocytosis is considered to be the major mechanism in free living eukaryotes for the

acquisition of new genes via HGT (Doolittle 1998). The inability of virtually all red and green algae to

carry out phagocytosis has most certainly limited the influx of new genetic material through HGT into

these lineages. However, it is likely that HGT played an important role in the early formation of the

Archaeplastida primary plastid proteome (Martin and Schnarrenberger 1997) because phagotrophy was an

ancestral character in this group that most likely allowed the capture of the primary plastid.

Following primary endosymbiosis red and green plastids were acquired by other protist lineages

through secondary endosymbiosis (Bhattacharya, Yoon, and Hackett 2004). In secondary endosymbiosis,

most of the genes encoding plastid-targeted proteins were transferred from the endosymbiont nucleus to

the new host nucleus and reused for plastid function and regulation. The results of our phylogenetic

analysis of the plastid-targeted proteins confirm this prediction with 13 out of 30 proteins in K. brevis

being derived by chromalveolates from a red alga through secondary endosymbiosis (Table 1). This

number includes 11 proteins of cyanobacterial origin and two proteins (fusA and FBPase) derived from

the pro-mitochondrial genome of the primary host (Figs. 6A, B).

Despite the large-scale influx of endosymbiont genes into the host nuclear genome, the secondary

plastid proteome should not be considered simply as a copy of that in the primary plastid. Phylogenetic

analyses show significant changes in the proteome of the red algal-derived plastid that occurred in the

Page 17: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

17

chromalveolates during the establishment of secondary endosymbiosis. These changes can be classified

into two major types. The first involves the substitution of cyanobacterial genes with host paralogs. Two

examples are GAPDH and FBA (Harper and Keeling 2003; Patron, Rogers, and Keeling 2004). Both of

these substitutions occurred prior to the divergence of the chromalveolate lineages (Fig. 6B). The second

type of change in the red algal-derived plastid proteome is the substitution of genes derived from the red

algal endosymbiont with green algal or bacterial homologs. This category also includes the acquisition of

“foreign” genes, homologs of which are absent from red algae. There are two possible, but not mutually

exclusive explanations for the occurrence of “foreign” genes in chromalveolates, 1) continuous HGT over

chromalveolate evolution, or 2) an ancient green algal endosymbiosis potentially preceding the red algal-

derived plastid in the chromalveolate ancestor. The latter idea was proposed by Funes et al. (2002) as an

explanation for the occurrence of the green algal derived mitochondrial protein COXII in apicomplexans.

The red algal secondary endosymbiosis that gave rise to the chromalveolate plastid occurred about

1.3 billion years ago (Yoon et al. 2004). However, unlike Archaeplastida, many extant photosynthetic

chromalveolates still retain the ability for phagocytosis (Kugrens and Lee 1990; Kawachi et al. 1991;

Wilcox and Wedemayer 1991; Jones, Leadbeater, and Green 1994) that makes possible the influx of new

genes via HGT. If continuous HGT is a major mechanism for chromalveolate plastid proteome evolution,

we would expect to see the substitutions of red algal-derived genes occurring at different time points in

the chromalveolate tree. Three examples of such substitutions have been reported so far. One of them is

plastid protein delta-aminolevulinic acid dehydrogenase (hemB) that has a green origin in dinoflagellates

and a red origin in stramenopiles (Hackett et al. 2004b). A second example is the substitution of the

normal plastid encoded RuBisCo with its proteobacterial homolog that is nuclear encoded in

dinoflagellates (Palmer 1996). And finally, a plastid-targeted protein encoded by fused genes for the

shikimate biosynthetic enzyme AroB and an O-methyltransferase (OMT) derived by HGT from a

cyanobacterium has been found in dinoflagellates (Waller, Slamovits, and Keeling 2006). These examples

provide evidence for the importance of intra- and interdomain HGT in chromalveolate plastid evolution.

Other reported cases of plastid proteins of non-red algal origin in chromalveolates include two “green”

Page 18: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

18

genes, chlorophyll A synthase (chlG, Li et al. 2006) and phosphoribulokinase (PRK, Li et al. 2006;

Petersen et al. 2006) that are shared by different chromalveolate lineages. Red algal derived homologs of

these genes have not yet been found in chromalveolates.

Analysis of the K. brevis EST libraries identified six proteins of green algal origin: GSA-AT,

petH, and sPPase that are shared by chromists and alveolates, petE and γ-Tmt that are shared by

haptophytes and fucoxanthin dinoflagellates, and sppA, homologs of which have not been found in other

lineages of chromalveolates (Fig. 6B, C). Phylogenetic analyses of petE and γ-Tmt show that tertiary

EGT from the haptophyte endosymbiont is the most plausible explanation for the presence of these genes

in fucoxanthin dinoflagellates (Figs. 6C, D). There are two possible scenarios for the occurrence of these

genes in haptophytes. Under the first scenario, haptophytes gained petE and γ-Tmt through an

independent HGT. The observation that many extant species of photosynthetic haptophytes still retain the

ability for phagotrophy (Kawachi et al. 1991; Jones, Leadbeater, and Green 1994) makes plausible this

scenario. Under the second scenario, these genes entered the chromalveolate nuclear genome prior to the

divergence of the constituent lineages and have subsequently been lost from stramenopiles and alveolates

after the divergence of the cryptophyte-haptophyte branch (for the chromalveolate phylogeny see Hackett

et al. 2006). Limited molecular data from cryptophyte algae preclude us from reaching a decisive

conclusion on the time of petE and γ-Tmt recruitment by chromalveolates. Phylogenies of GSA-AT,

petH, sPPase, PRK, and chlG however suggest that a major influx of green genes occurred early in the

evolution of chromalveolates prior to the divergence of the chromists and alveolates.

It remains a daunting task to distinguish between the scenarios that the “green” genes are derived

by EGT from a single endosymbiont prior to the establishment of the “red” plastid or alternatively via

HGT from multiple green algal donors after this event. One possible approach to resolving this issue is to

compare the phylogenetic composition of foreign genes acquired by Chromalveolata before and after the

split of its constituent lineages. The present data show that the list of organisms that donated genes to the

chromalveolate lineages after their divergence from each other includes green algae, proteobacteria, and

Page 19: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

19

cyanobacteria (see above). However all foreign genes acquired by chromalveolates before their split are

derived from a single donor lineage, the green algae. One reasonable explanation for these data is the

presence of a green algal endosymbiont in the chromalveolate ancestor prior to the establishment of the

red algal-derived plastid. Unfortunately, the red algal endosymbiosis likely resulted in the substitution of

most green algal nuclear genes of plastid function with red algal homologs, effectively erasing the major

evidence for the previous endosymbiosis.

The results of our study show that secondary endosymbiosis brought together components of highly

diverged red and green algal photosynthetic machineries. Examples are psbU, that is unique in red algae

and green algal plastocyanin, structural homologs of which are absent from the red algal lineage. A

significant flexibility of the light-harvesting complex (LHC) in plastids has been previously demonstrated

(Grabowski, Cunningham, and Gantt 2001), showing that LHC proteins from a red alga, Porphyridium

cruentum, could functionally bind pigments that had evolved separately in different evolutionary lineages

(e.g., plants, stramenopiles, dinoflagellates). The successful incorporation of green algal-derived plastid

proteins into the proteome of the red plastid demonstrated by our study is consistent with these results

(Grabowski, Cunningham, and Gantt 2001). These data support the significant biochemical plasticity of

key components of plastid energy metabolism, such as the electron transport chain and the pyrophosphate

utilization machinery.

Tertiary endosymbiosis introduced an additional level of complexity to the process of plastid

regulation. In this case, the genomes of both the endosymbiont and the host contain genes of plastid

function. The discovery of nuclear-encoded plastid-targeted PSBO in fucoxanthin dinoflagellates that

originated from the haptophyte tertiary endosymbiont led Ishida and Green (Ishida and Green 2002) to

propose the replacement of ancestral red algal derived genes in these photosynthetic taxa with the

haptophyte endosymbiont homologs. Until recently, this was the leading hypothesis for the origin of the

tertiary plastid proteome. Patron, Waller, and Keeling (2006) however reported several plastid-targeted

proteins of the ancestral peridinin-dinoflagellate-type that were retained in the nuclear genome of the

fucoxanthin dinoflagellate K. micrum. These proteins include thylakoid-bound ascorbate peroxidase and

Page 20: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

20

phosphoribulokinase. Based on their involvement in processes unrelated to photosynthesis, Patron,

Waller, and Keeling (2006) suggested that the fucoxanthin dinoflagellate ancestor contained a non-

photosynthetic (apicoplast-like) plastid that required a reduced set of genes to express its function. Thus,

genes of the haptophyte tertiary endosymbiont may not have replaced, but rather reestablished the

photosynthetic machinery with genes that were lost prior to this endosymbiotic event. Analysis of the K.

brevis EST data turned up 13 genes encoding plastid-targeted proteins of haptophyte origin: atpC, psbO,

petC, psbU, PGAM, fabG, petH, petE, γ-Tmt, fusA, sPPase, FBA II, and gapC1-fd (Fig. 6C).

In addition to haptophyte-derived GAPDH encoded by gapC1- fd, we found the ancestral red algal

derived gapC1-pd sequence in K. brevis. This the first report of both ancestral and tertiary endosymbiont-

derived genes encoding homologous plastid-targeted proteins in fucoxanthin dinoflagellates. This

suggests that fucoxanthin dinoflagellates likely retain two functional sets of other, yet undetected nuclear

genes encoding plastid-targeted proteins. However, the clear dominance of expressed genes of haptophyte

provenance suggests that most ancestral genes encoding plastid-targeted proteins have been lost or

replaced after the establishment of the tertiary endosymbiont. This situation may mirror the more ancient

substitution of green algal genes with genes derived from the red algal endosymbiont postulated above.

Serial Endosymbiosis and HGT as Mechanisms for Adaptation in Chromalveolates

Chromalveolate algae are the most abundant group of eukaryotic marine phytoplankton. It is however

unclear why these taxa have risen to pre-eminence. Grzebyk et al. (2003) posited as explanation the

presence of a larger set of genes involved in photosynthesis and energy transduction in chromalveolate

plastids than in reds and greens (plastid portability hypothesis) and structural traits of the chromalveolate

host cell such as an armored cell wall as potential explanations.

We propose that the ability to recruit a new genetic material through serial endosymbiosis and

HGT is a key chromalveolate trait that provides the ability for rapid adaptation to changing environmental

conditions, thereby increasing their evolutionary fitness. For example, Karenia brevis is remarkably

tolerant to low iron conditions (10-9M) that will not support the growth of diatoms (A Neeley, G DiTullio,

Page 21: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

21

FM Van Dolah unpublished data). Green algal-derived petE is an example of a gene that is absent in red

algae and provides a novel mechanism in fucoxanthin dinoflagellates that may contribute to this trait.

Like green algae and some cyanobacteria, haptophytes and fucoxanthin dinoflagellates possess two types

of functionally homologous electron carrier proteins, copper-binding plastocyanin and iron-binding

CytC6. In fucoxanthin dinoflagellates, like in green algae and cyanobacteria, plastocyanin is the main

protein carrying electrons between the cytochrome b6/f complex and Photosystem I. Based on our results

we suggest that the acquisition of plastocyanin is an adaptation of fucoxanthin dinoflagellates to iron-

limitation in its natural habitat. Another known adaptation to iron deficiency in many algae and

cyanobacteria is a physiological switch from the iron–sulfur protein ferredoxin to the non-iron protein

flavodoxin in the electron transport chain (Erdner et al. 1999; Geiss et al. 2001). Analysis of the K. brevis

EST data show that both proteins, ferredoxin and flavodoxin, are present in this fucoxanthin

dinoflagellate. Consistent with its role in facilitating photosynthetic electron transport under low iron

conditions, flavodoxin transcript levels increase, whereas those of ferredoxin decrease under iron

limitation (KL Lidie and FM Van Dolah unpublished data), and flavodoxin protein expression is induced

under iron limitation, whereas only ferredoxin is present in nutrient replete conditions (A Neeley, G

DiTullio, and FM Van Dolah unpublished data). Thus, the two types of plastid electron carrier proteins,

red algal-derived flavodoxin (Fig. 6B, Table 1) and green algal-derived plastocyanin, likely provide

fucoxanthin dinoflagellates with the ability to thrive under conditions of iron deficiency in their natural

habitats.

Page 22: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

22

Supplementary Material

Figures S1-S3 and the list of contributors in the JGI Production Sequencing Group are available at

Molecular Biology and Evolution online.

Page 23: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

23

Acknowledgements

The K.brevis EST data were produced under the auspices of the US Department of Energy's Office of

Science, Biological and Environmental Research Program and the University of California, Lawrence

Livermore National Laboratory under Contract No. W-7405-Eng-48, Lawrence Berkeley National

Laboratory under contract No. DE-AC03-76SF00098 and Los Alamos National Laboratory under

contract No. W-7405-ENG-36. This was Community Sequencing Program project #200407. D. B. and T.

N. were supported by grants from the National Science Foundation and the National Aeronautics and

Space Administration awarded to D. B. (EF 04-31117, NNG04GM17G). T. N. was partially supported by

an Avis E. Cone Fellowship from the University of Iowa. F. M. V. D. and K. L. L. acknowledge support

from the NOAA HAB Initiative.

Page 24: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

24

Literature Cited

Adl SM, Simpson AG, Farmer MA, et al. (25 coauthors). 2005. The new higher level classification of

eukaryotes with emphasis on the taxonomy of protists. J Eukaryot Microbiol 52:399-451.

Arakaki AK, Ceccarelli EA, Carrillo N. 1997. Plant-type ferredoxin-NADP+ reductases: a basal

structural framework and a multiplicity of functions. FASEB J 11:133-140.

Archibald JM, Rogers MB, Toop M, Ishida K, Keeling PJ. 2003. Lateral gene transfer and the evolution

of plastid-targeted proteins in the secondary plastid-containing alga Bigelowiella natans. Proc Natl

Acad Sci USA 100:7678-7683.

Armbrust EV, Berges JA, Bowler C, Green BR, Martinez D. 2004. The genome of the diatom

Thalassiosira pseudonana: ecology, evolution, and metabolism. Science 306:79–86.

Asamizu E, Nakajima M, Kitade Y, Saga N, Nakamura Y, Tabata S. 2003. Comparison of RNA

expression profiles between the two generations of Porphyra yezoensis (Rhodophyta), based on

expressed sequence tag frequency analysis. J Phycol 39:923–930.

Bhattacharya D, Medlin L. 1995. The phylogeny of plastids: a review based on comparisons of small-

subunit ribosomal RNA coding regions. J Phycol 31:489–498.

Bhattacharya D, Yoon HS, Hackett JD. 2004. Photosynthetic eukaryotes unite: endosymbiosis connects

the dots. Bioessays 26:50-60.

Blankenship RE. 2001. Molecular evidence for the evolution of photosynthesis. Trends Plant Sci 6:4-6.

Breitenberger CA, Spremulli LL. 1980. Purification of Euglena gracilis chloroplast elongation factor G

and comparison with other prokaryotic and eukaryotic translocases. J Biol Chem 255:9814-9820.

Cavalier-Smith T. 1999. Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid,

dinoflagellate, and sporozoan plastid origins and the eukaryote family tree. J Eukaryot Microbiol

46:347–366.

Page 25: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

25

Cavalier-Smith T. 2004. Only six kingdoms of life. Proc Biol Sci 271:1251–1262.

Ceccarelli EA, Arakaki AK, Cortez N, Carrillo N. 2004. Functional plasticity and catalytic efficiency in

plant and bacterial ferredoxin-NADP(H) reductases. Biochim Biophys Acta 1698:155-165.

Doolittle WF. 1998. You are what you eat: a gene transfer ratchet could account for bacterial genes in

eukaryotic nuclear genomes. Trends Genet 14:307-311.

Douzery EJ, Snell EA, Bapteste E, Delsuc F, Philippe H. 2004. The timing of eukaryotic evolution: does

a relaxed molecular clock reconcile proteins and fossils? Proc Natl Acad Sci USA 101:15386-15391.

Felsenstein J. 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution

39:783–791.

Forthergill-Gilmore LA, Michels PAM. 1993. Evolution of glycolysis. Prog. Biophys Molec Biol 59:105-

235.

Fryer MJ. 1993. Evidence for the photoprotective effects of vitamin E. Photochem. Photobiol 58:304-312.

Funes S, Davidson E, Reyes-Prieto A, Magallon S, Herion P, King MP, Gonzalez-Halphen D. 2002. A

green algal apicoplast ancestor. Science 298:2155.

Erdner DL, Price NM, Doucette GJ, Peleato ML, Anderson DM. 1999. Characterization of ferredoxin and

flavodoxin as markers of iron limitation in marine phytoplankton. Mar Ecol Prog Ser 184:43–53.

Geiss U, Vinnemeier J, Kunert A, Lindner I, Gemmer B, Lorenz M, Hagemann M, Schoor A. 2001.

Detection of the isiA gene across cyanobacterial strains: potential for probing iron deficiency. Appl

Environ Microbiol 67:5247-5253.

Grabowski B, Cunningham FX Jr, Gantt E. 2001. Chlorophyll and carotenoid binding in a simple red

algal light-harvesting complex crosses phylogenetic lines. Proc Natl Acad Sci USA 98:2911-2916.

Gray MW. 1998. Rickettsia, typhus and the mitochondrial connection. Nature 396:109-110.

Gross EL. 1993. Plastocyanin: Structure and function. Photosynth Res 37:103-116.

Page 26: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

26

Grzebyk D, Schofield O, Vetriani C, Falkowski PG. 2003. The Mesozoic radiation of eukaryotic algae:

the portable plastid hypothesis. J Phycol 39:259-267.

Guindon S, Gascuel O. 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by

maximum likelihood. Syst Biol 52:696–704.

Guindon S, Lethiec F, Duroux P, Gascuel O. 2005. Nucleic Acids Res 33 (Web Server issue):W557-559.

Hackett JD, Anderson DM, Erdner DL, Bhattacharya D. 2004a. Dinoflagellates: a remarkable

evolutionary experiment. Am J Bot 91:1523-34.

Hackett JD, Yoon HS, Soares MB, Bonaldo MF, Casavant TL, Scheetz TE, Nosenko T, Bhattacharya D.

2004b. Migration of the plastid genome to the nucleus in a peridinin dinoflagellate. Curr Biol 14:213-

218.

Hackett JD, Yoon HS, Butterfield NJ, Sanderson MJ, Bhattacharya D. 2006. Plastid endosymbiosis:

origins and timing of events. In: Falkowski P, Knoll A, editors. Evolution of Aquatic

Photoautotrophs. Academic Press: in press.

Harper JT, Keeling PJ. 2003. Nucleus-encoded, plastid-targeted glyceraldehyde-3-phosphate

dehydrogenase (GAPDH) indicates a single origin for chromalveolate plastids. Mol Biol Evol

10:1730-35.

Harper JT, Keeling PJ. 2004. Lateral gene transfer and the complex distribution of insertions in

eukaryotic enolase. Gene 340:227-235.

Hedges SB, Chen H, Kumar S, Wang DY, Thompson AS, Watanabe H. 2001. A genomic timescale for

the origin of eukaryotes. BMC Evol Biol 1:4.

Hedges SB, Blair JE, Venturi ML, Shoe JL. 2004. A molecular timescale of eukaryote evolution and the

rise of complex multicellular life. BMC Evol Biol 4:2.

Page 27: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

27

Ho KK, Krogmann DW. 1984. Electron donors to P700 in cyanobacteria and algae. An instance of

unusual genetic variability. Biochim Biophys Acta 766:310–316.

Huelsenbeck JP, Ronquist F. 2001. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics

17:754–755.

Ishida K, Green BR. 2002. Second- and third-hand chloroplasts in dinoflagellates: phylogeny of oxygen-

evolving enhancer 1 (PsbO) protein reveals replacement of a nuclear-encoded plastid gene by that of

a haptophyte tertiary endosymbiont. Proc Natl Acad Sci USA 99:9294-99.

Jones HLJ, Leadbeater BSC, Green HLJ. 1994. Mixotrophy in haptophytes. In: J.C. Green, BSC

Leadbeater, editor. The haptophyte algae. Oxford: Clarendon Press. pp. 247-263.

Kannangara CG, Gough SP. 1978. Biosynthesis of δ-aminolevulinate in greening barley leaves: glutamate

1- semialdehyde aminotransferase. Carlsberg Res Commun 43:185–194.

Kawachi M, Inouye I, Maeda O, Chihara M. 1991. The haptonema as a food-capturing device:

observation on Chrysochromulina hirta (Prymnesiophyceae). Phycol 30:563-573.

Kugrens P, Lee RE. 1990. Ultrastructural evidence for bacterial incorporation and myxotrophy in the

photosynthetic cryptomonad Chroomonas pochmanni Huber-Pestalozzi (Cryoptomonadida). J

Protozool 37:263-267.

Lensch M, Herrmann RG, Sokolenko A. 2001. Identification and characterization of SppA, a novel light-

inducible chloroplast protease complex associated with thylakoid membranes. J Biol Chem

276:33645-51.

Li S, Nosenko T, Hackett JD, Bhattacharya D. 2006. Phylogenomic analysis identifies red algal genes of

endosymbiotic origin in the chromalveolates. Mol Biol Evol 23:663-674.

Page 28: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

28

Lidie KB, Ryan JC, Barbier M, Van Dolah FM. 2005. Gene expression in Florida red tide dinoflagellate

Karenia brevis: analysis of an expressed sequence tag library and development of DNA microarray.

Mar Biotechnol 7:481-493.

Lopez CD, Yost CS, Prusiner SB, Myers RM, Lingappa VR. 1990. Unusual topogenic sequence directs

prion protein biogenesis. Science 248:226-229.

Martin W, Brinkmann H, Savonna C, R Cerff. 1993. Evidence for a chimeric nature of nuclear genomes:

eubacterial origin of eukaryotic glyceraldehyde-3-phosphate dehydrogenase genes. Proc Natl Acad

Sci USA 90:8692-8696.

Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa M, Penny

D. 2002. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals

plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci USA

99:12246-12251.

Martin W, Schnarrenberger C. 1997. The evolution of the Calvin cycle from prokaryotic to eukaryotic

chromosomes: a case study of functional redundancy in ancient pathways through endosymbiosis.

Curr Genet 32:1-18.

Martin W, Stoebe B, Goremykin V, Hansmann S, Hasegawa M, Kowallik KV. 1998. Gene transfer to the

nucleus and the evolution of chloroplasts. Nature 393:162-165.

Matsuzaki M, Misumi O, Shin-I T, et al. (39 coauthors). 2004. Genome sequence of the ultrasmall

unicellular red alga Cyanidioschyzon merolae 10D. Nature 428:653-657.

Palmer, J.D. 1996. Rubisco surprises in dinoflagellates. Plant Cell. 8:343-345.

Parkinson J, Blaxter M. 2004. Expressed sequence tags: analysis and annotation. Methods Mol Biol

270:93-126.

Page 29: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

29

Patron NJ, Rogers MB, Keeling PJ. 2004. Gene replacement of fructose-1,6-bisphosphate aldolase

supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot Cell 5:1169-

1175.

Patron NJ, Waller RF, Keeling PJ. 2006. A tertiary plastid uses genes from two endosymbionts. J Mol

Biol 357:1373-1382.

Perez-Castineira JR, Gomez-Garcia R, Lopez-Marques RL, Losada M, Serrano A. 2001. Enzymatic

systems of inorganic pyrophosphate bioenergetics in photosynthetic and heterotrophic protists:

remnants or metabolic cornerstones? Int Microbiol 4:135-142.

Pertea G, Huang X, Liang F, et al. (9 coauthors). 2003. TIGR Gene Indices clustering tools (TGICL): a

software system for fast clustering of large EST datasets. Bioinformatics 19:651-652.

Petersen J, Teich R, Brinkmann H, Cerff R. 2006. A "green" phosphoribulokinase in complex algae with

red plastids: evidence for a single secondary endosymbiosis leading to haptophytes, cryptophytes,

heterokonts, and dinoflagellates. J Mol Evol 62:143-157.

Plaxton WC. 1996. The organization and regulation of plant glycolysis. Annu. Rev Plant Physiol Plant

Mol Biol 47:185-214.

Price NT, Smith AJ, Sykes AG, Rogers LJ. 1991. Cytochrome c-553 from two species of macroalgae.

Phytochemistry 30:2845–2848.

Reumann S, Inoue K, Keegstra K. 2005. Evolution of the general protein import pathway of plastids

(review). Mol Membr Biol 22:73-86.

Richly E, Leister D. 2004. An improved prediction of chloroplast proteins reveals diversities and

commonalities in the chloroplast proteomes of Arabidopsis and rice. Gene 329:11-16.

Saldarriaga JF, Taylor FJR, Keeling PJ, Cavalier-Smith T. 2001. Dinoflagellate nuclear SSU rRNA

phylogeny suggests multiple plastid losses and replacements. J Mol Evol 53: 204–213.

Page 30: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

30

Sandmann G, Reck H, Kessler E, Boger P. 1983. Distribution of plastocyanin and soluble plastidic

cytochrome c in various classes of algae. Arch Microbiol 134:23–27.

Sato N, Ishikawa M, Fujiwara M, Sonoike K. 2005. Mass Identification of chloroplast proteins of

endosymbiont origin by phylogenetic profiling based on organism-optimized homologous protein

groups. Genome Informatics 16:56-68.

Sattler SE, Cahot EB, Coughlan SJ, DellaPenna D. 2003. Characterization of tocopherol cyclases from

higher plants and cyanobacteria. Evolutionary implications for tocopherol synthesis and function.

Plant Physiol 132:2184-2195.

Scala S, Carels N, Falciatore A, Chiusano ML, Bowler C. 2002. Genome properties of the diatom

Phaeodactylum tricornutum. Plant Physiol 129:993–1002.

Schopf JW. 1993. Microfossils of the Early Archean Apex chert: new evidence of the antiquity of life.

Science 260:640-646.

Takishita K, Ishida K, Maruyama T. 2004. Phylogeny of nuclear-encoded plastid-targeted GAPDH gene

supports separate origins for the peridinin- and the fucoxanthin derivative-containing plastids of

dinoflagellates. Protist 155:447-458.

Waller RF, Slamovits CH, Keeling PJ. 2006. Lateral gene transfer of a multigene region from

cyanobacteria to dinoflagellates resulting in a novel plastid-targeted fusion protein. Mol Biol Evol

23:1437-1443.

Weber AP, Linka M, Bhattacharya D. 2006. Single, ancient origin of a plastid metabolite translocator

family in Plantae from an endomembrane-derived ancestor. Eukaryot Cell 5:609-612.

Weber AP, Oesterhelt C, Gross W, et al. (14 coauthors). 2004. EST-analysis of the thermo-acidophilic red

microalga Galdieria sulphuraria reveals potential for lipid A biosynthesis and unveils the pathway of

carbon export from rhodoplasts. Plant Mol. Biol. 55:17–32.

Page 31: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

31

Wilcox LW, Wedemayer GJ. 1991. Phagotrophy in the freshwater, photosynthetic dinoflagellate

Amphidinium cryophilum. J Phycol 27:600–609.

Wood PM 1978. Interchangeable copper and iron proteins in algal photosynthesis. Studies on

plastocyanin and cytochrome c-552 in Chlamydomonas. Eur J Biochem 87:9–19.

Yoon HS, Hackett JD, Ciniglia C, Pinto G, Bhattacharya D. 2004. A molecular timeline for the origin of

photosynthetic eukaryotes. Mol Biol Evol 21:809-818.

Yoon HS, Hackett JD, Van Dolah FM, Nosenko T, Lidie KL, Bhattacharya D. 2005. Tertiary

endosymbiosis driven genome evolution in dinoflagellate algae. Mol Biol Evol 5:1299-1308.

Page 32: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

32

Fig. 1. - A 6-protein plastid phylogeny using genes implicated in tertiary EGT in K. brevis. This ML tree

was inferred from the combined plastid-targeted protein sequences of atpC, psbO, petC, psbU, PGAM,

and fabG. The numbers above and below the branches are the results of ML and NJ bootstrap analyses,

respectively. Only bootstrap values ≥ 60% are shown. The thick branches indicate ≥ 0.95 posterior

probability from Bayesian inference. Branch lengths are proportional to the number of substitutions per

site (see scale bars). Chr, chromista; Alv, alveolates; Ra, red algae; Gr, green algae/plants; CB,

cyanobacteria. Numbers in bold indicate bootstrap supports for the monophyly of chromalveolates and

red algae and K. brevis and haptophytes.

Fig. 2. - ML tree of plastid- and mitochondrial-targeted translation elongation factor G (fusA). The

numbers above and below the branches are the results of ML and NJ bootstrap analyses, respectively.

Only bootstrap values ≥ 60% are shown. The thick branches indicate ≥ 0.95 posterior probability from

Bayesian inference. Branch lengths are proportional to the number of substitutions per site (see scale

bars). The lineage designations are as in Fig. 1. Numbers in bold indicate bootstrap support for the

monophyly of plastid and alpha-proteobacterial clades, and K. brevis and haptophytes.

Fig.3. - ML tree of plastid targeted and cytosolic fructose-1, 6-bisphosphatase (FBPase) sequences. The

numbers above and below the branches are the results of ML and NJ bootstrap analyses, respectively.

Only bootstrap values ≥ 60% are shown. The thick branches indicate ≥ 0.95 posterior probability from

Bayesian inference. Branch lengths are proportional to the number of substitutions per site (see scale

bars). The lineage designations are as in Fig. 1. Numbers in bold indicate bootstrap support for the

monophyly of plastid and cytosolic isoforms of FBPase, and chromalveolates and red algae.

Fig. 4. - Proteins of green algal origin in the K. brevis tertiary plastid proteome. (A) ML tree of GSA-AT.

(B) ML tree of petH. (C) ML tree of petE. (D) ML tree of γ-Tmt. (E) ML tree of sPPase. (F) ML tree of

Page 33: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

33

sppA. The numbers above and below the branches are the results of ML and NJ bootstrap analyses,

respectively. Only bootstrap values ≥ 60% are shown. Dashes represent bootstrap values bellow 60%. The

thick branches indicate ≥ 0.95 posterior probability from Bayesian inference. Branch lengths are

proportional to the number of substitutions per site (see scale bars). The lineage designations are as in Fig.

except that Gl is glaucophytes. Numbers in bold indicate bootstrap support for the monophyly of

chromalveolates and green algae/plants and fucoxanthin dinoflagellates K. brevis and K. micrum and

haptophytes.

Fig. 5. - ML tree of chromalveolate plastid targeted NAD-GAPDH. The numbers above and below the

branches are the results of ML and NJ bootstrap analyses, respectively. Only bootstrap values ≥ 60% are

shown. The thick branches indicate ≥ 0.95 posterior probability from Bayesian inference. Branch lengths

are proportional to the number of substitutions per site (see scale bars). Chr, chromista; Alv, alveolates.

Numbers in bold indicate bootstrap support for the monophyly of gapC1-pd isoform of K. brevis and

peridinin dinoflagellates and gapC1-fd isoform of fucoxanthin dinoflagellates and haptophytes. This tree

represents a portion of the complete NAD-GAPDH tree provided in supporting materials (Fig. S3).

Fig. 6. - Origin of plastid-targeted proteins in K. brevis. Fucoxanthin dinoflagellate plastid proteome

evolution through primary, secondary, and tertiary endosymbioses is shown from top to bottom. The

sources of K. brevis genes encoding plastid targeted proteins are denoted using different colors in three

sections of the figure corresponding to primary, secondary, and tertiary endosymbioses. (a) Genes derived

from the primary host through vertically (black font/arrow) and cyanobacteria through EGT (blue

font/arrow). (b) Genes derived from the secondary host vertically (black font/arrow), red algal

endosymbiont through EGT (red font/arrow), and green algae through HGT (green font/arrow). (c) Genes

derived by fucoxanthin dinoflagellates from the peridinin dinoflagellate host vertically (black font/arrow)

and haptophyte tertiary endosymbiont through EGT (purple font/arrow). The question mark indicates

Page 34: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

34

uncertainty about the time for the gene transfer. * - numbers of cyanobacteria- and host-derived genes are

indicated for Arabidopsis plastid proteome (Reumann, Inoue, and Keegstra 2005).

Page 35: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

36

Table 1. Plastid targeted proteins in the fucoxanthin dinoflagellate Karenia brevis.

Gene source in1’ 2’ 3’

# Protein name Gene/Protein

abbreviation

Accessionnumber

Leaderpeptide

plastid proteome

Electron transport and carbohydrate metabolism1 Glyceraldehyde-3-phosphate

dehydrogenase isoform C1-fdgapC1-fd/

GAPDHp-fdDQ531601* st Cyt _ H

2 Glyceraldehyde-3-phosphatedehydrogenase isoform C1-pd

gapC1-pd/GAPDHp-pd

DQ531575 st Cyt _ PD

3 Cytochrome b6-f complex iron-sulfur subunit

petC/ RISP DQ531586 t CB _ RA _ H

4 Ferredoxin petF/ Fd DQ531588* st CB _ nr _ nr5 Ferredoxin NADPH reductase petH/ FNR DQ531589 st CB _ GA _ H6 Flavodoxin isiB /Flv DQ531584* st CB _ RA _ nr7 Oxygen-enhancer 1 protein psbO DQ531590 st CB _ RA _ H8 Photosystem II 12kD extrinsic

proteinpsbU DQ531591* st CB _ RA _ H

9 Fructose-1,6-bisphosphatealdolase class II

fbaC1/FBA-II

DQ531583 st Cyt _ H

10 Adenylate kinase ADK DQ531597 - CB _ RA _ nr11 Soluble inorganic

pyrophosphatase sPPase DQ531593* st Euk _ GA _ H

12 ATP synthase gamma subunit atpC DQ531581 st CB _ RA _ H13 Plastocyanin petE/ Pc DQ531587* at CB _ GA _ H14 Phosphoglycerate mutase PGAM DQ531576 st CB _ RA _ H15 Fructose-1,6 bisphosphatase fbp/ FBPase DQ531600 st Mt_Cyt _ RA _ nr16 Putative ribulose-1,5

bisphosphate carboxylaseRBCMT DQ531595 st nr

17 LH chlorophyll binding proteins not analyzed

Page 36: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

37

Translation18 Translation elongation factor G fusA/ EF-G DQ531572 - Mt _ RA _ H19 Translation elongation factor Ts EF-Ts DQ531577 st CB _ nr _ nr Transcription and DNA synthesis20 DNA gyrase subunit A gyrA DQ531573 - CB _ RA _ nr Fatty acid synthesis21 Beta-ketoacyl-ACP reductase fabG/ BKR DQ531582 st CB _ RA _ H Protein import and processing22 Clp protease clpC DQ531574 - nr23 Clp protease clpN DQ531578 st CB _ nr _ nr24 DegP serine-type peptidase degP DQ531596 t CB _ RA _ nr25 Periplasmic serine protease IV sppA DQ531592 st CB _ GA _ nr26 M48-like peptidase - DQ531585 st nr Plastid biogenesis27 RNA helicase VDL VDL DQ531579 st nr Chlorophyll biosynthesis28 Glutamate 1-semialdehyde 2,1-

aminomutaseGSA-AT DQ531598 - CB _ GA _ nr

Terpenoid biosynthesis29 Gamma-tocopherol O-

methyltransferase_-Tmt DQ531594 st CB _ GA _ nr

Unknown function30 Conserved hypothetical protein

CO065457- DQ531580 st CB _ RA _ nr

* The sequence represents one member of a closely related family of proteins in K. brevis; s - signal peptide (> 0.9 SignalP

probability); a - signal anchor peptide (> 0.9 SignalP probability); t - transit peptide; 1’ – Archaeplastida; 2’ – Chromalveolata; 3’-

fucoxanthin dinoflagellates. CB - cyanobacteria; RA - red algae; GA - green algae; H - haptophytes; PD - peridinin dinoflagellates; Mt

Page 37: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

38

- gene of mitochondrial or pro-mitochondrial origin; Cyt - host gene encoding cytosolic protein; Euk – host gene of eukaryotic origin

(if the original gene copy was lost from the host genome); nr- protein phylogeny was not resolved.

Page 38: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

Emiliania huxleyiIsochrysis galbana

Prymnesium parvumKarenia brevis - Alv

Thalassiosira pseudonanaPhaeodactylum tricornutum

Heterocapsa triquetraAlexandrium tamarense

Amphidinium carteraeGaldieria sulphurariaCyanidioschyzon merolae

EudicotArabidopsis thaliana

Oryza sativaChlamydomonas reinhardtii

Nostoc sp. PCC7120Crocosphaera watsonii WH8501

Synechococcus elongatus PCC6301Gloeobacter violaceus PCC7421

10095

100

100

97100

70

100

100

100100

100

10096

62

100-

100

100

-

100

100

100100

10087 95

73

100

-

Chr

Chr

Alv

Ra

Gr

Cb

0.1 substitutions/site

Page 39: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

Prymnesium parvumEmiliania huxleyi

Karenia brevis Thalassiosira pseudonanaFragilariopsis cylindrusCyanidioschyzon merolae

Porphyra yezoensisArabidopsis thaliana Glycine maxOryza sativa

Chlamydomonas reinhardtiiRhodobacter sphaeroidesOceanicola batsensis HTCC2597

Mesorhizobium loti MAFF303099Brucella melitensis biovar Abortus 2308

Rhodospirillum rubrum ATCC 11170Magnetospirillum magneticum AMB-1

marine alpha proteobacterium HOT2C01Rickettsia akari str. Hartford

Mus musculusRattus norvegicus

Homo sapiensCryptococcus neoformans

Neurospora crassaAspergillus nidulans

Kluyveromyces lactisArabidopsis thalianaOryza sativa

Galdieria sulphurariaCyanidioschyzon merolae

Dictyostelium discoideumBorrelia garinii PBi

Treponema pallidum subsp. pallidum str. NicholsSyntrophobacter fumaroxidans MPOB

Desulfotalea psychrophila LSv54Leptospira interrogans serovar Lai str. 56601

Blastopirellula marina DSM 3645Rhodopirellula baltica SH 1

Synechococcus elongatus PCC6301Nostoc sp. PCC7120

Gloeobacter violaceus PCC7421

6794

99

93 10099

100

99

100

100

69

91100

100100

98100

8791

100

100

6497

100

74100

Alpha-proteobacteria

Spirochaetesand Delta-proteobacteria

Planctomycetes

10078

100

100

96100

9495

100

96

100 100100

67

100

100

100

9799

100

95

6894 69

97

100

95

mitochondria

Cb

Gr

Ra

Chr

Chr- Alv

0.1 substitutions/site

-

-

plastid

Page 40: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

Arabidopsis thalianaBrassica oleraceaSolanum tuberosum

Triticum aestivumOryza sativa

Spinacia oleraceaScherffelia dubiaChlamydomonas reinhardtiiIsochrysis galbana

Prymnesium parvumLingulodinium

Karenia brevisGaldieria sulphuraria 1

Galdieria sulphuraria 2Cyanidioschyzon merolae

Cyanidioschyzon merolaeGaldieria sulphuraria

Thalassiosira pseudonanaAmphidinium carterae

Phytophthora infestansCaenorhabditis elegans

Ustilago maydisCryptococcus neoformansSolanum tuberosum

Spinacia oleraceaArabidopsis thalianaBrassica oleracea

Oryza sativa

0.1 substitutions/site

919860

62100

99

100

9972

6879

98

65

100

9289

100

100 10067

97100

63

93

98 100

6967

10082

100

100

100 100

-

-

Gr

Alv

Ra

cytosol

plastid

Chr

Page 41: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

0.1 substitutions/site

Prymnesium parvum - ChrKarenia brevisEmiliania huxleyi

Thalassiosira pseudonanaAmphidinium carterae

Lingulodinium polyedrumOryza sativaArabidopsis thaliana

Chlamydomonas reinhardtiiCyanidioschyzon merolae

Galdieria sulphurariaCyanophora paradoxa

Dictyostelium discoideumKluyveromyces lactis

Debaryomyces hanseniiSchizosaccharomyces pombe

Ustilago maydisCryptococcus neoformans

Porphyra yezoensisGaldieria sulphuraria

Cyanidioschyzon merolaeDrosophila melanogaster

Caenorhabditis elegansHomo sapiens

Rattus norvegicusGallus gallus

Danio rerio

79100

94

9697

100

94

95

100

94

98

10066

98

99

-

-

-

71

-

100100

-

100-

63

100 67

100

8699

mitochondrialand cytosolicsPPase

- GlRa

Gr

Alv

- AlvChr

-

0.1 substitutions/site

Amphidinium carteraeKarlodinium micrumKarenia brevis

Alexandrium tamarenseThalassiosira pseudonana

Phaeodactylum tricornutumIsochrysis galbana

Emiliania huxleyiPavlova lutheri

Arabidopsis thalianaBrassica oleraceaNicotiana tabacumLycopersicon esculentum

Hordeum vulgareOryza sativa

Polytomella sp.Chlamydomonas reinhardtii

Cyanidioschyzon merolaeGaldieria sulphuraria

Crocosphaera watsonii WH8501Nostoc sp. PCC7120

Synechococcus elongatus PCC6301Gloeobacter violaceus PCC7421

9281

64 100

10087

100

100

98

8480

86

99-

100

73

100

100

100

98100

82-

-

99

-83

-

Gr

Alv

Chr

Ra

Cb

A

E

0.1 substitutions/site

Isochrysis galbanaPrymnesium parvum

Emiliania huxleyiKarlodinium micrumKarenia brevis

Thalassiosira pseudonana 1Thalassiosira pseudonana 2

Phaeodactylum tricornutumChlamydomonas reinhardtiiVolvox carteriOryza sativaZea mays

Nicotiana tabacumPisum sativumArabidopsis thalianaCeratopteris richardii

Toxoplasma gondiiHeterocapsa triquetra

Amphidinium carteraeOryza sativa

Arabidopsis thalianaPisum sativum

Nicotiana tabacumCyanidioschyzon merolae

Glaucocystis nostochinearumCyanophora paradoxaGaldieria sulphuraria - Ra

Nostoc sp. PCC7120Synechococcus elongatus PCC6301Crocosphaera watsonii WH8501

Gloeobacter violaceus PCC7421

10067

10086

100

94

100 99

97100

93

60

10068

98

9699

90 Cb

- Ra Gl

Alv

Alv

Gr

Gr"leaf" isoform

Chr

Chr

B100

100

100100100

100 86

-

89

100

-

-

-

100

6999

-

100

9299

74

Solanum tuberosumPisum sativum

Nicotiana tabacumArabidopsis thaliana

Zea maysOryza sativaHordeum vulgarePhyscomitrella patens

Emiliania huxleyiKarlodinium micrum

Karenia brevisPediastrum duplex

Pediastrum boryanumChlorella vulgaris

Chlamydomonas reinhardtiiScherffelia dubia

Scenedesmus obliquusTrichodesmium erythraeum IMS101

Gloeobacter violaceus PCC7421Nostoc punctiforme PCC73102

Phormidium laminosumSynechococcus sp. WH 8102

Prochlorococcus marinus CCMP1375Crocosphaera watsonii WH8501

Synechocystis sp. PCC 6803

676389

89

7294

9499

90

100

91

10096

96

-

-

Gr

Gr

- ChrAlv

Cb

--

0.1 substitutions/site

C Arabidopsis thalianaBrassica oleracea

Helianthus annuusPicea sitchensis

Pinus taedaTriticum aestivumOryza sativa

Physcomitrella patensChlamydomonas reinhardtii

Isochrysis galbanaKarlodinium micrum

Karenia brevisCyanidioschyzon merolae

Galdieria sulphuraria Trichodesmium erythraeum IMS101

Anabaena variabilis ATCC 29413Crocosphaera watsonii WH8501Synechocystis sp. PCC 6803

10073

10093

88 10069

87100

98

100

D 100

66

10080

99

93

9193

99

1000.1 substitutions/site

- ChrAlv

Ra

Cb

"root"isoform

Gr

Vitis viniferaNicotiana tabacumArabidopsis thaliana

Sorghum bicolorOryza sativa

Hordeum vulgareChlamydomonas reinhardtii

Karenia brevisCyanidioschyzon merolae

Galdieria sulphuraria Solibacter usitatus Ellin6076

Chlorobium tepidum TLSThermobifida fusca YX

Magnetospirillum MS-1Mycobacterium leprae

Chlamydomonas reinhardtiiDictyostelium discoideuml

Galdieria sulphurariaOceanicaulis alexandrii HTCC2633Cyanidioschyzon merolae

Thalassiosira pseudonanaIdiomarina baltica OS145

Shewanella sp. MR-7Porphyromonas gingivalis W83

Cytophaga hutchinsoniiNostoc punctiforme PCC73102Crocosphaera watsonii WH8501

Trichodesmium erythraeum IMS101Cb

Ra

Gr

mitochondrialand EubacterialsppA

- Alv

0.1 substitutions/site

F 75

9494

99

93

100

80

7187

7376

95

7370

100

91100

70

-

-99

-

65-

72

100

84-

6877

91

62

100-64

10079

99

GSA-AT petH

petE G-Tmt

sPPase sppA

Page 42: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

Symbiodinium muscatineiAmphidinium carterae

Gymnodinium simplexKarenia brevis

Lingulodinium polyedrumHeterocapsa triquetra

Fucus distichusOchromonas danica

Mallomonas rasilisGuillardia theta

Pyrenomonas salinaPhaeodactylum tricornutum

Odontella sinensisKarenia mikimotoiKarenia brevis

Karlodinium micrumPrymnesium parvum

Isochrysis galbanaPavlova lutheri

Toxoplasma gondii

9087

7687

6091

100 97

97

100

100100

100100

93

100

--

--

-

92

97

100

96

100

100

9599

9785

AlvgapC1-pd

Alv gapC1-fd

Chr

Chr

- Alv

0.1 substitutions/site

Cytosolic gapC2

Page 43: Chimeric Plastid Proteome in the Florida “Red Tide ...dblab.rutgers.edu/home/downloads/Files/Nosenko et al MBE AdvAccess 06.pdfThe Author 2006. Pub lished by Oxford University Press

TERTIARY ENDOSYMBIOSIS

PRIMARY ENDOSYMBIOSIS

Peridinin Dinoflagellates

Fucoxanthin Dinoflagellates

Haptophytes

ENDOSYMBIOTIC / HORIZONTAL GENE TRANSFER

FBPasefusAsppAsPPase(~1200 genes*)

Large-scaleEGT fromCyanobacterium(~900 genes*)

PGAMpsbOpsbUC0065457VDLGSA-ATpetH sPPase

gapC1-pd

petHgapC1-fdPGAMpsbOpsbUsPPaseTmt

Rhodophytes

petETmt

SECONDARY ENDOSYMBIOSIS

gapC1FBAII

Origin of Plastid Targeted Proteins in Fucoxanthin Dinoflagellates

Archaeplastida

ChromalveolatesatpCdegPfabGFBPaseFlvfusAgyrApetC

atpCfabGFBAIIfusApetCpetE

?

HOST GENES

A

B

C

sppA?