characterization and expression of the complementary dna

5
Proc. Natl. Acad. Sci. USA Vol. 87, pp. 733-737, January 1990 Neurobiology Characterization and expression of the complementary DNA encoding rat histidine decarboxylase (histamine/amino acid sequence/chromosomal mapping/dopa decarboxylase/fetal liver) DAVID R. JOSEPH*tt, PATRICK M. SULLIVANt, YAN-MIN WANG§, CHRISTINE KOZAK¶, DAVID A. FENSTERMACHER*, MARK E. BEHRENDSEN*, AND CYNTHIA A. ZAHNOWt Departments of *Pediatrics, tPhysiology, and tBiology, and the §Curriculum in Neurobiology, The Laboratories for Reproductive Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599; and IThe Laboratory of Molecular Microbiology, The National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892 Communicated by Esmond E. Snell, October 9, 1989 ABSTRACT Histamine is a neurotransmitter in the central nervous system and an important modulator of gastric acid secretion, vasomotor control, inflammation, and allergic reac- tions. In biological systems the formation of histamine from its precursor histidine is catalyzed by the enzyme L-histidine decarboxylase (HDC; L-histidine carboxy-lyase, EC 4.1.1.22). We have cloned HDC-encoding cDNA from a fetal rat liver cDNA library (phage A gtll) and have deduced the amino acid sequence from the nucleotide sequence. The clone was proven to be HDC cDNA by expression of active recombinant enzyme in COS cells and by chromosomal mapping. The cDNA encodes a protein of Mr 73,450 (655 amino acid residues). The discrep- ancy between this molecular weight and the size of the purifiled fetal liver protein subunits [Taguchi, Y., Watanabe, T., Kubota, H., Hayashi, H. & Wada, H. (1984) J. Biol. Chem. 259, 5214-5221] (M, = 54,000) suggests that HDC may be posttranslationally processed. The 469 amino acid residues from the amino-terminal portion of the protein share 50% identity with rat and Drosophila L-dopa decarboxylases and much less homology with other characterized amino acid decarboxylases. The biogenic amine histamine is a neurotransmitter in the central nervous system and an important physiological mod- ulator of gastric acid secretion, airway smooth-muscle tone, vasomotor control, inflammation, and allergic reactions (1- 3). The formation of histamine from its precursor histidine is catalyzed by the enzyme L-histidine decarboxylase (HDC; L-histidine carboxy-lyase, EC 4.1.1.22). In rodents the en- zyme is primarily localized in the brain (hypothalamic neu- rons and projections to other brain regions) (4), the glandular regions of the stomach (5), mast cells (6), and fetal liver (7). HDC has been purified from rat stomach (5) and fetal liver (7) and was shown to be a dimer consisting of Mr 54,000 subunits. Like other mammalian amino acid decarboxylases, the en- zyme utilizes pyridoxal phosphate as a coenzyme. Although no primary amino acid sequence of mammalian or other eukaryotic HDC is known, immunological studies indicated that rodent HDC is closely related to L-dopa decarboxylase (DDC) (8). In this manuscript we present the amino acid sequence of rat HDC as deduced from the fetal liver cDNA11 and demonstrate its close homology to DDC, but not to other characterized amino acid decarboxylases. EXPERIMENTAL PROCEDURES Preparation and Screening of cDNA Library. Poly(A)+ RNA from fetal liver (16 days after conception) was prepared by guanidine thiocyanate extraction (9) and transcribed with avian myeloblastosis virus reverse transcriptase. The second strand was synthesized with RNase H and Escherichia coli DNA polymerase I (10). After treatment with T4 DNA polymerase to flush the ends, EcoRI linkers were applied by ligation and cleaved with EcoRI. Gel filtration (Sepharose CL-4B)-purified cDNA was ligated with dephosphorylated EcoRI-cleaved phage A gt11 DNA, and the recombinants were packaged in vitro (11). The unampliflied library was screened on E. coli Y1088, with [32P]DNA as previously described (12). Positive recombinants were plaque-purified, and the DNA inserts were isolated by electroelution. Transient Expression in Monkey Kidney Cells. The DNA inserts from isolates HDC-18 and HDC-21 were cloned into the expression vector pCMV5, a modified pCMV plasmid (13) containing an EcoRI cloning site (the modified plasmid was from David Russel and Matthew Lorence, University of Texas Southwestern Medical Center, Dallas). This plasmid vector contains the cytomegalovirus (CMV) promoter- enhancer of the immediate early gene, the poly(A)+ addition- transcription terminator region of the human growth hor- mone gene, the simian virus 40 origin of replication, and a polylinker region for the insertion of cDNAs (13). Recombi- nants with the correct orientation for expression were iden- tified by restriction endonuclease analysis, and the plasmid DNA (pCMVHDC-18 and pCMVHDC-21) was purified by CsCl/ethidium bromide centrifugation. For transfection, subconfluent COS 7 cells (106 cells per 100-mm dish) were transfected with 5 ug of pCMV DNA per dish by the DEAE-dextran method (14). After 48 hr, the cells were suspended in Ca2+/Mg2+-free phosphate-buffered saline and lysed by three cycles of freeze (-70°)/thaw (370); the lysate was assayed for protein and HDC and DDC enzymatic activities. Conditions for the HDC assay were as described (15), with 0.25 mM [3H]histidine as substrate. The formation of [3H]histamine was determined by thin-layer chromatogra- phy (16). Chromosomal Localization. The production and character- ization of the Chinese hamster-mouse somatic cell hybrids has been described (17). NFS/N mice were obtained from the Small Animal Section, National Institutes of Health. Mus musculus musculus mice were originally trapped in Skive, Denmark, and breeding pairs were provided by M. Potter (National Cancer Institute) from his colony at Hazelton Laboratories, Rockville, MD. NFS/N females were mated with M. m. musculus males, and the F1 females were then mated with M. m. musculus males. DNA extracted from the livers of the backcross progeny was typed by Southern blot Abbreviations: HDC, histidine decarboxylase; DDC, dopa decar- boxylase; GAD, glutamic acid decarboxylase; PLP, pyridoxal phos- phate; RFLP, restriction fragment length polymorphism; ABP, art- drogen-binding protein; CMV, cytomegalovirus. 1The sequence reported in this paper has been deposited in the GenBank data base (accession no. M29591). 733 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Upload: doantuong

Post on 02-Jan-2017

230 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Characterization and expression of the complementary DNA

Proc. Natl. Acad. Sci. USAVol. 87, pp. 733-737, January 1990Neurobiology

Characterization and expression of the complementary DNAencoding rat histidine decarboxylase

(histamine/amino acid sequence/chromosomal mapping/dopa decarboxylase/fetal liver)

DAVID R. JOSEPH*tt, PATRICK M. SULLIVANt, YAN-MIN WANG§, CHRISTINE KOZAK¶,DAVID A. FENSTERMACHER*, MARK E. BEHRENDSEN*, AND CYNTHIA A. ZAHNOWtDepartments of *Pediatrics, tPhysiology, and tBiology, and the §Curriculum in Neurobiology, The Laboratories for Reproductive Biology, The University ofNorth Carolina at Chapel Hill, Chapel Hill, NC 27599; and IThe Laboratory of Molecular Microbiology, The National Institute of Allergy and InfectiousDiseases, National Institutes of Health, Bethesda, MD 20892

Communicated by Esmond E. Snell, October 9, 1989

ABSTRACT Histamine is a neurotransmitter in the centralnervous system and an important modulator of gastric acidsecretion, vasomotor control, inflammation, and allergic reac-tions. In biological systems the formation of histamine from itsprecursor histidine is catalyzed by the enzyme L-histidinedecarboxylase (HDC; L-histidine carboxy-lyase, EC 4.1.1.22).We have cloned HDC-encoding cDNA from a fetal rat livercDNA library (phage A gtll) and have deduced the amino acidsequence from the nucleotide sequence. The clone was provento be HDC cDNA by expression of active recombinant enzymein COS cells and by chromosomal mapping. The cDNA encodesa protein ofMr 73,450 (655 amino acid residues). The discrep-ancy between this molecular weight and the size of the purifiledfetal liver protein subunits [Taguchi, Y., Watanabe, T.,Kubota, H., Hayashi, H. & Wada, H. (1984) J. Biol. Chem.259, 5214-5221] (M, = 54,000) suggests that HDC may beposttranslationally processed. The 469 amino acid residuesfrom the amino-terminal portion of the protein share 50%identity with rat and Drosophila L-dopa decarboxylases andmuch less homology with other characterized amino aciddecarboxylases.

The biogenic amine histamine is a neurotransmitter in thecentral nervous system and an important physiological mod-ulator of gastric acid secretion, airway smooth-muscle tone,vasomotor control, inflammation, and allergic reactions (1-3). The formation of histamine from its precursor histidine iscatalyzed by the enzyme L-histidine decarboxylase (HDC;L-histidine carboxy-lyase, EC 4.1.1.22). In rodents the en-zyme is primarily localized in the brain (hypothalamic neu-rons and projections to other brain regions) (4), the glandularregions of the stomach (5), mast cells (6), and fetal liver (7).HDC has been purified from rat stomach (5) and fetal liver (7)and was shown to be a dimer consisting ofMr 54,000 subunits.Like other mammalian amino acid decarboxylases, the en-zyme utilizes pyridoxal phosphate as a coenzyme. Althoughno primary amino acid sequence of mammalian or othereukaryotic HDC is known, immunological studies indicatedthat rodent HDC is closely related to L-dopa decarboxylase(DDC) (8). In this manuscript we present the amino acidsequence of rat HDC as deduced from the fetal liver cDNA11and demonstrate its close homology to DDC, but not to othercharacterized amino acid decarboxylases.

EXPERIMENTAL PROCEDURESPreparation and Screening of cDNA Library. Poly(A)+

RNA from fetal liver (16 days after conception) was preparedby guanidine thiocyanate extraction (9) and transcribed withavian myeloblastosis virus reverse transcriptase. The second

strand was synthesized with RNase H and Escherichia coliDNA polymerase I (10). After treatment with T4 DNApolymerase to flush the ends, EcoRI linkers were applied byligation and cleaved with EcoRI. Gel filtration (SepharoseCL-4B)-purified cDNA was ligated with dephosphorylatedEcoRI-cleaved phage A gt11 DNA, and the recombinantswere packaged in vitro (11). The unampliflied library wasscreened on E. coli Y1088, with [32P]DNA as previouslydescribed (12). Positive recombinants were plaque-purified,and the DNA inserts were isolated by electroelution.

Transient Expression in Monkey Kidney Cells. The DNAinserts from isolates HDC-18 and HDC-21 were cloned intothe expression vector pCMV5, a modified pCMV plasmid(13) containing an EcoRI cloning site (the modified plasmidwas from David Russel and Matthew Lorence, University ofTexas Southwestern Medical Center, Dallas). This plasmidvector contains the cytomegalovirus (CMV) promoter-enhancer of the immediate early gene, the poly(A)+ addition-transcription terminator region of the human growth hor-mone gene, the simian virus 40 origin of replication, and apolylinker region for the insertion of cDNAs (13). Recombi-nants with the correct orientation for expression were iden-tified by restriction endonuclease analysis, and the plasmidDNA (pCMVHDC-18 and pCMVHDC-21) was purified byCsCl/ethidium bromide centrifugation. For transfection,subconfluent COS 7 cells (106 cells per 100-mm dish) weretransfected with 5 ug of pCMV DNA per dish by theDEAE-dextran method (14). After 48 hr, the cells weresuspended in Ca2+/Mg2+-free phosphate-buffered saline andlysed by three cycles of freeze (-70°)/thaw (370); the lysatewas assayed for protein and HDC and DDC enzymaticactivities. Conditions for the HDC assay were as described(15), with 0.25 mM [3H]histidine as substrate. The formationof [3H]histamine was determined by thin-layer chromatogra-phy (16).Chromosomal Localization. The production and character-

ization of the Chinese hamster-mouse somatic cell hybridshas been described (17). NFS/N mice were obtained from theSmall Animal Section, National Institutes of Health. Musmusculus musculus mice were originally trapped in Skive,Denmark, and breeding pairs were provided by M. Potter(National Cancer Institute) from his colony at HazeltonLaboratories, Rockville, MD. NFS/N females were matedwith M. m. musculus males, and the F1 females were thenmated with M. m. musculus males. DNA extracted from thelivers of the backcross progeny was typed by Southern blot

Abbreviations: HDC, histidine decarboxylase; DDC, dopa decar-boxylase; GAD, glutamic acid decarboxylase; PLP, pyridoxal phos-phate; RFLP, restriction fragment length polymorphism; ABP, art-drogen-binding protein; CMV, cytomegalovirus.1The sequence reported in this paper has been deposited in theGenBank data base (accession no. M29591).

733

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Page 2: Characterization and expression of the complementary DNA

734 Neurobiology: Joseph et al. Proc. Natl. Acad. Sci. USA 87 (1990)

hybridization for restriction fragment length polymorphisms trophoresed on 0.4% agarose gels, and transferred to nylon(RFLPs) of the hemolytic complement locus (Hc), the /32- membranes (Hybond N, Amersham); the membranes weremicroglobulin locus (B2m), and the parotid secretory protein hybridized with [32P]cDNA probes for Hc, B2m, Psp, andlocus (Psp) by using probes provided by G. Barsh (University HDC-2.3 (Fig. 1 legend).of California at San Francisco), B. Tack (Scripps Institute), Other Analytical Procedures. Restriction endonuclease di-and D. Margulies (National Institutes of Health), respec- gestions of DNA and fractionation by agarose gel electro-tively. DNAs were digested with restriction enzymes, elec- phoresis were done by standard techniques (ref. 18, pp. 98

HDC-21 G DDC -

GCACAGACAATAGTGAGTAAGAAGCCCGGCTGCCTTTCGGTCTCCTCTTTCTGTCTGCAAGCCAAGTCAAGAAAGATGATGGAGCCCAGTGAATACCaTGAATACCAAGCTAGAGGGAAA-- 120

MetMetGLuProSerGLuTyrHisGLuTyrGLnALaArgGLyLys-- 15

GAGATGGTCGATTACATCTGCCAGTACCTGAGCACCGTGCGGGAGAGGCAGGTGACCCCMMATGTGAAGCCTGGGTACCTGCGAGCCCAGATACCTTCMAGTGCTCCTGAGGMACCCGAC-- 240

GluMetValAspTyrI leCysGLnTyrLeuSerThrValArgGluArgGLnValThrProAsnValLysProGLyTyrLeuArgAlaGflIn leProSerSerALaProGtuGLuProAsp- - 55

AGCTGGGATAGCATCTTTGGGGACATTGAGCAAATCATCATGCCTGGGGTGGTTCACTGGCAGAGCCCCCACATGCACGCCTACTATCCGGCTCTCACCTCTTGGCCATCCCTGCTAGGA-- 360

SerTrpAspSerl tePheGlyAspl leGluGlnl tel LeMetProGlyVatValHisTrpGlnSerProHisMetHisAlaTyrTyrProALaLeuThrSerTrpProSerLeuLeuGLy-- 95

GATATGCTGGCTGATGCATCAACTGCTTGGGGTTCACGTGGGCTTCCAGCGGCCTGCACAGAGCTGGAGATGAACATCATGGACTGGCTGGCGAAGATGCTGGGGCTCCCGGACTTCTTC-- 480

AspMetLeuALaAspAlaSerThrALaTrpGlySerArgGlyLeuProALaALaCysThrGluLeuGluMetAsnlleletAspTrpLeuAlaLysMetLeuGtyLeuProAspPhePhe-- 135

CTGCACCACCATCCCAGCAGCCAGGGGGGAGGCGTCTTGCAGAGGACTGTCAGCGAATCCACTT TATTGCCCTGCTGGCAGCAAGGAAGAACAAAATCCTAGAAATGAAAGCGCATGAG-- 600

Leu~isHisHisProSerSerGlnGlyGlyGlyVaLLeuGonArgThrVa(SerGtuSerThrLeulLeAlaLeuLeuAlaAlaArgLysAsnLysI LeLeuGluL~etLysAtaHisGlu-- 175

CCCAATGCTGATGA-GTCCTCTCTGAACGCTCGTCTTGTTGCCTATGCCTCTGACCAGGCTCACTCTTCAG'TGGAGAAGGCTGGCTTGATTTCCCTTGTGMAGATCAAATTTCTGCCTGTG-- 720

ProAsnALaAspGluSerSerLeuAsnAlaArgLeuValAlaTyrAlaSerAspGlnALaHisSerSerValGluLysAlaGLyLeuIleSerLeuValLys IleLysPheLeuProVal-- 215

GACGACAACTTCTCACTCCGAGGAGAAGCTCTCCAGAGGCCATCGAGGAAGACAGCAACAGGGCTTGGTGCCTGTGTT TGTCTGTGCACCTTAGGGACCACTGGAGTCTGTGCAT TT - - 840

AspAspAsnPheSerLeuArgGtyGtuAtaLeuGtnLysAtal LeGluGluAspLysGlnGlnGlyLeuValProValPheValCysAlaThrLeuGlyThrThrGlyValCysAlaPhe- - 255T

GACAAGCTGTCAGAGCTGGGGCCCATCTGTGCCAGGGAGGGACTGTGGCTCCACGTCGATGCTGCTTATGCAGGAACAGCCTTTCTGlGCCCTGAGCTCCGGGGCTTCCTGAAGGGCATT-- 960

AspLysLeuSerGluLeuGLyProl leCysALaArgGLuGLyLeuTrpLeuHisValAspAlaAlaTyrAlaGlyThrAlaPheLeuArgProGLuLetArgGLyPheLeuLysGyyI le-- 295

GAGTACGCCGACTCCTTCACCTTTAACCCTTCCAAGTGGATGATGGTGCACTTTGACTGCACTGGGt TCTGGGTCAAGGACAAGTACAAGCTACAGCAGACCTTCAGTGTGAACCCCATC - -1080

GluTyrAlaAspSerPheThrPheAsnProSerLysTrpMetMetVa t Hi sPheAspCysThrG LyPheTrpValLysAspLysTyrLysLeuGlnGflnThrPheSerValAsnProI le- - 335

TACCTCAGACATGCGMACTCTGGTGTCGCCACTGACTTCATGCATTGGCAGATCCCCTTGAGCCGGCGCTTTCGCTCCAT TMGCTGTGGTTTGTGATTCGGTCCTTCGGGGTGAAGMAT --1200

TyrLeuArgHisAlaAsnSerGlyVaLAlaThrAspPheMetHisTrpGlnl leProLeuSerArgArgPheArgSerl LeLysLeuTrpPheVal IleArgSerPheGtyValLysAsn- - 375

CTTCAAGCACATGTCAGACACGGTACAGACATGGCTAAATACT TTGAATCTCTAGTCAGGAGCGACCCTGT T TTCGAAATTCCTGCTGAGAGGCACCTTGGTCTGGTGGTTTTTCGTCTG - -1320

LeuGlnAlaHisValArgHisGLyThrAspMetAlaLysTyrPheGLuSerLeuValArgSerAspProVa5PheGlul LeProAlaGluArgHisLeuGLyLeuVaLValPheArgLeu-- 415

AAGGGTCCCAACTGTCTCACAGAAA6tGTGTTAAAGGAAATAGCCAAAACT GGCCAGGTCTTCCTCATCCCAGCCACTATCCAGGACAAGCTGATCATCCGTTTCACCGT GACGTCCCAG --1440LysGLyProAsnCysLeuThrGluSerVatLeuLysGLul leALaLysThrG yGlnValPheLeul leProAlaThrI eGtnAspLysLeul lel LeArgPheThrVatThrSerGln-- 455

_| DDC

TTCACCACCAAGGATGACATCCTGAGAGAT TGGMACCTCATCCGAGAGGCTGCTAACCTTGTCCTGAGCCAGCACTGCACT TCTCAGCCGAGCCCTCGGGCCMAGMCCTTATTCCACCG - -1560

PheThrThrLysAspAspl leLeuArgAspTrpAfsnLeul LeArgGLuAlaAlaAsnLeuValLeuSerG nH isCysThrSerGlnProSerProArgAlaLysAsnL~euI leProPro- - 495

CCGGTGACCAGAGACTCCAAAGACCTGACCAATGGGCTATCCCTGGAGTCTGTCAATGAGGGAGGAGATGACCCAGTACAGGTCCGGAAGATCTTCAGGCTGCCAGGAGACAGTCTGGAA- -1680

ProValThrArgAspSerLysAspLeuThrAsnGtyLeuSetLeuGLuSerValAsnGLuGLyGLyAspAspProValGlnValArgLysI tePheArgLeuProGlyAspSerLeuGlu-- 535

ACGACAATGGATCCCTT TGATGATTGCTTCTCAGAAGAGGCCTCCGATACCACCAAGCACAAGCTGTCGTCCTT TCTGTTCAGTTACTTGTCGGTACAGACAAGAAGAGACAATGCGG - -1800

ThrThrMetAspProPheAspAspCysPheSerGtuGluAlaSerAspThrThrLysHi sLysLeuSerSerPheLeuPheSerTyrLeuSerValGtlnsnLysLysLysThrMetArg- - 575

TCCCTCAGCTGCAACAGTATGCCTATGAGTGCCCAGAAGTCACCTCCCCCAGATGCTTCCGTGAAGCATGGGGGCTTCTTCCGGGCCAGAATCTTTTCTGGGTTCCCAGAAGAAATGATG- -1920

SerLeuSerCysAsnSerletProMetSerAtaGfnLysSerProProProAspAlaSerVa lLysH i sGtyGlyPhePheArgA laArgI lePheSerGlyPheProGtluGluMet~et- - 615

ATGATGGAAJAGGTGGCTTCMMAGCTGATCAAGTTCTACAGTGTTCCCAGCTTTCCTGMTGCAGCTCTCAGTGTGGTACCCTCCAGCTGCCCTGCTGCCCTCTGCAGGCCATGGTG--2040

MetMetLysLysGlyGtyPheLysLysLeul teLysPheTyrSerVatProSerPheProGluCysSerSerGlnCysGlyThrLeuGtlnLeuProCysCysProLeuGLtnAlMetVal - - 655

TAGGTGACGGGAGTCT TCAATCAGATGCAGGGTGTGCTTCAGGGAGTTCGGGAACCCTTGMATTGTGTGCAGTTTGTGTGCTTATTATGTATGTGTGTGCATCTTGAGGGAAGTAAG--2160

End

CCCATAAT TTTGATCATAGCCTCACAGGGGTTCATGACCCACATAGATTGGMTTGGGCAGTTTAAGCTGGCATGCT TCAGAGGGTTGCAGGGGCT TGTGTGACAGAGGGGCTGAGAG --2280

AGCAGTGTCCTGTTAAGCTTGTAATGTAAAAACAACCTAGAAATAAATTGTGCCTATATCTAAAAMAAA

FIG. 1. Nucleotide sequence and deduced amino acid sequence of clone HDC-18. The nucleotide and amino acid residue numbers are

indicated at the right. The region that is homologous with the gene for Drosophila DDC is indicated above the nucleotide sequence. A consensus

poly(A) addition signal (AATAAA) is underlined. The initial methionine is the first methionine residue following the putative transcription start

site in the gene (unpublished results). The arrowhead marks the first residue of clone HDC-21. Clone HDC-2.3 begins at residue 118. The lettersabove the nucleotide sequence denote the differences in some HDC cDNA clones. Potential sites of N-glycosylation [Asn-Xaa-(Thr or Ser)]are underlined. The site at residue 304 is adjacent to the putative PLP-binding site (Lys-307) and would likely inactivate the enzyme ifglycosylated. Two sequences fitting consensus phosphorylation sites of cAMP-dependent protein kinase (28) are located at residues 357-362and 338-343. The nucleotide sequence was determined on both strands.

Page 3: Characterization and expression of the complementary DNA

Proc. Natl. Acad. Sci. USA 87 (1990) 735

and 151). Southern blotting and hybridization with nick-translated [32P]DNA was performed as described by Wahl etal. (19). Blot hybridization analysis of RNA was done asdescribed by Thomas (20) with [32P]DNA (21). For nucleotidesequencing, restriction endonuclease fragments were sub-cloned in bacteriophage M13 mpl8 or mpl9 (22). DNAsequences were determined by the dideoxy method of Sangerand coworkers (23, 24) with 35S-substituted deoxyadenosine5'-[a-thioJtriphosphate (Amersham). Rat DNA was isolatedby the technique reported by Blin and Stafford (25). DNA andprotein sequences were analyzed with the MicroGenie com-puter program (Beckman). Comparison of rat HDC and thebacterial HDC proteins was done with the FASTA program ofLipman and Pearson (26).

RESULTSDuring studies of androgen-binding protein (ABP) geneexpression, a cDNA encoding ABP (27) and DDC-relateddomains was isolated from a fetal rat liver cDNA library. ThecDNA consisted of 750 base pairs (bp) (5') ofABP cDNA and2.3 kilobases (kb) (3') that shared 60% identity of nucleotideresidues with Drosophila cDNA encoding DDC. A 2.3-kbEcoRI fragment (3') contained all of the DDC-related domainand 17 bp ofABP cDNA, but this fragment did not hybridizeto ABP cDNA with our hybridization conditions. To identifythe nature of the DDC-related domain, a fetal rat liver cDNAlibrary (phage A gtll) was screened with the 2.3-kb DNA asprobe. Ten recombinants with inserts of 1.8-2.4 kb werecharacterized by nucleotide sequence analysis. All of theclones contained identical sequences with truncations at the5'-end, the shorter clones apparently representing incom-plete cDNA transcripts. Fig. 1 shows the nucleotide se-quence and deduced amino acid sequence of the longest clone(pCMVHDC-18). The DNA encodes a protein containing 655amino acid residues ofMr 73,450. The sequence contains twopotential sites of N-glycosylation (residues 218 and 304) andtwo sites that are homologous with the phosphorylation sitesof cAMP-dependent protein kinase (Fig. 1, residues flankingSer-342 and Ser-361). Nucleotide sequence analysis of thecDNA isolates revealed that some of the HDC cDNA clonescontained three residues that were different from HDC-18.These divergences are diagrammed in Fig, 1 (nucleotideresidues 98, 157, and 928). The alterations result in changesin the encoded amino acid residues: His-8 -* Arg, Val-28 -*Leu, and Arg-285 -* Cys respectively. This heterogeneity ofRNA transcripts may represent polymorphic genes in thenoninbred Sprague-Dawley rats. At least two alleles areevident. Supporting this conclusion is the finding that allHDC cDNA-hybridizing genomic restriction DNA fragments(BamHI) map to rat chromosome 3 (D.R.J., P.M.S., andClaude Szpirer, unpublished results). These data suggest asingle gene homologous with HDC cDNA; however, morethan one gene cannot be completely ruled out.To identify the nature of the protein encoded by HDC

DNA, the cDNAs were cloned into the eukaryotic expressionvector pCMV5. Transfection ofmonkey kidney cells (COS-7)with the recombinant plasmids (pCMVHDC) yielded HDCactivity in cell extracts, whereas the controls with an unre-lated DNA insert (pCMVABP) or pCMV DNA with no insertfailed to produce detectable HDC activity (Table 1). Each ofthe cDNAs tested (pCMVHDC-21 and pCMVHDC-18) rep-resented one of the polymorphic transcripts; both yieldedHDC activity. Moreover, the COS cell extracts (pCMVHDC)yielded no measurable DDC enzymatic activity. Thus, HDCcDNA encodes a protein that can catalyze the conversion ofL-histidine to histamine but not L-dopa to dopamine.Further evidence that the cDNA encodes HDC was ob-

tained by chromosomal linkage analysis. Southern blot anal-ysis of 13 Chinese hamster-mouse hybrid cell DNAs showed

Table 1. Transient expression of HDC cDNA in monkey cellsHDC activity, units/

Recombinant DNA mg of proteinpCMVHDC-18 140pCMVHDC-21 67pCMV5 <1pCMVABP9 <1Fetal liver 130Adult liver <1

One unit ofHDC activity is defined by the formation of one pmolof [3H]histamine per min at 370C. pCMVHDC-18 and pCMVHDC-21are pCMV recombinants containing HDC cDNA (Fig. 1). pCMV5 isthe vector DNA without insert, and pCMVABP9 has an unrelatedinsert, rat ABP cDNA (27). Transient expression of pCMVABPDNA yields high-affinity androgen-binding activity, with character-istics of native ABP.

that 7 contained the mouse HDC DNA fragments and that thepresence or absence of HDC-hybridizing DNA was corre-lated with the presence or absence ofchromosome 2 (data notshown). To localize HDC DNA within chromosome 2, weanalyzed an interspecific backcross. Southern blot analysisof DNAs extracted from the progenitors of this backcrossshowed that NFS/N mouse DNA yielded a 12-kb XbaI-reactive fragment, whereas M. m. musculus DNA con-tained a 15-kb fragment (Fig. 2). Analysis of 56 backcrossprogeny showed that 31 animals inherited the NFS/N-specific fragment. In the progeny, RFLPs of two chromo-some 2 markers, Hc and Psp, could be detected after diges-tion with the appropriate restriction enzymes. The data inTable 2 confirm that HDC DNA is on mouse chromosome 2and demonstrate that it lies in the middle of the chromosomewith the gene order Hc-HDC-Psp (Fig. 2). To confirm thismap location, 23 of these mice including the only doublerecombinant were also typed for inheritance of a Sca Ipolymorphism of the P2-microglobulin gene (B2m); no re-combinants were detected. B2m and the HDC locus, Hdc, areclosely linked on chromosome 2 (Fig. 2). The genetic data

N

n2Et)y LL00 3n z.iii.6- ~ U)q"r'OW4

15-

12-

7.7-

Chromosome 2

18 - Hc

HDCcDNA 47f Hdc,B2m

64 I-Psp

FIG. 2. (Left) Southern blot hybridization of DNA from M. m.musculus, NFS/N mice, and backcrosses. Liver DNA from progen-itor mice [NFS/N and M. m. musculus derived from wild mice (laneWM)] and backcrosses was cleaved with Xba I and analyzed bySouthern blot hybridization. The segregation of the NFS/N-specificband (12 kb in lane NFS) was determined in the backcrosses. Twoof the backcrosses, SK12 and SK13, were negative and positive,respectively, for NFS/N mouse Hdc DNA. (Right) Diagram ofmouse chromosome 2, showing the location of the HDC locus (Hdc)and the loci used as markers: Hc, B2m, and Psp. The arrow indicatesthe mapped location of Hdc DNA, which corresponds to the Hdclocus. Loci positions are from Davisson et al. (29). The Hdc locuswas mapped by Martin and Bulfield (30), using allelic variants withheat-stability differences.

Neurobiology: Joseph et al.

Page 4: Characterization and expression of the complementary DNA

Proc. Natl. Acad. Sci. USA 87 (1990)

HDC 15 - K E NV DY I C O Y L S T L R E R Q V T P N V K P G Y L R A 0 I P S SA P E E PDDC 11 - K r M V D F I A E Y L E N I R E R R V L P E V K P C Y L K P L I P D A A P E K P HDC 252 -- V C A F D K L S E L G P I C A R E G L W L HV D A A Y A G r A F L C P E L R G F

* * * * *-- * * * * * * * - * DDC 248 - - S C A F D Y L D E C G P V G N K H N L W I H V D A A Y A G S A F I C P E Y R H L

HDC 55 D S W D S I F G D I E 0 1 I M P G V V N W Q S P HNM H A Y Y P A L T S W P S L LDDC 51 E K W 0 D V M QD I E R V I M P G V T H W H S P K F H A Y F P T A N S Y P A rV

HDC 95 --G D M L AD AST A W G S R G LP A C T E L E N IM D W L A K M L G L PDDDC 91 --A D ML S G IA C I G F T W IA S P A C T E L EV V MM D W L G K M L E L PA

HDC 134- F F L H H P SS G G G V L OR T V S E S T LISA L LL A R KN K I L EN KADDC 131 E F L A C S G GK G G G V I OG T A S E S T L VA S AGS 0 G E V E G GE

NDC 174 --HE P N A D E S S L N A R LV A Y A S D 0 AH S S VE K A G L I S L V K IKDDC 169 -G A P S G V G L E HT I L G K LV G Y C S D 0 A HS S VE R A G L L G G V K LR

HDC 292 - L K G I E Y A D S F T F N P S K W MN V H F D C T G F W V K D K Y K L Q 0 T F SDDC 28U -- K G I E S A D S F N F N P H K W N L V N F D C S A N W L K D P S W V V NA^ F N

HDC 332 --V N P I Y L R NA N S G V AT D F MH W IP L S R R F R S IK L W F V IR S FDDC 328 -V D P L Y L K HD N QG S AP D Y RH W Q I P L G R R F R A LK L W F V LR L Y

HDC 372--G V K NL QA HV R H G TD A K YF E S L V R S D P V F E IP A E R H LG L VDDC 368--G V E NL QA NI R R H C NF AK QF GD L C V A D S R F E L A E I N NG L V

HDC 412 V F R L K G P N C L T E S V L K E IA K T G Q V F L I PA T IO D K L I IR F TDDC 408 S F R L K G S NE R NE A L L K R IN G R G N I N L V P A K IK D V Y G LRN A

HDC 212 --F L P V D D N F S L R G E A L 0 K A I E E D K Q G L V P V F V CA T L G T T G HDC 452 -V T S 0 F T T K D D I L R D W N L I R E A ADDC 209 - S V S E N H R N R G A A L E K A I E Q D V A E G L I P F Y A V V T L G T T C 448- I C S R F TQS E D N E Y S W K E V S A AA

FIG. 3. Homology of deduced HDC amino acid sequence with Drosophila DDC. Amino acid residues are shown at the left of each line forclone HDC-21 and DDC. The amino acid sequence of Drosophila DDC is that of Eveleth et al. (31) and Morgan et al. (32), with the presumedinitiating methionine codon as described (32). The PLP-binding site ofDDC is underlined. The remainder of the two sequences has no obvioushomology. The HDC amino acid sequence shared 50% identity of residues with rat DDC (33). The single-letter amino acid code is used.

presented here are consistent with the conclusion that theHDC cDNAs represent HDC RNA transcripts. Since in bothgenetic studies all of the mouse HDC restriction endonucle-ase fragments segregated together, there is likely a singlemouse gene for HDC.Comparison of the deduced amino acid sequence of rat

HDC with the National Biomedical Research FoundationProtein Data Bank revealed extensive homology with theamino acid sequence of DDC (EC 4.1.1.26). Fig. 3 comparesthe deduced amino acid sequence of clone HDC-21 withDrosophila DDC (31, 32). The homology begins at Lys-15 ofHDC-21 and Lys-11 of Drosophila DDC and ends near the Cterminus of DDC (residue 469) and residue 473 of HDC-21.The amino acid sequence ofDDC at the pyridoxal phosphate(PLP)-binding site (Lys-303) is highly homologous with theHDC sequence (Fig. 3, underlined sequence; six of sevenresidues identical). The C-terminal 182 residues of the HDCamino acid sequence have no identity with DDC or any othersequence in the data base. Likewise, the rat HDC amino acidsequence shared -50% homology with rat DDC (Fig. 3legend). The amino acid residues of rat HDC shared only 15%overall identity with feline glutamic acid decarboxylase (34);only 90 residues around the PLP-binding site showed signif-icant similarity (50%). No obvious homology was observedbetween rat HDC and murine ornithine decarboxylase (35) orbacterial (Lactobacillus) pyruvyl-containing HDC (36, 37).However, a limited but apparently significant similarity wasobserved between the rat HDC sequence and the PLP-

Table 2. Segregation of HDC-hybridizing restriction fragmentswith those of Hc and Psp among 56 progeny of the backcross:(NFS/N x M. m. musculus) F1 x M. m. musculus

Inheritance of NFSmouse RFLP* Mice,

Progeny Hc HDC Psp no.

Parental + + + 16- - - 11

Single recombinants + + - 7_ _ + 4+ - - 8_ + + 9

Double recombinants - + - 1

*Recombination (r) values are: for Hc-HDC, r = 18 of 56 = 32 ±6.2%; for HDC-Psp, r = 12 of 56 = 21 ± 5.4%; and for Hc-Psp, r= 28 of 56 (not significant).

dependent bacterial HDC from Morganella morganii (38).Residues 217-374 of rat HDC and residues 122-278 of Mor-ganella HDC shared 39 identical residues. These regionsflank the PLP-binding site (lysine) in Morganella HDC andthe potential PLP-binding site in rat HDC.

DISCUSSIONWe have cloned the cDNA encoding fetal rat liver HDC, andthe full amino acid sequence was deduced from the nucleo-tide sequence. Although no amino acid sequence for eukary-otic HDC was available for comparison, the nature of thecloned cDNAs was proven by expression of the cDNA inCOS cells to yield HDC enzymatic activity and demonstra-tion that the cDNA sequence was linked to the Hdc locus.The amino acid sequence of the amino-terminal 462 residuesof rat HDC was highly homologous with rat DDC andDrosophila DDC. This homology included the region aroundthe PLP-binding site of Drosophila DDC. The major differ-ence was the presence of a serine residue adjacent to the ratHDC PLP-binding lysine in place of the histidine in Droso-phila DDC (31, 32). This histidine residue is conserved infeline glutamic acid decarboxylase (34) and ornithine de-carboxylase (35) and in the bacterial (E. colt) arginine, lysine,ornithine, histidine, and glutamic acid decarboxylases (39). Aserine residue adjacent to the PLP-binding lysine has beendescribed in eukaryotic aspartate and alanine aminotrans-ferases and E. coli D-serine dehydratase (39). The signifi-cance of histidine or serine in this position is not known. Theclose sequence homology of HDC with DDC is consistentwith the known immunological properties of antibodiesagainst rat HDC. Taguchi et al. (8) found that rabbit anti-ratHDC crossreacts with guinea pig DDC and HDC but not withrat DDC.

Characterization of the purified rat HDC from fetal liverrevealed that, like eukaryotic DDC, the enzyme consists oftwo Mr 55,000 subunits, which are likely identical -(7). How-ever, the cDNA clones we isolated from fetal liver encode aprotein of Mr 73,000. This difference in size suggests that theHDC primary translation product may be posttranslationallyprocessed. Perhaps the carboxyl-terminal 180 residues, whichare not homologous to DDC, are cleaved to form the catalyt-ically active HDC. Although no PLP-dependent decarboxyl-ases are known to undergo posttranslational proteolysis, an-other class of bacterial HDC, the pyruvyl-containing HDC,undergoes autocatalytic modification and cleavage at a spe-

736 Neurobiology: Joseph et al.

Page 5: Characterization and expression of the complementary DNA

Proc. Natl. Acad. Sci. USA 87 (1990) 737

cific Ser-Ser bond (40). Alternatively, the differences in sizebetween the native and deduced recombinant protein could beexplained if the denatured protein migrates aberrantly duringgel electrophoresis. However, the size of the native protein(Mr 110,000) does not support this explanation.Many studies have suggested that isozymes of HDC exist

in mammals. For example, Savany and Cronenberger (5)described three isozymes in rat gastric mucosa with pI valuesof 5.35, 5.65, and 5.90, whereas Watanabe et al. (41) identi-fied two types of HDC in fetal and cerebral tissues, with pIvalues of 5.0 and 5.4, respectively. Others have found that afraction of rat brain HDC is membrane bound (42). Ourstudies suggest several possible explanations for the ob-served heterogeneity of HDC. First, the different forms ofcloned rat HDC (possibly allelic variants) could each havedifferent pl values. Second, the presence of two potentialN-glycosylation sites in the HDC sequence raises the possi-bility that the protein is glycosylated. Glycosylated proteinsgenerally exhibit charge heterogeneity because of differencesin the carbohydrate content. The characteristics of the nativecytoplasmic protein indicate that most HDC is not glycosy-lated; however, a membrane-bound fraction could be glyco-sylated. Third, sequence analysis of HDC also revealed twopotential sites of phosphorylation that closely fit the consen-sus recognition sequence of cAMP-dependent protein kinase(28). This finding supports the in vitro studies, suggesting thatphosphorylation of brain HDC by cAMP-dependent proteinkinase leads to enzyme inactivation (43). Also, each of thetwo HDC mRNA species (major 2.7 kb and minor 3.5 kb) as

revealed by Northern hybridization analysis (unpublishedresults) could encode unique proteins (i.e., alternative splic-ing of RNA transcripts).The physiological importance of HDC in gastric acid

secretion, respiratory pathway constriction, vasomotor con-

trol, and neurotransmission is well documented (1-3). Withthe cloning of HDC cDNA and knowledge of the amino acidsequence, the tools are now available to study the mechanismthat regulates histamine biosynthesis at the molecular level.Preliminary studies in our laboratory have identified theputative regulatory promoter region of the rat Hdc gene.

Future studies on Hdc gene regulation can be related to thephysiological functions of HDC and histamine.

We thank Drs. Hiroshi Wada and David Millhorn for their manyhelpful discussions. We thank Dr. Wada for sharing the unpublishedsequence of rat DDC and Drs. Ton Joh and Un Jung Kang for sharingthe bovine DDC sequence. We thank Dr. David Russel and Dr.Matthew Lorence for giving us the pCMV5 expression vector. Weappreciate Dr. Yoshitaka Taguchi's critical review ofthe manuscript.We thank Dr. Richard Mailman and Mr. Stan Southerland for helpwith the HPLC in the DDC assays. This work was supported byNational Institutes of Health Research Grants HD21744 and 5-P30-HD18968.

1. Douglas, W. W. (1980) in The Pharmacological Basis of Ther-apeutics, ed. Gilman, A. G., Goodman, L. S. & Gilman, A.(MacMillan, New York), 6th Ed., pp. 609-646.

2. Beaven, M. A. (1978) Histamine: Its Role in Physiological andPathological Processes (Karger, Basel).

3. Schwartz, J. C., Barbin, G., Duchemin, A. M., Garbarg, M.,Llorens, C., Pollard, H., Quach, T. T. & Rose, C. (1982) inPharmacology ofHistamine Receptors, eds. Ganellin, C. R. &Parsons, M. E. (Wright-PSG, Boston), pp. 351-391.

4. Watanabe, T., Yoshitaka, T., Shiosaka, S., Tanaka, J.,Kubota, H., Terano, Y., Tohyama, M. & Wada, H. (1984)Brain Res. 295, 13-25.

5. Savany, A. & Cronenberger, L. (1982) Eur. J. Biochem. 12,593-599.

6. Yamada, M., Watanabe, T., Harino, S., Fukui, H. & Wada, H.(1980) Biochim. Biophys. Acta 615, 458-464.

7. Taguchi, Y., Watanabe, T., Kubota, H., Hayashi, H. & Wada,H. (1984) J. Biol. Chem. 259, 5214-5221.

8. Taguchi, Y., Watanabe, T., Shiosaka, S., Tohyama, M. &Wada, H. (1985) Brain Res. 340, 235-242.

9. Chirgwin, J. M., Przybyla, A. E., MacDonald, R. J. & Rutter,W. J. (1979) Biochemistry 18, 5294-5299.

10. Gubler, U. & Hoffman, B. J. (1983) Gene 25, 263-269.11. Young, R. A. & Davis, R. W. (1983) Proc. Natl. Acad. Sci.

USA 80, 1194-1198.12. Joseph, D. R., Hall, S. H. & French, F. S. (1985) J. Androl. 6,

392-395.13. Andersson, S., Davis, D. N., Dahlback, H., Jornvall, H. &

Russell, D. W. (1989) J. Biol. Chem. 264, 8222-8229.14. Deans, R. J., Denis, K. A., Taylor, A. & Wall, R. (1984) Proc.

Nati. Acad. Sci. USA 81, 1292-1296.15. Roskoski, R. & Roskoski, L. M. (1978) Anal. Biochem. 87,

293-297.16. Hegstrand, L. R. (1985) Biochem. Pharmacol. 34, 3711-3716.17. Hoggan, M. D., Halden, N. F., Buckler, C. E. & Kozak, C. A.

(1988) J. Virol. 62, 1055-1056.18. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular

Cloning:A Laboratory Manual (Cold Spring Harbor Lab., ColdSpring Habor, NY).

19. Wahl, G. M., Stern, M. & Stark, G. R. (1979) Proc. NatI.Acad. Sci. USA 76, 3683-3687.

20. Thomas, P. S. (1980) Proc. NatI. Acad. Sci. USA 77, 5201-5205.

21. Rigby, P. W. J., Dieckmann, M., Rhodes, C. & Berg, P. (1977)J. Mol. Biol. 113, 237-251.

22. Messing, J. (1983) Methods Enzymol. 101, 20-78.23. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. NatI.

Acad. Sci. USA 74, 5463-5467.24. Sanger, F. & Coulson, A. R. (1975) J. Mol. Biol. 94, 441-448.25. Blin, N. & Stafford, D. W. (1976) Nucleic Acids Res. 3,

2303-2308.26. Lipman, D. J. & Pearson, W. R. (1985) Science 227, 1435-

1441.27. Joseph, D. R., Hall, S. H. & French, F. S. (1987) Proc. Natl.

Acad. Sci. USA 84, 339-343.28. Glass, D. B. & May, J. M. (1984) Collagen Relat. Res. 4,

63-74.29. Davisson, M. T., Roderick, T. M., Hillyard, A. L. & Doolittle,

D. P. (1988) Mouse News Lett. 81, 12-19.30. Martin, S. A. M. & Bulfield, G. (1984) Biochem. Genet. 22,

645-656.31. Eveleth, D., Gietz, R. D., Spencer, C. A., Nargang, F. E.,

Ross, B. H. & Marsh, J. L. (1986) EMBO J. 5, 2663-2672.32. Morgan, B. A., Johnson, W. A. & Hirsh, J. (1986) EMBO J. 5,

3335-3342.33. Tanaka, T., Horio, Y., Taketoshi, M., Imamura, I., Ando-

Yamamoto, M., Kangawa, K., Matsuo, H., Kuroda, M. &Wada, H. (1989) Proc. Natl. Acad. Sci. USA 86, 8142-8146.

34. Kobayashi, Y., Kaufman, D. L. & Tobin, A. J. (1987) J.Neurosci. 7, 2768-2772.

35. Kahana, C. & Nathans, D. (1985) Proc. Natl. Acad. Sci. USA82, 1673-1677.

36. Huynh, Q. K. & Snell, E. E. (1985) J. Biol. Chem. 260,2798-2803.

37. Vanderslice, P., Copeland, W. C. & Robertus, J. D. (1986) J.Biol. Chem. 261, 15186-15191.

38. Vaaler, G. L., Brasch, M. A. & Snell, E. E. (1986) J. Biol.Chem. 261, 11010-11014.

39. Tanase, S., Kojima, H. & Morino, Y. (1979) Biochemistry 18,3002-3007.

40. Huynh, Q. K. & Snell, E. E. (1986) J. Biol. Chem. 261,1521-1524.

41. Watanabe, T. & Wada, H. (1983) in Methods in Biogenic AmineResearch, eds. Parvez, S., Nagatsu, T., Nagatsu, I. & Parvez,H. (Elsevier, Amsterdam), pp. 689-720.

42. Palacios, J. M. & Blanco, I. (1988) J. Neurochem. 51, 1400-1406.

43. Huszti, Z. & Magyar, K. (1984) Agents Actions 14, 546-549.

Neurobiology: Joseph et al.