structure-function analyses and molecular modeling of caffeic acid-o-methyltransferase and...

20
Structure–function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways during monolignol biosynthesis Huma Naaz Veda P. Pandey Swati Singh Upendra N. Dwivedi Department of Biochemistry, Bioinformatics Infrastructure Facility, Centre of Excellence in Bioinformatics, University of Lucknow, Lucknow, U.P., India Abstract Ten protein sequences, each of caffeic acid-O-methyltransferase (COMT) and caffeoyl-coenzyme A-O-methyltransferase (CCoAOMT), catalyzing methylation of precursors of monolignol from selected dicots and monocots have been analyzed and compared on the basis of their amino acid sequence, motifs/domains, three-dimensional (3D) structure, and substrate binding. The isoelectric points of all the COMT and CCoAOMT sequences analyzed were found to vary in the pH range of 5 to 6. Molecular weight analyses suggested CCoAOMT to be smaller monomeric proteins (27–29 kDa) as compared with those of COMTs (39–40 kDa), which were dimeric. On the basis of phylogenetic analysis, COMT and CCoAOMT were clustered into two major groups, each of which could be further divided into two subgroups of monocots and dicots. Modeling and superimposition of COMT and CCoAOMT sequences of alfalfa (Medicago sativa) revealed that both were quite different at the 3D levels, although they had similarity in the core region. Molecular docking of 16 putative substrates (intermediates of monolignol biosynthesis pathway) revealed that both enzymes interact with all 16 substrates in a similar manner, with thiol esters being the most potent and binding of these putative substrates to CCoAOMT being more efficient. C 2013 International Union of Biochemistry and Molecular Biology, Inc. Volume 60, Number 2, Pages 170–189, 2013 Keywords: COMT, CCoAOMT, homology modeling, molecular docking, phylogenetic analysis, substrates specificity 1. Introduction Lignin, a phenolic heteropolymer, is present in the cell wall of plants and provides rigidity, strength, and resistance to chemical, physical, as well as biological attacks. Lignin, which Abbreviations: SAM, S-adenosyl methionine; SAH, S-adenosyl homocysteine; MD, Modeller; DS, Discovery Studio; ProSA, protein structure prediction; ProQ, protein quality predictor; SAVES, structural analysis and verification server. Address for correspondence: Upendra N. Dwivedi, PhD, Department of Biochemistry, Bioinformatics Infrastructure Facility, Centre of Excellence in Bioinformatics, University of Lucknow, Lucknow 226007, Uttar Pradesh, India. Tel.: +91 522 2740132; Fax: +91 522 2740132; e-mail: [email protected]. Received 15 August 2012; accepted 4 December 2012 DOI: 10.1002/bab.1075 Published online 18 April 2013 in Wiley Online Library (wileyonlinelibrary.com) composes an average of 25% of the plant biomass, is considered as one of the greatest obstacles to the optimal utilization of the plant biomass for various purposes such as paper manu- facturing, production of highly palatable forage, and bagasse utilization. Chemically, lignin is made up of mainly three kinds of monolignols, namely p-coumaryl alcohol, sinapyl alcohol, and coniferyl alcohol. These monolignols are synthesized via phenyl propanoid pathway, which starts with deamina- tion of phenylalanine, followed by successive hydroxylations, methylations, thiol ester formation, and two reduction re- actions leading to the formation of the three major kinds of monolignols. As far as methylation of the intermediates in the lignin biosynthesis pathway is concerned, classically it takes place at the level of free acids by the enzyme caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferases (or caf- feic acid-O-methyltransferase); EC 2.1.1.68 (COMT). Thus, COMT is capable of S-adenosyl methionine (SAM)-dependent methylation of caffeic acid at the 3-hydroxy position to yield ferulic acid and also of 5-hydroxyferulic acid at the 5-hydroxy 170

Upload: upendra-n

Post on 09-Dec-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Structure–function analyses and molecular

modeling of caffeic acid-O-methyltransferase

and caffeoyl-CoA-O-methyltransferase:

Revisiting the basis of alternate methylation

pathways during monolignol biosynthesis

Huma Naaz

Veda P. Pandey

Swati Singh

Upendra N. Dwivedi∗

Department of Biochemistry, Bioinformatics Infrastructure Facility, Centreof Excellence in Bioinformatics, University of Lucknow, Lucknow, U.P., India

Abstract

Ten protein sequences, each of caffeicacid-O-methyltransferase (COMT) and caffeoyl-coenzymeA-O-methyltransferase (CCoAOMT), catalyzing methylation ofprecursors of monolignol from selected dicots and monocotshave been analyzed and compared on the basis of their aminoacid sequence, motifs/domains, three-dimensional (3D)structure, and substrate binding. The isoelectric points of allthe COMT and CCoAOMT sequences analyzed were found tovary in the pH range of 5 to 6. Molecular weight analysessuggested CCoAOMT to be smaller monomeric proteins(27–29 kDa) as compared with those of COMTs (39–40 kDa),which were dimeric. On the basis of phylogenetic analysis,

COMT and CCoAOMT were clustered into two major groups,each of which could be further divided into two subgroups ofmonocots and dicots. Modeling and superimposition of COMTand CCoAOMT sequences of alfalfa (Medicago sativa) revealedthat both were quite different at the 3D levels, although theyhad similarity in the core region. Molecular docking of 16putative substrates (intermediates of monolignol biosynthesispathway) revealed that both enzymes interact with all 16substrates in a similar manner, with thiol esters being the mostpotent and binding of these putative substrates to CCoAOMTbeing more efficient. C© 2013 International Union of Biochemistry andMolecular Biology, Inc. Volume 60, Number 2, Pages 170–189, 2013

Keywords: COMT, CCoAOMT, homology modeling, molecular docking,phylogenetic analysis, substrates specificity

1. IntroductionLignin, a phenolic heteropolymer, is present in the cell wallof plants and provides rigidity, strength, and resistance tochemical, physical, as well as biological attacks. Lignin, which

Abbreviations: SAM, S-adenosyl methionine; SAH, S-adenosylhomocysteine; MD, Modeller; DS, Discovery Studio; ProSA, proteinstructure prediction; ProQ, protein quality predictor; SAVES, structuralanalysis and verification server.∗Address for correspondence: Upendra N. Dwivedi, PhD, Department ofBiochemistry, Bioinformatics Infrastructure Facility, Centre of Excellence inBioinformatics, University of Lucknow, Lucknow 226007, Uttar Pradesh,India. Tel.: +91 522 2740132; Fax: +91 522 2740132; e-mail:[email protected] 15 August 2012; accepted 4 December 2012DOI: 10.1002/bab.1075Published online 18 April 2013 in Wiley Online Library(wileyonlinelibrary.com)

composes an average of 25% of the plant biomass, is consideredas one of the greatest obstacles to the optimal utilization ofthe plant biomass for various purposes such as paper manu-facturing, production of highly palatable forage, and bagasseutilization. Chemically, lignin is made up of mainly three kindsof monolignols, namely p-coumaryl alcohol, sinapyl alcohol,and coniferyl alcohol. These monolignols are synthesizedvia phenyl propanoid pathway, which starts with deamina-tion of phenylalanine, followed by successive hydroxylations,methylations, thiol ester formation, and two reduction re-actions leading to the formation of the three major kinds ofmonolignols. As far as methylation of the intermediates inthe lignin biosynthesis pathway is concerned, classically ittakes place at the level of free acids by the enzyme caffeicacid/5-hydroxyferulic acid 3/5-O-methyltransferases (or caf-feic acid-O-methyltransferase); EC 2.1.1.68 (COMT). Thus,COMT is capable of S-adenosyl methionine (SAM)-dependentmethylation of caffeic acid at the 3-hydroxy position to yieldferulic acid and also of 5-hydroxyferulic acid at the 5-hydroxy

170

Page 2: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

position to yield sinapic acid [1]. Ferulic acid and sinapic acidthen could be converted, successively, to their related thiolesters, aldehydes, and alcohols, yielding the coniferyl (guaiacylor G) and sinapyl (syringyl or S) monolignols, respectively. Thethree monolignols (p-coumaryl alcohol, sinapyl alcohol, andconiferyl alcohol) vary only by the 3- and 5-methoxyl groups.Enzymatic reactions controlling the methylation of the 3- and5-hydroxyls of monolignol precursors are important to de-termine the lignin composition [2]. On the basis of a numberof biochemical and transgenic studies carried out over morethan a decade, alternative pathways for methylation of ligninprecursors have been proposed. According to these so-calledconcepts of “metabolic grid” and “overlapping pathways,”COMT can also methylate substrates at the levels of aldehydesand alcohols, whereas another SAM-dependent methylating en-zyme, namely caffeoyl-coenzyme A/5-hydroxyferuloyl-CoA 3/5-O-methyltransferases [or caffeoyl-CoA-O-methyltransferase](EC 2.1.1.104) (CCoAOMT), can methylate the CoA esters of 3-hydroxy and 5-hydroxy acids [3,4]. These studies have indicatedthat the methylation pathways in monolignol biosynthesis aremuch more complicated than those reported previously [3,5,6].

In view of alternative methylation pathways reported overmore than a decade, based on in vivo/in vitro wet laboratory ex-perimental data, it has been suggested that both COMTs as wellas CCoAOMTs can efficiently catalyze the methylation of sub-strates, which are traditionally not considered as their normalsubstrates [7, 8]. Thus, investigation of tobacco COMT to cat-alyze methylation of various nontraditional substrates, namelycaffeoyl and 5-hydroxyferuloyl esters, aldehydes, as well asalcoholic substrates, revealed efficient catalysis (based onVmax/Km), thereby suggesting that COMT-mediated syringyl unitsynthesis can be operated at the levels of free acids, CoA esters,aldehydes, or alcohols [9]. Furthermore, Parvathi et al. [10],in alfalfa (Medicago sativa) COMT, have investigated COMT-catalyzed methylation with a number of potential substrates ofmonolignol biosynthesis pathway; their results reveal activitiesin the order caffeoyl-CoA > caffeoyl alcohol > 5-hydroxyferulicacid > caffeoyl aldehyde > 5-hydroxyconiferyl alcohol > 5-hydroxyferuloyl CoA > 5-hydroxyconiferaldehyde > caffeicacid. Using antisense downregulated transgenic plants harbor-ing CCoAOMT and COMT gene constructs, they have furtherprovided evidence for an alternative methylation pathway in-volving caffeoyl aldehydes, caffeoyl alcohols, or caffeoyl-CoAesters by COMT. However, there is still ambiguity with regardto substrate affinities and efficiency of catalysis for COMTs andCCoAOMTs. Therefore, this needs further validation.

On the basis of sequence analyses, it is proposed thatCOMT and CCoAOMT belong to two distinct families of O-methyltransferases and are encoded by multigene families inmany plant species [11]. COMTs on an average are made upof about 350 amino acids and have an N-terminal domain(dimerization domain) that participates in homodimerizationand do not require metal ions for activity. As compared withCOMTs, CCoAOMTs are relatively smaller proteins with anaverage of about 220 amino acids and without an N-terminal

dimerization domain and require metal ions for activity. PlantCOMTs in general have been suggested to be dimeric, whereasthose of CCoAOMT are suggested to be monomeric in solutionfor catalysis [5,12,13].

At least three domains have been reported in COMTs,namely AdoMet domain, methyltransferase, and a dimerizationdomain, which are involved in SAM/S-adenosyl homocysteine(SAH) binding, phenolic substrates binding, and dimer forma-tion, respectively. Thus, in the case of alfalfa COMT, Zubietaet al. [5] reported a large C-terminal catalytic domain involvedin SAM/SAH and phenolic substrate binding, whereas a smallerN-terminal dimerization domain was involved in dimer for-mation. Comparison of the CCoAOMT structures with otherwell-known OMT structures exhibits a high level of structuralsimilarity in the SAM binding region of methyltransferases, sug-gesting that the SAM binding protein fold remained conservedduring evolution [14].

In the present paper, 10 each of COMT and CCoAOMT se-quences from selected dicots and monocots have been analyzedand compared for parameters like molecular weights, isoelec-tric point (pI), domain analysis, their cellular localization, thephylogenetic relationship, and substrate specificity. The COMTand CCoAOMT sequences of M. sativa were selected for three-dimensional (3D) structural modeling and docking analyses,with intermediates of monolignol biosynthesis pathway as 16putative substrates in order to find out the efficiency and affin-ity of these substrates toward binding to COMT and CCoAOMTand catalysis. Results of molecular docking analyses have beendiscussed in light of in vivo/in vitro wet laboratory experimen-tal data to throw light on the newer roles of these enzymesin methylation of these intermediates (putative substrates) ofmonolignol biosynthesis pathway.

2. Materials and Methods2.1. Sequence analysesMultiple sequence alignment of 10 sequences each ofCOMT and CCoAOMT from various plant sources wasdone using clustalW tool of EBI (Hinxton, CB, UK)(http://www.ebi.ac.uk/Tools/msa/clustalw2/). A phylogenetictree for these sequences was produced by MEGA (ASU,Tempe, AZ, USA) software to analyze the sequence con-servation. Theoretical pI and molecular weight of COMTsand CCoAOMTs were calculated using PI/MW tool of ExPASy(SIB, Geneva, Switzerland) (http://web.expasy.org/protparam/).The soluble and membrane protein prediction by an-alyzing transmembrane region was done with SOSUI(SIB, Geneva, Switzerland) server (http://bp.nuap.nagoya-u.ac.jp/sosui/). Domain prediction was done with thehelp of the CDD tool of NCBI (NIH, Bethesda, MD, USA)(http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi).

2.2. Preparation of protein structuresThe 3D structures of COMT and CCoAOMT of M. sativa wereconstructed using Modeller (MD) 9v9 (UCSF, San Francisco, CA,

Biotechnology and Applied Biochemistry 171

Page 3: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

USA) [15, 16] as well as Discovery Studio 3.1 (DS) (Accelrys,San Diego, CA, USA). Stereochemical validation of the gen-erated models was done using PROCHECK of the structuralanalysis and verification server (SAVES) (UCLA, Los Ange-les, CA, USA) (http://nihserver.mbi.ucla.edu/SAVES/), ERRAT(UCLA, Los Angeles, CA, USA), protein structure prediction(ProSA) (USB, Hellbrunnerstrasse, SB, Austria), and the proteinquality predictor (ProQ) (SBC, Solna, Sweden) tools [17–19].Validated models were subsequently submitted to ProteinModel Database as theoretical models, and were assignedidentity numbers PM0078258 and PM0078259 for COMT andCCoAOMT, respectively.

2.3. Docking studies2.3.1. Preparation of the target and ligands for dockingThe binding pockets and active sites of COMT and CCoAOMTwere recognized by Computed Atlas of Surface Topogra-phy of Proteins (CASTp) server (UIC, Chicago, IL, USA)(http://sts.bioengr.uic.edu/castp/) [20]. The largest cavitiespresent on enzymes were selected for docking the ligands. Themol files of various ligands were retrieved from the Pubchemdatabases of NCBI [21] and these mol files were converted intoPDB files with the help of an open babel converter. Moleculeswere docked at the binding sites by Autodock 4.0 (SRI, La Jolla,CA, USA) [22] as well as by DS.

The binding region for the docking study by Autodock wasdefined at a 0.375 A radius sphere centered on the active site. Agrid box of dimensions 50 × 50 × 50 A3 was constructed aroundthe binding site, based on the cocrystallized ligand. Ten geneticalgorithm (GA) runs were performed for each compound, and10 ligand bumps were allowed in an attempt to account formutual ligand/target fit. Each of the GA runs was performedon a population of 150 individuals. The rates of mutation andcrossover were set to 0.02 and 0.8, respectively. The valueof binding energy was used to rank the docking positions ofthe molecules. The clusters with lowest binding energy wereselected.

Docking was also done with the help of CDOCKER of DS.The binding site was detected by the “define and edit bindingsite” tool, and the binding site was selected on the basis ofreceptor cavities. The random conformations were set to 10with 1,000 dynamic steps; CHARMm force field was used tominimize the structures. The value of CDOCKER energy wasused to rank the docked substrates; the pose with minimumCDOCKER energy was selected to be the best one. Dockinganalysis of all the 16 putative substrates was also done withLIBDOCK of DS and the results were in the form of scores,and the pose with highest score was selected as the best;results of LIBDOCK were in agreement with the CDOCKERenergies.

2.3.2. Enzyme–substrate interactionsDocking interactions between enzymes and putative substrateswere analyzed with the help of the “analyze complexes” optionof CDOCKER tool of DS.

2.4. Superimposition of modeled COMT and CCoAOMTstructuresModeled COMT and CCoAOMT structures were superimposedusing Chimera (UCSF, San Francisco, CA, USA) as well as DS.

3. Results and Discussion3.1. Sequence analyses of COMT and CCoAOMTProtein sequences encoding COMTs and CCoAOMTs from 10common plant sources were analyzed and compared for theirtheoretical pIs, molecular weights, domain prediction, andtransmembrane regions. Results are presented in Table 1. ThepIs of all the COMT and CCoAOMT sequences analyzed werefound to vary in the pH range of 5 to 6, suggesting that all ofthem are acidic in nature. Molecular weights of COMTs werefound in the range of 39–40 kDa, whereas those of CCoAOMTswere in the range of 27–29 kDa, suggesting CCoAOMT tobe smaller proteins as compared with those of COMTs. Inagreement with this, Hong et al. [23] have also reported thatCOMTs are relatively bigger proteins having molecular weightin the range of 38 to 43 kDa, whereas those of CCoAOMTs aresmaller proteins having molecular weight in the range of 23 to27 kDa.

Domain prediction analysis of the selected COMT se-quences revealed a major methyltransferase 2 superfamilydomain containing AdoMet MTases domain (except for bishop’sweed [Ammi majus]). In addition, they possessed a dimeriza-tion domain at N-terminal. On the other hand, all the analyzedCCoAOMTs possessed a major AdoMet MTases superfamilydomain and a methyltransferase 3 domain. All the COMTsequences analyzed for domain prediction were unique inthe sense that all of them contained a dimerization domain,whereas CCoAOMT sequences lacked this dimerization domain.Presence of a dimerization domain in all the COMTs and itsabsence from all the CCoAOMTs analyzed suggested COMTs asdimers, and CCoAOMTs as monomers, in their native confor-mation, essential for catalysis. Zubieta et al. [5] have reportedthat alfalfa OMT forms a dimer in solution, and this dimericstructure is essential for its activity. On the other hand, Hoff-mann and coworkers [11] have reported that CCoAOMT fromtobacco exists as a catalytically active monomer in solution.Furthermore, alfalfa CCoAOMT has also been shown to cat-alyze transmethylation in monomeric forms [8]. Investigationof localization of COMTs and CCoAOMTs (cytosolic or mem-brane bound) by analyzing transmembrane regions on thesesequences revealed the absence of any transmembrane regionfrom all the sequences, suggesting all of them to be solubleproteins (i.e., cytosolic localization). Similar to our observation,COMTs and CCoAOMTs from wheat, maize, and alfalfa havebeen shown to be localized in cytosolic regions [24,25].

3.2. Conserved motif analysisThe 10 protein sequences from both COMT and CCoAOMTwere analyzed for identifying conserved regions usingClustalW tool. Results are shown in Fig. 1. Thus, among

172 Overlapping Methylation by COMT and CCoAOMT

Page 4: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TABLE 1Isoelectric point (pI), molecular weight, and domain prediction of COMT and CCoAOMT protein sequences from selected plant

sources

COMT CCoAOMT

S. No. Plant source pI Molecular weight (Da) Domainsa pI Molecular weight (Da) Domainsa

1 Ammi majus 6.09 40,151.57 MT 2, DD 5.29 27,042.97 MT 3, AMM

2 Sorghum bicolor 5.46 39,632.54 MT 2, DD 5.23 29,062.21 MT 3, AMM.

3 Zea mays 5.15 39,670.69 MT 2, DD 5.21 29,351.33 MT 3, AMM

4 Oryza sativa 5.23 39,578.49 MT 2, DD 5.21 28,847.77 MT 3, AMM

5 Populous trichocarpa 5.40 39,272.29 MT 2, DD 5.30 27,806.96 MT 3, AMM,

6 Acacia auriculiformis 5.61 39,807.04 MT 2, DD 5.46 28,178.30 MT 3, AMM

7 Leucaena leucocephala 5.60 39,839.16 MT 2, DD 5.31 27,550.63 MT 3, AMM

8 Jatropha curcas 5.52 39,783.18 MT 2, DD 5.29 27,915.89 MT 3, AMM

9 Vitis vinifera 5.48 41,969.34 MT 2, DD 5.27 27,234.28 MT 3, AMM

10 Medicago sativa 5.67 39,946.22 MT 2, DD 5.55 27,999.16 MT 3, AMM

aDomains: MT 2, methyl transferase 2 superfamily; MT 3, methyl transferase 3; DD, dimerization domain; AMM, AdoMet Mtases superfamily.

FIG. 1Conserved regions among COMT (A) andCCoAOMT (B).

the COMT sequences analyzed, seven conserved regions(motifs), namely NEDGVS(143–148), LMNQDK(154–159),LDGGIPFNKAXGMTAFEYHG(174–193), VDVGGGTGA(232–240), GINFDLPHV(254–262), GVEHVGGDMF(272–280), andIMLAHPGGK (348–357), were observed. All of these conservedregions in COMT sequences were found to span between themiddle and C terminus of these sequences. On the basis ofthe multiple sequence alignment of COMT protein sequencesderived from plant sources, Ibrahim et al. [26] have identifiedfive conserved regions (I–V), rich in glycine, toward the mid toC-terminal region of the polypeptide of the protein sequences.Similarly, in our study, the conserved Glycine-rich motifs werealso observed in the mid to C-terminal end of the COMT. In the

case of CCoAOMT, however, nine conserved regions (motifs),namely EVGHKSLL (36–43), LYQYIL (48–53), TSVVPRE (55–61),MTTSADEGQFL (80–90), AKNTMEIGVYTGYSLLATALALP (99–121), EGPALPVLD (156–164), FVDADKDNYLNYH (180–192),GYDNTLWNGSV (206–216), and LPVGDG (253–258), were ob-served. Similarly, Ghosh et al. [27] have also reported eightconserved motifs in Hibiscus cannabinus CCoAOMT, in whichthe three motives are conserved for the SAM binding site. Incomparison with conserved regions of COMT, CCoAOMT se-quences were found to be scattered all along the polypeptidechain. Furthermore, conserved regions in COMTs were foundto be rich in glycine residues, whereas those of CCoAOMTsequences were rich in lysine. It is noteworthy that muchconservation was observed among the various sequences be-longing to COMT and CCoAOMT, separately. However, therewas less conservation between COMT and CCoAOMT.

Biotechnology and Applied Biochemistry 173

Page 5: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

FIG. 2Dendogram showing phylogenetic relationships ofCOMT (A) and CCoAOMT (B) protein sequencesfrom various plant sources.

3.3. Phylogenetic analysisPhylogenetic analysis of all selected sequences of COMT andCCoAOMT was done using MEGA software. Results are shown inFig. 2. It is noteworthy that these sequences are clustered intotwo separate groups, one group composed of COMT sequencesand the other group composed of CCoAOMT sequences, vali-dating the results of conserved sequence analysis, as describedabove. Furthermore, both COMT and CCoAOMT sequencesbifurcated into two subgroups, namely monocots and dicots.

In the UPGMA-based tree generated with these 20 proteinsequences (10 each of COMT and CCoAOMT), five nodes werestrongly supported as they appeared with a frequency above80%. The three COMTs and CCoAOMTs belonging to monocotswere clearly separated from the other dicots. In the COMTcluster, Vitis vinifera was separated with other dicots with apercentage frequency of 92%; the other branch of this groupis monophyletic, sharing a common ancestor, whereas in thecase of monocot, Oryza sativa was separated with the groupof Zea mays and Sorghum bicolor with a 100% frequency. Inthe CCoAOMT cluster, Acacia magnium was separated withother dicots with a 100% frequency, whereas for other branchforming a monophyletic group, in the dicots group O. sativawas separated with the branch comprising Sorghum bicolorand Zea mays with a 95% frequency.

3.4. 3D structure modeling of COMT and CCoAOMTof M. sativa using MD and DS softwareHomology modeling of COMT and CCoAOMT of M. sativa wasdone using two different software tools, namely MD and DS.For this purpose, COMT (PDB ID 1KYZ) and CCoAOMT (PDB

ID 1SUI) of M. sativa were used as templates. Out of the fivemodels, each of COMT and CCoAOMT generated using twosoftware tools, models of COMT and CCoAOMT with minimumDOPE scores were found to be the most acceptable and henceselected. Both the structures modeled using MD 9v9 and DSversion 3.1 were found to be close. The most acceptable modelof COMT and CCoAOMT as generated using MD 9v9 is shownin Fig. 3. Both these models of COMT and CCoAOMT werefurther validated using PROCHECK, ERRAT, ProSA, and ProQtools. Results are presented in Table 2. The data presented inTable 2 suggest that both the generated models are reliable.

3.5. Superimposition of protein structures of COMTand CCoAOMTModeled COMT and CCoAOMT protein structures were su-perimposed using DS and Chimera. Results are shown inFig. 4. The RMSD score of superimposed structures was found

FIG. 3Modeled structure of COMT (A) and CCoAOMT(B) of M. sativa through Modeller.

174 Overlapping Methylation by COMT and CCoAOMT

Page 6: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TABLE 2Validation of Modeller (MD) and Discovery Studio (DS) generated models of COMT and CCoAOMT of M. sativa using various

tools

Tool Parameter COMT CCoAOMT

MD/ DS Template chosen 1KYZ 1SUI

MD/ DS DOPE score −40,259.19141(MD) −27,489.00977 (MD)

−41,356.4 (DS) −27,277.7 (DS)

PROCHECK (Ramachandran Plot) Percentage of residuesin favored region

93.4% (MD) 88.5% (MD)

92.4% (DS) 89.4% (DS)

ERRAT—overall quality factor Overall quality factora 84.551 (MD) 88.696 (MD)

89.045 (DS) 84.519 (DS)

ProSA Z score −8.91 (MD) −6.94 (MD)

−8.65 (DS) −5.8 (DS)

ProQ Predicted LG score 4.036 (MD) 4.563 (MD)

4.745 (DS) 4.390 (DS)

aOverall quality factor of protein model, expressed as the percentage of the protein for which the calculated error value falls below the 95%rejection limit.

FIG. 4Superimposed structures of COMT (pink) andCCoAOMT (cyan).

to be 21.523, suggesting a poor superimposition of the twostructures. However, at the core of the enzyme structures, asimilarity was observed in spite of the poor sequence similaritybetween the two. In agreement with our results based on the 3D

superimposition of the CCoAOMT and catechol OMT structures,Ferrer et al. [14] reported a high level of similarity in the coreof the enzyme, despite poor sequence similarity.

3.6. Docking studies3.6.1. Comparison of substrate specificities ofCCoAOMTs and COMTsModeled structures of COMT and CCoAOMT of M. sativa weredocked with metabolic intermediates of monolignol biosyn-thesis pathway, namely caffeic acid, sinapic acid, ferulic acid,5-hydroxyferulic acid, their CoA esters, and coniferyl and caf-feoyl alcohols, their aldehydes, 5-hydroxy coniferyl aldehyde,5-hydroxy coniferyl alcohol, SAM, and SAH as putative sub-strates using Autodock as well as CDOCKER and LIBDOCKtools of DS. Results of the docking in terms of binding energyand dissociation constant are presented in Table 3. From thedata presented in Table 3, it is evident that both COMT andCCoAOMT bind all 16 putative substrates efficiently and ef-fectively. The binding energy of various substrates to theseenzymes decreased in the following order: free acid substrates(caffeic acid, sinapic acid, and 5-hydroxyferulic acid) > al-coholic and aldehyde substrates (coniferyl alcohol, ceffeoylalcohol, coniferyl aldehyde, caffeoyl aldehyde, 5-hydroxylconiferyl aldehyde, and 5-hydroxy coniferyl alcohol) > thiolester substrates (caffeoyl-CoA, feruloyl-CoA, 5-hydroxyferuloylCoA, and sinapoyl-CoA), suggesting the best binding for thiolester substrates for both COMT and CCoAOMT. Also on the basisof CDOCKER energy analysis, thiol ester substrates were found

Biotechnology and Applied Biochemistry 175

Page 7: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

TABLE 3Summary of results of docking of various putative substrates with COMT and CCoAOMT using Autodock

COMT CCoAOMT

Binding energy Dissociation Binding energy DissociationS. No. Substrate (kcal/mol) constant (mM) (kcal/mol) constant (mM)

1 Caffeic acid −3.59 2.32 −5.27 1.03

2 Ferulic acid −3.26 4.07 −5.26 0.138

3 Sinapic acid −4.07 1.04 −4.59 0.430

4 5-Hydroxy ferulic acid −4.06 1.06 −4.44 0.553

5 Coniferyl aldehyde −5.32 0.125 −6.61 0.0142

6 Coniferyl alcohol −5.62 0.075 −7.26 0.0047

7 Caffeoyl aldehyde −5.03 0.205 −6.76 0.011

8 Caffeoyl alcohol −5.38 0.114 −7.37 0.0039

9 5-Hydroxyconiferyl aldehyde −5.39 0.111 −7.36 0.0040

10 5-Hydroxyconiferyl alcohol −5.28 0.134 −6.18 0.29

11 Caffeoyl-CoA −11.92 1.8 × 10−6 −13.86 6.9 × 10−8

12 Feruloyl-CoA −14.24 3.6 × 10−8 −16.38 9.89 × 10−10

13 Sinapoyl-CoA −15.79 2.9 × 10−9 −30.46 4.729 × 10−20

14 5-Hydroxyferuloyl CoA −11.34 4.8 × 10−6 −30.02 9.749 × 10−20

15 S-Adenosyl methionine −4.93 0.243 −5.89 0.048

16 S-Adenosyl homocystine −7.30 0.0044 −7.59 0.0027

to be best for both COMT and CCoAOMT (data not shown).Furthermore, binding energies for all 16 putative substrateswere found to be lower for CCoAOMT than those for COMT,suggesting a better binding of all the substrates to CCoAOMT incomparison with COMT.

Analysis of the dissociation constant (inversely related tothe binding constant of the ligand to the target [enzyme] andthereby with affinity of binding of the ligands to the target) forall the 16 putative substrates to COMT and CCoAOMT revealedthat free acids bind to CCoAOMT with almost 4–10 times higheraffinity than those for COMT (dissociation constant ∼1–4 mMfor COMT and ∼0.1–1.0 mM for CCoAOMT). Similarly, in thecase of aldehydes and alcohols, about 6–25 times higher affin-ity for CCoAOMT than those of COMT (dissociation constant∼0.1–0.2 mM for COMT and ∼0.004–0.03 mM for CCoAOMT)was found. In the case of thiol ester (CoA) derivatives, 10 toseveral thousand times higher affinity for CCoAOMT than thatof COMT (dissociation constant 2.93 × 10−9–4.84 × 10−6 mM forCOMT and 2.18 × 10−20–9.89 × 10−10 mM for CCoAOMT) wasfound. Thus, based on binding energy as well as dissociationconstant, thiol ester derivatives were found to be the bettersubstrates, among all, for CCoAOMT as well as COMT, withsinapoyl-CoA being the best for both enzymes. Furthermore,

between COMT and CCoAOMT, binding with CCoAOMT wasfound to be more effective and efficient than that of COMT.Thus, our in silico studies suggest that alcoholic and aldehy-dic substrates are preferred to those of caffeic, sinapic, andferulic acid by both COMT and CCoAOMT with a marked pref-erence for CoA ester substrates over free acids, aldehydes,and alcohols. More importantly, it is noteworthy that alongwith CCoAOMT, COMT can also methylate CoA esters effi-ciently and is also active toward 5-hydroxy coniferyl alcohol,indicating that COMT can catalyze syringyl unit synthesis inplant and may operate at the free acid, CoA ester, or alcohollevels [4,9].

In agreement with our in silico findings, a number ofreports from in vitro experimental studies are available(Table 4), which suggest broad substrate specificity forboth COMT as well as CCoAOMT, supporting the concept ofmetabolic grid/overlapping pathways in monolignol biosyn-thesis where methylation can occur at various levels andby both of these enzymes. Thus, Zubieta et al. [5] have re-ported that the alfalfa COMT shows marked preference for5-hydroxylated compounds over free acids (Km of 43, 10,and 5 μM for caffeic acid, 5-hydroxyferulic acid, and 5-hydroxyconiferaldehyde, respectively). Parvathi et al. [10]

176 Overlapping Methylation by COMT and CCoAOMT

Page 8: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TABLE 4A comparison of in silico estimated dissociation constants of various putative substrates for COMT and CCoAOMT with the in

vitro experimental Michaelis constant (Km)

In silico estimateddissociation

In vitro experimental data as reported in literature

S. No. Substrate constant (Kd) (mM) Km (mM) Source References

COMT

1 Caffeic acid 2.32 0. 335 Medicago sativa Kota et al. [33]

0.043 Medicago sativa Zubieta et al. [5]

2 5-Hydroxyferulic acid 1.06 0.01 Medicago sativa Zubieta et al. [5]

3 5-Hydroxy coniferyl aldehyde 0.111 0.0018 Medicago sativa Parvathi et al. [10]

4 5-Hydroxyconiferyl alcohol 0.134 0.0075 Medicago sativa Zubieta et al. [5]

0.0038 Medicago sativa Parvathi et al. [10]

0.005 Medicago sativa Zubieta et al. [5]

5 Caffeoyl aldehyde 0.205 0.01 Medicago sativa Zubieta et al. [5]

0.0069 Medicago sativa Parvathi et al. [10]

6 Caffeoyl alcohol 0.114 0.013 Medicago sativa Zubieta et al. [5]

0.012 Medicago sativa Parvathi et al. [10]

7 S-Adenosyl-L-methionine 0.243 0.012 Medicago sativa Edwards and Dixon [12]

CCoAOMT

1 Caffeic acid 1.03 0.088 Ammi majus Lukacin et al. [30]

0.039 Linumusitatissimum

Day et al. [31]

2 5-Hydroxy-ferulic acid 0.553 0.068 Ammi majus Lukacin et al. [30]

3 Caffeoyl–CoA 6.9 × 10−8 0.0043 Medicago sativa Parvathi et al. [10]

0.020 Ammi majus Lukacin et al. [30]

4 5-Hydroxyferuoyl-CoA 2.1 × 10−20 0.0062 Medicago sativa Parvathi et al. [10]

5 5-Hydroxy-coniferaldehyde 0.0040 0.0173 Triticum estivum Ma and Xu [28]

6 Caffeoyl aldehyde 0.011 0.0437 Triticum estivum Ma and Xu [28]

in alfalfa COMT have also reported COMT-catalyzed methy-lation with a number of potential substrates of monolignolbiosynthesis pathway, with enzyme activities in the follow-ing order: caffeoyl-CoA > caffeoyl alcohol > 5-hydroxyferulicacid > caffeoyl aldehyde > 5-hydroxyconiferyl alcohol > 5-hydroxyferuloyl CoA > 5-hydroxyconiferaldehyde > caffeicacid. Similarly, COMTs from wheat and aspen have also beenshown to catalyze methylation of various free acids, alde-hydic, and alcoholic substrates in the following decreasingorder: 5-hydroxyconiferaldehyde > caffeoyl aldehyde > 5-hydroxyconiferyl alcohol > caffeoyl alcohol > 5-hydroxyferulic

acid > caffeic acid [28, 29]. Maury et al. [9], using recombi-nant COMT and CCoAOMT from tobacco, have studied sub-strate specificity of various metabolic intermediates of phenyl-propanoid pathway, namely caffeic acid, 5-hydroxyferulic acid,their CoA esters, and 5-hydroxyconiferyl alcohol. On the basis ofthe Vmax/Km values, the authors have suggested that both theseenzymes efficiently interacted with CoA ester substrates andcatalyzed the methylation of these substrates more efficientlythan those of free acids. In agreement with these reports, basedon the binding energies and dissociation constants of free acidsand thiol esters substrates, our in silico analyses also revealed

Biotechnology and Applied Biochemistry 177

Page 9: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

that COMT and CCoAOMT bind CoA ester substrates moreefficiently than those of free acids. CCoAOMTs from bishop’sweed and flax have been reported to methylate free acids andaldehydes along with thiol ester-based substrates such thatmethylation of thiol ester-based substrates was more efficientthan those of free acids and aldehydes [30, 31]. Furthermore,based on the in silico analyses on two isozymes of CCoAOMTfrom Leucaena leucocephala, Pagadala et al. [32] have alsoreported a better binding for caffeoyl-CoA than those of otherCoA substrates for both the isozymes. Thus, our findings are inagreement with the concept of metabolic grid and overlappingpathways in monolignol biosynthesis where methylation canbe done at various levels and by both COMT and CCoAOMT.Thus, based on our in silico data, we have ample evidence topropose that COMT and CCoAOMT may catalyze methylation ofa variety of intermediates of monolignol biosynthesis pathwaywith a comparable efficiency.

3.6.2. Regiospecificity of binding of various putativesubstrates to COMTRegiospecificity of binding of all 16 putative substrates toCOMT, at the active site of enzyme, was investigated andcompared using CDOCKER tool of DS. Results depicting variousinteracting residues are presented in Table 5. Hydrogen bondinteractions between enzyme active-site residues and variousputative substrates along with details of donor/acceptor atomsinvolved in these interactions, their bond lengths, and anglesare summarized in Table 6.

A comparative account of regiospecificity of binding ofvarious substrates at the active site of the COMT in a pairwisemanner involving substrates and products (and vice versa) forrespective reactions is presented below. Caffeic and ferulic acidswere found to interact with a total of 12 and 13 interactingresidues, respectively, out of which, except for MET180 (inthe case of caffeic acid) and LEU127 and ASN324 (in thecase of ferulic acid), all the remaining interacting aminoacid residues of COMT were found to be common. Analysisof hydrogen bonds and other noncovalent interactions amongthese residues of enzymes and various atoms/groups of putativesubstrates revealed that caffeic acid binding involved one eachof hydrogen bond and charge interaction, whereas ferulicacid binding involved only a charge interaction. Thus, HE2 ofHIS269 formed a hydrogen bond with O1 of 3-hydroxyl groupof caffeic acid. HIS323 formed charge interactions with oxygenof carboxylic group of caffeic as well as both ferulic acids.

A comparison of COMT residues, interacting with sinapicacid and 5-hydroxyferulic acid, revealed a total of 12 interactingresidues each; all of these are common. Binding of sinapic acidas well as 5-hydroxyferulic acid both involved one hydrogenbond interaction each. Thus, HE2 of HIS269 formed hydrogenbond interaction with O1 of methoxy group of sinapic acidand O3 of 5-hydroxyl group of 5-hydroxyferulic acid, whereasHIS323 formed charge interactions, one each, with oxygenof carboxyl group of sinapic acid and 5-hydroxyferulic acid,respectively.

Among coniferyl aldehyde and coniferyl alcohol, out of13 interacting residues in each case, 10 interacting residueswere found to be common. TRP266, CYS298, THR356 (in thecase of coniferyl aldehyde) and PHE176, HIS323, ASN324 (inthe case of coniferyl alcohol) were found to be uncommon.Both these substrates involved hydrogen bond interaction withASN131. OD1 of ASN131 formed one hydrogen bond with H23of 4-hydroxyl group of coniferyl aldehydes. On the other hand,HD22 and OD1 of ASN131 formed two hydrogen bonds, onewith O3 and another with H25 of hydroxyl group of propylenechain of coniferyl alcohol, respectively.

In the case of caffeoyl aldehyde and caffeoyl alcohol, out of11 and 13 interacting residues, respectively, 11 residues werefound to be common, except CYS298 and THR356 (present incaffeoyl alcohol). Analysis of hydrogen bonding revealed thatH19 of 3-hydroxyl group of caffeoyl aldehyde involved onehydrogen bond interaction with OD1 of ASN 131, whereas H20and H21 of 3-hydroxyl and 4-hydroxyl group of caffeoyl alcoholwere involved in hydrogen bond interactions with OD1 of ASN131 and SD of MET 130, respectively.

5-Hydroxy coniferyl aldehyde and 5-hydroxy coniferylalcohol each interacted with 13 residues, out of which 12residues were common, whereas ASN131 (in the case of5-hydroxy coniferyl aldehyde) and GLY179 (in the case of 5-hydroxy coniferyl alcohol) were uncommon. HE2 of HIS269formed hydrogen bonds with O3 of the 5-hydroxyl group of5-hydroxy coniferyl aldehyde, whereas HE2 of HIS269 andND1 of HIS183 formed two hydrogen bond interactions withO3 of the 5-hydroxyl group and H24 of the 4-hydroxyl group of5-hydroxy coniferyl alcohol, respectively.

Caffeoyl-CoA and feruloyl-CoA interacted with 31 and34 residues, respectively, out of which 20 residues werefound to be common, whereas THR187, ASP231, LEU232,VAL235, ILE267, TRP271, CYS298, HIS323, PHE354, ASN355,THR356 (in case of caffeoyl-CoA) and LEU32, PRO33, LEU36,ILE89, VAL111, ASN128, GLN132, TRP140, THR211, GLY212,ALA213, VAL214, LYS265, ILE316 (in the case of feruloyl-CoA)were found to be uncommon. Caffeoyl-CoA binding involvedeight hydrogen bond interactions, whereas that of feruloyl-CoAinvolved only two hydrogen bond interactions. HD1 of HIS315,HE2 of HIS183, and O of PHE354 formed three hydrogen bondswith O6, N27, and H96 of adenosyl moiety of caffeoyl-CoA,respectively, whereas HD22 of ASN131 formed two hydrogenbonds with O11 and O14 of the diphosphate moiety of caffeoyl-CoA, respectively. OD1 of ASP231 formed two hydrogen bondswith H94 and H95 of caffeoyl group, whereas OD2 of ASP231formed another hydrogen bond with H95 of caffeoyl groupof caffeoyl-CoA. HD22 of ASN131 and O of GLY209 formedhydrogen bonds with O18 of diphosphate moiety and H98 offeruloyl group of feruloyl-CoA, respectively.

A comparison of sinapoyl-CoA and 5-hydroxyferuloylCoA revealed 27 and 28 interacting residues, respec-tively, out of which 16 residues were found to becommon, whereas MET180, THR187, ASP206, VAL207,GLY209, VAL214, PHE263, MET264, LYS265, CYS298,

178 Overlapping Methylation by COMT and CCoAOMT

Page 10: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TABLE 5Regiospecificity of binding of various putative substrates (ligands) to COMT and CCoAOMT (target) indicating residues involved

in interaction

COMT CCoAOMT(interacting residues and their number in (interacting residues and their number in

Substrate parentheses) in parentheses)

Caffeic acid MET130, ASN131, LEU136, PHE176, MET180,HIS183, TRP266, HIS269, ILE316, ILE319,MET320, HIS323 (12)

GLY19, LYS21, ILE60, ASN190, ASN194,ARG206, ASP238 (7)

Ferulic acid LEU127, MET130, ASN131, LEU136, PHE176,HIS183, TRP266, HIS269, ILE316, ILE319,MET320, HIS323, ASN324 (13)

GLY19, LYS21, SER22, LEU23, ILE60, SER64,ASN194, ASP238 (8)

Sinapic acid LEU127, MET130, ASN131, LEU136, PHE176,HIS183, TRP266, HIS269, ILE316, ILE319,MET320, HIS323 (12)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,LEU34, ILE60, THR62, SER64, ALA65, ASP66,GLY237, ASP238 (14)

5-Hydroxy ferulicacid

LEU127, MET130, ASN131, LEU136, PHE176,HIS183,TRP266, HIS269, ILE316,ILE319,MET320, HIS323 (12)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,LEU34, ILE60, THR62, SER64, ALA65, ASP66,GLY237, ASP238 (14)

Coniferyl aldehyde LEU127, MET130, ASN131, LEU136, MET180,HIS183, TRP266, HIS269, CYS298, ILE316,ILE319, MET320, THR356 (13)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,ILE60, MET61, THR62, THR63, SER64,GLU67, ASP163, ASP189, ASN190, ASN194,GLY237, ASP238 (18)

Coniferyl alcohol LEU127, MET130, ASN131, LEU136, PHE176,MET180, HIS183, HIS269, ILE316, ILE319,MET320, HIS323, ASN324 (13)

VAL18, GLY19, LYS21, SER22, LEU23, ILE60,MET61, THR62, THR63, GLU67, ASP163,ASP189, ASN190, ASN194, GLY237, ASP238(16)

Caffeoyl aldehyde LEU127, MET130, ASN131, GLY179, MET180,HIS183, TRP266, HIS269, ILE316, ILE319,GLU329 (11)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,ILE60, MET61, THR62, THR63, SER64,GLU67, ASP189, ASN190, GLY237, ASP238(16)

Caffeoyl alcohol LEU127, MET130, ASN131, LEU136, MET180,HIS183, TRP266, HIS269, CYS298, ILE316,ILE319, THR356, GLU329 (13)

GLY19, HIS20, LYS21, SER22, ILE60, MET61,THR62, THR63, SER64, GLU67, ASP189,ASN190, GLY237, ASP238 (14)

5-Hydroxy coniferylaldehyde

LEU127, MET130, ASN131, LEU136, PHE176,MET180, HIS183, TRP266, HIS269, ILE316,ILE319, MET320, HIS323 (13)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,ILE60, THR62, SER64, ASN190, ASN194,GLY237, ASP238 (13)

5-Hydroxy coniferylalcohol

LEU127, MET130, LEU136, PHE176, GLY179,MET180, HIS183, TRP266, HIS269, ILE316,ILE319, MET320, HIS323 (13)

LYS21, SER22, LEU23, TYR30, ILE60, MET61,THR62, THR63, SER64, ASP66, ASP189,ASN194, GLY237, ASP238 (14)

Caffeoyl-CoA ILE124, LEU127, MET130, ASN131, LEU136,PHE163, PHE176, MET180, HIS183, THR187,GLY208, GLY210, ASP231, LEU232, VAL235,TRP266, ILE267, HIS269, ASP270, TRP271,CYS298, GLY312, HIS315, ILE316, ILE319,MET320, HIS323, ASN324, PHE354, ASN355,THR356 (31)

SER12, GLY13, ARG14, HIS15, GLN16, GLU17,VAL18, GLY19, HIS20, LYS21, SER22, LEU23,ASP27, ALA28, TYR30, GLN31, LEU34,PHE39, ILE60, MET61, THR62, THR63,SER64, ALA65, ASP66, GLY87, VAL88,TYR89, ASP111, ILE112, ASP163, ALA164,ASP165, LYS166, ASN190, TRP193, ASN194,TYR208, TYR212, GLU237, ASP238 (41)

(Continued)

Biotechnology and Applied Biochemistry 179

Page 11: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

TABLE 5Continued

COMT CCoAOMT(interacting residues and their number in (interacting residues and their number in

Substrate parentheses) in parentheses)

Feruloyl-CoA LEU32, PRO33, LEU36, ILE89, VAL111, ILE124,LEU127, ASN128, MET130, ASN131,GLN132, LEU136, TRP140, PHE163, PHE176,MET180, HIS183, ASP206, GLY208, GLY210,THR211, GLY212, ALA213, VAL214, LYS265,TRP266, HIS269, ASP270, GLY312, HIS315,ILE316, ILE319, MET320, ASN324 (34)

GLU11, SER12, GLY13, ARG14, HIS15, GLN16,GLU17, VAL18, GLY19, HIS20, LYS21, SER22,LEU23, ASP27, ALA28, TYR30, GLN31,LEU34, PHE39, ILE60, MET61, THR62, THR63,SER64, ALA65, ASP66, GLY87, VAL88,TYR89, ASP111, ILE112, ASP163, ALA164,ASP165, LYS166, ASN190, TRP193, ASN194,TYR208, TYR212, GLY237, ASP238 (42)

Sinapoyl-CoA MET130, ASN131, PHE163, PHE176, MET180,HIS183, THR187, ASP206, VAL207, GLY209,VAL214, PHE263, LYS265, MET264, ILE267,HIS269, TRP266, ASP270, CYS298, LYS311,GLY312, HIS315, ILE316, ILE319, MET320,HIS323, ASN324 (27)

GLU11, SER12, GLY13, HIS15, GLN16, GLU17,VAL18, GLY19, HIS20, LYS21, SER22, LEU23,ASP27, ALA28, TYR30, GLN31, LEU34,PHE39, TRP58, ILE60, MET61, THR62, THR63,SER64, ALA65, GLY87, VAL88, TYR89,ASP111, ASP163, ALA164, ASP165, LYS166,ASN190, TRP193, ASN194, TYR208, TYR212,GLY237, ASP238 (40)

5-HydroxyferuloylCoA

LEU36, VAL111, TYR114, LEU115, LEU127,ASN128, MET130, ASN131, ALA162,PHE163, HIS166, PHE176, HIS183, GLY208,GLY210, ASP231, ILE267, TRP266, HIS269,ASP270, TRP272, GLY312, HIS315, ILE316,ILE319, MET320, HIS323, ASN324 (28)

SER12, GLY13, HIS15, GLN16, GLU17, VAL18,GLY19, HIS20, LYS21, SER22, ASP27, ALA28,TYR30, GLN31, LEU34, PHE39, ILE60,MET61, THR62, SER64, ALA65, ILE160,ASP163, LYS166, ASP189, ASN190, TRP193,ASN194, ARG206, TYR208, VAL209, TYR212,GLN237, ASP238 (34)

S-Adenosylmethionine

LEU127, MET130, ASN131, LEU136, ALA162,PHE163, HIS166, PHE176, MET180, HIS183,TRP266, HIS269, GLU297, CYS298, ILE316,ILE319, MET320, HIS323, GLU329, THR356(20)

VAL18, GLY19,HIS20, LYS21, SER22, LEU23,TYR30, LEU34, ASN59, ILE60, MET61,THR62, THR63, SER64, GLU67, ASP163,ASP189, ASN190, ASN194, ASP238 (20)

S-Adenosylhomocystine

LEU127, MET130, ASN131, LEU136, ALA162,PHE163, PHE176, MET180, HIS183, TRP266,HIS269, ASP270, GLY312, VAL313, HIS315,ILE316, ILE319, MET320, HIS323, ASN324(20)

VAL18, GLY19, LYS21, SER22, LEU23, TYR30,LEU34, ILE60, MET61, THR62, THR63,SER64, ALA65, GLU67, ASP189, ASN190,ASN194, GLY237, ASP238 (19)

and LYS311 (in the case of sinapoyl-CoA) and LEU36, VAL111,TYR114, LEU115, LEU127, ASN128, ALA162, HIS166, GLY208,GLY210, ASP231, and TRP272 (in the case of 5-hydroxyferuloylCoA) were found to be uncommon. Sinapoyl-CoA binding to theenzyme involved a total of eight hydrogen bond interactions.Thus, HD 1 of HIS315 formed a hydrogen bond with N26 ofadenosyl moiety, HD22 of ASN131 formed a hydrogen bondwith O15 of diphosphate moiety, HE2 of HIS269 formed twohydrogen bonds with O16 and O19 of pantothenate moiety,HG of CYS298 formed two hydrogen bonds with O9 and H100of 3′phosphate of ribose moiety, HD21 of ASN355, HE2 ofHIS183 formed hydrogen bonds with O9 and O12, respec-tively, of 3′phosphate of ribose moiety of sinapoyl-CoA. On the

other hand, 5-hydroxyferuloyl CoA binding involved only threehydrogen bond interactions, HD22 of ASN128 formed a hydro-gen bond with O21 of feruloyl moiety, HE2 of HIS269 formedtwo hydrogen bonds with O9 and O12 of 3′phosphate of ri-bose moiety and one charge interaction with the same moiety,whereas HIS323 formed charge interaction with oxygen ofdiphosphate moiety of 5-hydroxyferuloyl CoA.

In the case of SAM (methyl group donor) and SAH, out of 20interacting residues in each case 15 residues were found to becommon, whereas HIS166, GLU297, CYS298, GLU329, THR356(in case of SAM) and ASP270, GLY312, VAL313, HIS315,ASN324 (in case of SAH) were uncommon. SAM binding to theenzyme involved a total of seven hydrogen bond interactions.

180 Overlapping Methylation by COMT and CCoAOMT

Page 12: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TAB

LE

6H

yd

rog

en

bo

nd

inte

racti

on

sb

etw

een

acti

ve-s

ite

resid

ues

of

CO

MT

an

dC

Co

AO

MT

an

dvari

ou

sp

uta

tive

su

bstr

ate

salo

ng

wit

h

deta

ils

of

do

no

r/accep

tor

ato

ms

invo

lved

inth

ese

inte

racti

on

san

dth

eir

bo

nd

len

gth

san

dan

gle

sas

decip

here

du

sin

gC

DO

CK

ER

CO

MT

CC

oAO

MT

Dis

tanc

eD

ista

nce

betw

een

dono

rbe

twee

ndo

nor

Acc

epto

ran

dac

cept

orB

ond

Acc

epto

ran

dac

cept

orB

ond

S.N

o.S

ubst

rate

Don

orat

omat

omat

om(A

)an

gle

(◦ )D

onor

atom

atom

atom

(A)

angl

e(◦ )

1C

affe

icac

id(C

A)

HIS

269:

HE

2C

A:O

12.

0303

914

4.07

1LY

S21

:HZ

2C

A:O

32.

2824

913

9.03

8

––

––

LYS

21:H

Z2

CA

:O4

2.27

868

113.

555

2Fe

rulic

acid

(FA

)–

––

–LY

S21

:HZ

2FA

:O4

1.89

828

156.

613

3S

inap

icac

id(S

A)

HIS

269:

HE

2S

A:O

12.

2226

713

2.24

3A

RG

50:H

ES

A:O

42.

2688

616

0.50

4

––

––

SA

:H27

LYS

21:O

2.07

945

143.

365

45-

Hyd

roxy

feru

licac

id(5

HFA

)H

IS26

9:H

E2

5HFA

:O3

2.36

885

130.

712

AR

G50

:HE

5HFA

:O5

2.43

575

159.

591

––

––

5HFA

:H23

LYS

21:O

2.34

3212

1.36

5

5C

on

ifer

ylal

deh

yde

(Co

Ald

)C

oA

ld:H

23A

SN

131:

OD

12.

0178

149.

279

SE

R22

:HG

Co

Ald

:O2

2.18

246

143.

392

––

––

AS

N19

0:H

D22

Co

Ald

:O3

2.44

772

108.

55

6C

on

ifer

ylal

coh

ol

(Co

Alc

)A

SN

131:

HD

22C

oA

lc:O

32.

0910

614

2.78

5T

HR

63:H

G1

Co

Alc

:O3

2.44

379

140.

592

Co

Alc

:H25

AS

N13

1:O

D1

2.01

225

146.

635

Co

Alc

:H24

LYS

21:O

2.03

808

160.

386

––

––

Co

Alc

:H25

ME

T61

:O1.

9487

716

6.05

2

7C

affe

oyl

ald

ehyd

e(C

aAld

)C

aAld

:H19

AS

N13

1:O

D1

1.94

807

157.

429

SE

R22

:HG

CaA

ld:O

22.

2517

613

3.01

6

––

––

AS

N19

0:H

D22

CaA

ld:O

32.

0620

415

2.54

7

––

––

CaA

ld:H

23G

LY19

:O2.

0698

814

1.93

2

––

––

CaA

ld:H

19G

LY19

:O2.

0118

314

6.57

3

8C

affe

oyl

alco

ho

l(C

aAlc

)C

aAlc

:H20

AS

N13

1:O

D1

1.97

487

150.

126

AS

N19

0:H

D22

CaA

lc:O

12.

3631

916

1.46

2

CaA

lc:H

21M

ET

130:

SD

2.37

689

139.

039

CaA

lc:H

20M

ET

61:O

2.04

222

170.

332

––

––

CaA

lc:H

21A

SP

189:

OD

22.

1638

112

7.90

9

––

––

CaA

lc:H

22LY

S21

:O1.

9952

413

7.26

1

(Con

tinue

d)

Biotechnology and Applied Biochemistry 181

Page 13: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

TAB

LE

6C

on

tin

ued

CO

MT

CC

oAO

MT

Dis

tanc

eD

ista

nce

betw

een

dono

rbe

twee

ndo

nor

Acc

epto

ran

dac

cept

orB

ond

Acc

epto

ran

dac

cept

orB

ond

S.N

o.S

ubst

rate

Don

orat

omat

omat

om(A

)an

gle

(◦ )D

onor

atom

atom

atom

(A)

angl

e(◦ )

95-

Hyd

roxy

con

ifer

ylal

deh

yde

(5H

Co

Ald

)H

IS26

9:H

E2

5HC

oA

ld:O

31.

9935

132.

418

SE

R22

:HG

5HC

oA

ld:O

32.

1671

214

1.87

––

––

5HC

oA

ld:H

23G

LY19

:O2.

4258

213

0.33

9

105-

Hyd

roxy

con

ifer

ylal

coh

ol(

5HC

oA

lc)

HIS

269:

HE

25H

Co

Alc

:O3

2.26

441

111.

405

SE

R22

:HG

5HC

oA

lc:O

41.

8270

414

7.62

1

––

––

AS

N19

0:H

D22

5HC

oA

lc:O

12.

1123

715

4.73

7

5HC

oA

lc:H

24H

IS18

3:N

D1

2.43

104

131.

578

5HC

oA

lc:H

26G

LY23

7:O

2.41

951

152.

972

11C

affe

oyl

-Co

A(C

Co

A)

AS

N13

1:H

D22

CC

oA

:O11

2.46

727

127.

404

AS

N19

0:H

D21

CC

oA

:O18

2.27

188

121.

956

AS

N13

1:H

D22

CC

oA

:O14

2.04

916

139.

031

CC

oA

:H91

TY

R21

2:O

H2.

3584

210

4.85

6

HIS

183:

HE

2C

Co

A:N

272.

2602

814

0.59

9C

Co

A:H

99G

LY19

:O1.

9384

114

9.64

2

HIS

315:

HD

1C

Co

A:O

62.

0697

415

2.63

3–

––

CC

oA

:H94

AS

P23

1:O

D1

2.19

745

147.

815

––

––

CC

oA

:H95

AS

P23

1:O

D1

2.06

679

145.

028

––

––

CC

oA

:H95

AS

P23

1:O

D2

2.02

796

136.

368

––

––

CC

oA

:H96

PH

E35

4:O

2.32

5111

0.37

5–

––

12Fe

rulo

yl-C

oA

(FC

oA

)A

SN

131:

HD

22FC

oA

:O18

2.25

115

120.

051

GLY

13:H

NFC

oA

:O23

2.44

148

121.

878

FCo

A:H

98G

LY20

9:O

1.96

725

129.

722

LYS

166:

HZ

1FC

oA

:O7

1.79

045

144.

483

––

––

AS

N19

0:H

D22

FCo

A:O

82.

2756

411

1.93

5

––

––

AS

N19

0:H

D22

FCo

A:O

172.

1868

213

0.03

3

––

––

FCo

A:H

97LY

S21

:O1.

9344

915

2.15

––

––

FCo

A:H

100

AS

P11

1:O

D1

2.46

7717

6.96

6

––

––

FCo

A:H

100

AS

P11

1:O

D2

2.15

927

123.

127

––

––

FCo

A:H

102

TY

R30

:OH

2.11

178

125.

813

(Con

tinue

d)

182 Overlapping Methylation by COMT and CCoAOMT

Page 14: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

TAB

LE

6C

on

tin

ued

CO

MT

CC

oAO

MT

Dis

tanc

eD

ista

nce

betw

een

dono

rbe

twee

ndo

nor

Acc

epto

ran

dac

cept

orB

ond

Acc

epto

ran

dac

cept

orB

ond

S.N

o.S

ubst

rate

Don

orat

omat

omat

om(A

)an

gle

(◦ )D

onor

atom

atom

atom

(A)

angl

e(◦ )

13S

inap

oyl

-Co

A(S

Co

A)

AS

N13

1:H

D22

SC

oA

:O15

2.42

749

124.

597

ALA

65:H

NS

Co

A:O

202.

3890

415

5.40

7

HIS

183:

HE

2S

Co

A:O

121.

8855

215

7.39

LYS

166:

HZ

1S

Co

A:O

52.

2541

713

1.02

2

HIS

269:

HE

2S

Co

A:O

162.

1426

213

8.40

1A

SN

190:

HD

22S

Co

A:O

112.

1432

614

4.73

2

HIS

269:

HE

2S

Co

A:O

192.

1949

413

1.90

5A

SN

190:

HD

22S

Co

A:O

182.

2799

112

3.07

CY

S29

8:H

GS

Co

A:O

92.

2983

97.2

247

SC

oA

:H69

ME

T61

:SD

2.33

597

139.

905

HIS

315:

HD

1S

Co

A:N

262.

2483

514

7.46

9S

Co

A:H

72LY

S21

:O1.

9713

213

0.63

8

AS

N35

5:H

D21

SC

oA

:O9

2.00

787

142.

985

SC

oA

:H73

ILE

60:O

1.86

732

143.

623

SC

oA

:H10

0C

YS

298:

SG

2.40

703

138.

429

––

––

145-

Hyd

roxy

feru

loyl

Co

A(5

HFC

oA

)A

SN

128:

HD

225H

FCo

A:O

212.

4595

411

2.06

2LY

S16

6:H

Z1

5HFC

oA

:O14

2.49

223

87.9

172

HIS

269:

HE

25H

FCo

A:O

92.

1154

513

4.21

7LY

S16

6:H

Z3

5HFC

oA

:O14

1.99

856

118.

879

HIS

269:

HE

25H

FCo

A:O

122.

2159

212

1.10

1A

SP

189:

HD

25H

FCo

A:O

171.

8724

512

1.21

5

––

––

AS

N19

0:H

D22

5HFC

oA

:O13

1.98

778

167.

183

––

––

5HFC

oA

:H97

ME

T61

:O2.

2748

813

0.91

7

––

––

5HFC

oA

:H10

2IL

E60

:O2.

4614

313

0.48

15S

-Ad

eno

syl

met

hio

nin

e(S

AM

)T

RP

266:

HE

1S

AM

:O5

2.32

881

121.

371

ALA

65:H

NS

AM

:O6

2.20

271

169.

461

TR

P26

6:H

E1

SA

M:O

62.

1458

312

2.29

2A

SN

190:

HD

22S

AM

:N8

2.29

782

132.

763

CY

S29

8:H

NS

AM

:O6

2.34

915

102.

213

SA

M:H

46T

YR

30:O

H2.

1083

616

9.65

3

SA

M:H

45A

SN

131:

OD

12.

2314

215

4.08

2S

AM

:H47

GLY

19:O

2.00

229

135.

148

SA

M:H

46G

LU32

9:O

E1

2.29

379

134.

092

SA

M:H

50A

SP

189:

OD

22.

4498

211

9.99

6

SA

M:H

47G

LU29

7:O

E1

1.85

212

139.

175

––

––

SA

M:H

48G

LU32

9:O

E2

1.95

962

128.

671

––

––

(Con

tinue

d)

Biotechnology and Applied Biochemistry 183

Page 15: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

TAB

LE

6C

on

tin

ued

CO

MT

CC

oAO

MT

Dis

tanc

eD

ista

nce

betw

een

dono

rbe

twee

ndo

nor

Acc

epto

ran

dac

cept

orB

ond

Acc

epto

ran

dac

cept

orB

ond

S.N

o.S

ubst

rate

Don

orat

omat

omat

om(A

)an

gle

(◦ )D

onor

atom

atom

atom

(A)

angl

e(◦ )

16S

-Ad

eno

syl

ho

mo

cyst

ein

e(S

AH

)H

IS26

9:H

E2

SA

H:O

42.

2724

812

4.05

4A

SP

238:

HN

SA

H:O

52.

2401

712

4.18

6

HIS

315:

HD

1S

AH

:O5

2.19

237

151.

429

SA

H:H

43T

YR

30:O

H1.

9549

716

6.20

5

SA

H:H

44G

LY31

2:O

1.79

083

134.

25S

AH

:H45

TH

R62

:O1.

8322

113

6.75

3

––

––

SA

H:H

46M

ET

61:O

1.87

344

124.

169

Thus, H45 of the adenosyl moiety formed a hydrogen bond withOD1 of ASN131, HE1 of TRP266 formed hydrogen bonds withthe O5 and O6 of the methionine moiety, H46 and H48 of theamino group formed a hydrogen bond with OE1 and OE2 ofGLU329, respectively, whereas H47 of the amino group formeda hydrogen bond with OE1 of GLU297. HN of CYS298 formeda hydrogen bond with O6 of methionine moiety and HIS269formed charged interactions with sulfur atom and nitrogen ofthe amino group. GLU297 and GLU329 also formed chargedinteractions with the nitrogen of the amino group. TRP266formed π–cation interaction with the sulfur atom, whereasHIS323 formed π–π interaction with the two rings of adeninemoiety. HIS269 formed π–cation interaction with the nitrogenof NH3 group. SAH binding involved only three hydrogen bondinteractions. Thus, HE2 of HIS269 formed a hydrogen bondwith the O4 of the adenosyl moiety, whereas HD1 of HIS315with O5 of methionine moiety. H44 of the amino group formeda hydrogen bond with O of GLY312.

Among all the substrates analyzed, interacting residuesMET130, HIS183, HIS269, ILE316, and ILE319 were found tobe common. Interacting residues for our best docked substrate,among all analyzed substrates, namely sinapoyl-CoA withCOMT, as a representative case are shown in Fig. 5.

On the basis of a crystallographic study of alfalfa COMT,Zubieta et al. [5] have reported that the hydroxyl group of hy-drophobic residues MET130, PHE176, MET180, and MET320interacted with SAM. Furthermore, the hydrophobic bindingpocket containing LEU136, PHE172, PHE176, and ALA162was also reported to interact well with methoxy groups ofboth ferulic acid and 5-hydroxyconiferaldehyde, suggestingthat these van der Waal interactions were responsible forthe kinetic preferences of COMT toward the 3-methoxy-4,5-dihydroxyl-substituted substrates over the 3,4-dihydroxyl-substituted substrates. The authors have also reported thatthe residues HIS183, ASN131, MET180, MET130, ILE316, andILE319 were interacted with the propanoid tail of substratesand this relative hydrophobicity of the binding pocket is sug-gestive of the selectivity for neutral aldehydes and alcoholsover negatively charged carboxylate groups. Their findingsare in agreement with our docking analysis data where theresidues that were involved in the binding (interaction) offree acids, alcohols, aldehydes, SAM, and SAH are commonand similar to what Zubieta et al. [5] have reported. Thus,the residues that were common between our analysis andthose reported by Zubieta et al. [5] are MET130, ASN131,LEU136, PHE176, MET180, HIS183, TRP266, HIS269, ILE316,ILE319, MET320, and HIS323. Furthermore, Gordon et al.[34], based on the site-directed mutagenesis of ryegrass COMT,have reported that HIS plays a crucial role in COMT catalysisas the substitution of HIS resulted in the complete loss of en-zyme activity. From our study also, HIS looks to be a significantresidue as it is common in binding/interaction of all the putativesubstrates.

184 Overlapping Methylation by COMT and CCoAOMT

Page 16: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

FIG. 5Two-dimensional (2D) view of the dockedsinapoyl-CoA with COMT showing interactingresidues involved in sinapoyl-CoA binding.

3.6.3. Regiospecificity of binding of various putativesubstrates to CCoAOMTRegiospecificity of binding of all 16 putative substrates toCCoAOMT, at the active site of the enzyme, was investigatedand compared using CDOCKER tool of DS. Results are presentedin Table 5. Hydrogen bond interactions between enzyme active-site residues and various putative substrates along with detailsof donor/acceptor atoms involved in these interactions, theirbond lengths, and angles are summarized in Table 6.

Data on regiospecificity of binding of various putativesubstrates to CCoAOMT revealed that caffeic and ferulic acidsbinding to CCoAOMT involved a total of seven and eightresidues, respectively, out of which all the residues, except forASN190, ARG206 (in case of caffeic acid) and SER22, LEU23,SER64 (in case of ferulic acid), were common. Caffeic acidbinding involved two hydrogen bonds, whereas that of ferulicacid involved one hydrogen bond. Thus, HZ2 of LYS21 formedtwo hydrogen bonds with O3 and O4 of the carboxyl groupof caffeic acid; in addition, LYS21 and ARG206 also formedcharged interactions with the oxygen of the carboxylic group ofcaffeic acid. In the case of ferulic acid, HZ2 of LYS21 formed ahydrogen bond with the O4 of the carboxyl group and a chargedinteraction with oxygen of the carboxyl group of ferulic acid.

A comparison of interacting residues for sinapic acidand 5-hydroxyferulic acid with CCoAOMT revealed that all14 interacting residues were common. Both these substratesinvolved two H-bond interactions each, involving LYS21 andARG50. Thus, H27 and O4 of the 4-hydroxyl group of sinapic

acid were involved in the hydrogen bond interaction, oneeach with O of LYS21 and HE of ARG50, respectively, whereasin the case of 5-hydroxyferulic acid O of LYS21 and HE ofARG50 formed hydrogen bonds, one each with the H23 of the4-hydroxyl group and O5 of 5-hydroxyferulic acid, respectively.

Among coniferyl aldehyde and coniferyl alcohol out of18 and 16 interacting residues, respectively, 16 interactingresidues were found to be common, with TYR30 and SER64being unique to coniferyl aldehyde. Coniferyl aldehyde bindinginvolved two hydrogen bond interactions, whereas that ofconiferyl alcohol involved three hydrogen bond interactions.Thus, HG of SER22 formed a hydrogen bond with O2 of the 4-hydroxyl group, whereas HD22 of ASN190 formed a hydrogenbond with the O3 of the carboxyl group of coniferyl aldehyde.H24 of the 4-hydroxyl group, H25 and O3 of the hydroxyl groupof propylene chain of coniferyl alcohol formed hydrogen bondsone each, with O of LYS21, O of MET61, and HG1 of THR63,respectively.

In the case of caffeoyl aldehyde and caffeoyl alcohol, 16and 14 interacting residues, respectively, were observed, outof which 13 residues were common, whereas VAL18, LEU23,TYR30 (in the case of caffeoyl aldehyde), and HIS20 (in thecase of caffeoyl alcohol) were uncommon residues. Caffeoylaldehyde and caffeoyl alcohol both involved four hydrogenbond interactions each. Thus, HG of SER22 formed a hydrogenbond with O2 of the 4-hydroxyl group, whereas O of GLY19formed two hydrogen bonds with H23 and H19 of the 3-hydroxylgroup and HD22 of ASN190 formed a hydrogen bond with theO3 of the aldehydic group of caffeoyl aldehyde, respectively.In caffeoyl alcohol, H20 and H21 of 3- and 4-hydroxyl groupsformed hydrogen bonds with O of MET61 and OD2 of ASP189,respectively, whereas HD22 of ASN190 formed one hydrogen

Biotechnology and Applied Biochemistry 185

Page 17: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

bond with the O1 of the 3-hydroxyl group and H22 of thehydroxyl group of propylene chain of caffeoyl alcohol formed ahydrogen bond with O of LYS21.

Interacting residues in the case of 5-hydroxy coniferylaldehyde and 5-hydroxy coniferyl alcohol were 13 and 14,respectively, out of which 10 residues were common withVAL18, GLY19, ASN190 (in the case of 5-hydroxy coniferylaldehyde) and THR62, THR63, ASP66, ASP189 (in the caseof 5-hydroxy coniferyl alcohol) were uncommon. 5-Hydroxyconiferyl aldehyde involved two hydrogen bond interactions,whereas 5-hydroxy coniferyl alcohol involved three hydrogenbond interactions. Thus, HG of SER22 and O of GLY19 formedone hydrogen bond each with O3 of the 5-hydroxyl group andH23 of the 4-hydroxyl group of 5-hydroxy coniferyl aldehyde,respectively. In case of 5-hydroxy coniferyl alcohol, HD22 ofASN190 and HG of SER22 formed a hydrogen bond, one each,with the O1 of the 3-methoxy group and O4 of the hydroxylgroup of propyl chain of 5-hydroxy coniferyl alcohol, whereasH26 of the alcoholic group formed a hydrogen bond with O ofGLY237.

In the case of caffeoyl-CoA and feruloyl-CoA, there were atotal of 41 and 42 interacting residues, respectively, out of which41 interacting residues were common, with GLU11 (in feruloyl-CoA) as uncommon. Binding of caffeoyl-CoA to the enzymeinvolved three hydrogen bond interactions, whereas that offeruloyl-CoA involved eight hydrogen bond interactions. Thus,in caffeoyl-CoA, H99 of the NH group of mercaptoethylamine ofCoA moiety formed a hydrogen bond with O of GLY19. HD21 ofASN190 formed a hydrogen bond with O18 of the diphosphatemoiety of caffeoyl-CoA, H91 of adenosyl moiety of caffeoyl-CoAformed a hydrogen bond with OH of TYR212. LYS166 formedthe π–cation interaction with the adenine ring of CoA. In thecase of feruloyl-CoA, H97 of OH group of mercaptoethylamineof CoA moiety formed a hydrogen bond with O of LYS21, HN ofGLY13 formed a hydrogen bond with O23 of 4-hydroxyl group offeruloyl moiety, HD22 of ASN190 formed hydrogen bonds, oneeach, with the O8 and O17 of the diphosphate moiety, HZ1 ofLYS166 formed a hydrogen bond with the O7 of the adenosylmoiety, H100 of the amino group of adenosyl moiety formed twohydrogen bonds, one each, with OD1 and OD2 of ASP111. H102of NH group of mercaptoethylamine of CoA moiety formed ahydrogen bond with OH of TYR 30. ASP238 formed a chargedinteraction with oxygen of the diphosphate group of CoA moiety.

Comparing sinapoyl-CoA and 5-hydroxyferuloyl CoA re-vealed 40 and 34 interacting residues, respectively, out ofwhich 30 residues were common with GLU11, LEU23, TRP58,THR63, GLY87, VAL88, TYR89, ASP111, ALA164, ASP165 (inthe case of sinapoyl-CoA) and ILE60, ASP189, ARG206, VAL209(in the case 5-hydroxyferuloyl-CoA) as uncommon residues.Sinapoyl-CoA binding involved seven hydrogen bond inter-actions, whereas that of 5-hydroxyferuloyl CoA involved sixhydrogen bond interactions. Thus, in sinapoyl-CoA, H69 of the3′phosphate group of phosphoadenosine diphosphate moietyformed a hydrogen bond with SD of MET61, H73 of the OHgroup of diphosphate moiety formed a hydrogen bond with O

of ILE60, whereas H72 of the OH group of pentothenate moietyformed a hydrogen bond with O of LYS21, HN of ALA65 formeda hydrogen bond with O20 of β-mercaptoethylamine moiety,HD22 of ASN190 formed two hydrogen bonds, one each, withO11 and O18 of diphosphate moiety and HZ1 of LYS166 formeda hydrogen bond with O5 of adenosyl moiety of sinapoyl-CoA.In 5-hydroxyferuloyl CoA, H97 of OH group, and H102 of NHgroup of pantothenate moiety formed one hydrogen bond eachwith O of MET61 and O of ILE60, HD2 of ASP189, and HD22 ofASN190 formed one hydrogen bond each with O17 and O13 ofdiphosphate moiety, HZ1 and HZ3 of LYS166 formed two hydro-gen bonds with O14 of diphosphate moiety of 5-hydroxyferuloylCoA, respectively. ASP163 and LYS166 formed four chargedinteraction, two each, with oxygen of diphosphate moiety.

A comparison of SAM and SAH binding involved 20 and 19interacting residues, respectively, out of which 17 interactingresidues were common, whereas HIS20, ASN59, ASP163 (inSAM) and ALA65, GLY237 (in SAH) were uncommon. SAMbinding involved five hydrogen bond interactions, whereasthat of SAH involved four hydrogen bond interactions. Thus,in SAM, H46, H47 of the amino group of methionine moietyformed hydrogen bonds one each, with OH of TYR30 and Oof GLY19, respectively. H50 of the amino group of adenosylmoiety formed a hydrogen bond with OD2 of ASP189, HN ofALA65, and HD22 of ASN190 formed a hydrogen bond withN8 of the adenine moiety. ASP238 formed π–sigma interactionwith the adenine moiety of SAM. In the case of SAH, H43 of theamino group of adenine moiety formed a hydrogen bond withOH of TYR30, whereas H45 and H46 of the amino group ofhomocysteine moiety formed hydrogen bonds with O of THR62and O of MET61, respectively. ASP238 formed a hydrogenbond with O5 of homocysteine moiety and was also involved incharge interaction with oxygen of homocysteine moiety.

A comparison of interacting residues of CCoAOMT with allthe 16 putative substrates revealed that GLY19, LYS21, SER22,LEU23, ILE60, ASN190, ASN194, and ASP238 are the residuesthat were common for almost all the substrates. Interactingresidues for our best docked substrate among all analyzed,namely, sinapoyl-CoA with CCoAOMT as a representative case,are shown in Fig. 6.

Our in silico findings regarding residues involved in inter-action with these substrates, are in agreement with those re-ported from the in vitro experimental studies. Thus, Ferrer et al.[14], while studying the interaction of various CoA substrates,namely, feruloyl-CoA, sinapoyl-CoA, and 5-hydroxyferuloylCoA in the crystal structures of alfalfa CCoAOMT, have re-ported that residues LYS21, MET61, ASP163, ASN190, TRP193,ARG206, TYR208, and TYR212 are involved in the formation ofa common substrate-binding pocket. In our in silico analyses,these residues are found to be involved in the formation of asimilar common substrate-binding pocket at the active site ofCCoAOMT. Similarly, Hoffmann et al. [13], working with tobaccoCCoAOMT, have reported involvement of an ASN residue, whichis conserved in all CCoAOMTs, having a crucial role in CCoAOMTcatalysis. Furthermore, the authors, by doing site-directed

186 Overlapping Methylation by COMT and CCoAOMT

Page 18: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

FIG. 62D view of the docked sinapoyl-CoA withCCoAOMT showing interacting residues involvedin sinapoyl-CoA binding.

mutagenesis, have shown that substitution of ARG220 witha THR resulted in the total loss of enzyme activity, whereaschanges of ASP58 to ALA and GLN61 to SER were shownto decrease affinity toward CoA, thereby suggesting that theARG220 might be involved in the electrostatic interaction withthe coenzyme A moiety of the thiol ester substrate [13]. Alsoin the present in silico study, ARG, ASP, and GLN residueshave been found to be common for almost all CoA substrates,suggesting their crucial roles. On the basis of the in silico anal-yses on isozymes of CCoAOMT from Leucaena leucocephala,Pagadala et al. [32] have reported the involvement of MET,THR, VAL, GLU, GLY, SER, ASP, THR, ASN, and ARG residues atthe active site of the isozymes, which are suggested to maintainthe positive charge at the active sites for the efficient binding ofthe most favored substrate, that is, negatively charged caffeoyl-CoA. A similar observation is also evident from our in silicoanalyses. Results of the regiospecificity of binding of varioussubstrates to the enzymes and analysis of noncovalent interac-tions (especially hydrogen bonds) suggest that binding of thepair of putative substrates (substrates/products) is mediated bychange in the conformation in the enzymes (both COMT andCCoAOMT). A similar observation on enzyme conformationalchanges in binding of substrates and products has also beensuggested in the case of Leucaena leucocephala CCoAOMT [32].

3.6.4. Comparison of regiospecificity of bindingof various putative substrates between COMTand CCoAOMTA comparison of regiospecificity of binding of various putativesubstrates to COMT and CCoAOMT revealed that the majorityof residues involved in binding of these substrates (interactingresidues) at the active site of COMT and CCoAOMT are common,suggesting a similar mode of binding of these substrates to their

enzymes (Table 5). Thus, a comparison of interacting residuesbetween COMT and CCoAOMT revealed that ILE is presentin all the substrates. LEU is present in 13 substrates; it isnot common only in the case of caffeic acid, caffeoyl alcohol,and sinapoyl-CoA. ASN is common in 13 substrates; it is notpresent in sinapic acid, 5-hydroxy ferulic acid, and 5-hydroxyconiferyl aldehyde. MET is common in 11 substrates; it is notcommon in all the free acids and 5-hydroxy coniferyl aldehyde.HIS is common in six substrates; it is not present in all thefree acids, alcohols, aldehydes, and SAH. THR is commononly in five substrates, that is, coniferyl aldehyde, caffeoylalcohol, SAM, caffeoyl-CoA, and sinapoyl-CoA. GLY is present ineight substrates, namely 5-hydroxy coniferyl alcohol, caffeoylalcohol, SAH, and all the CoA esters. ASP is common in all theCoA esters and in SAH. TRP, PHE, and VAL are common in onlythe CoA esters. GLU is common in three substrates, caffeoylalcohol, caffeoyl adehyde, and SAM. LYS is common only inferuloyl-CoA and sinapoyl-CoA. GLN is common only in thecase of feruloyl-CoA. Thus, overall it can be concluded that thebinding cavity of both COMT and CCoAOMT has 14 commonresidues, namely ILE, LEU, ASN, MET, HIS, THR, GLY, ASP, TRP,PHE, VAL, GLU, LYS, and GLN.

Data based on in vitro wet laboratory studies have shownthat the SAM binding sites of various O-methyltransferases areconserved. Thus, the methionine part of SAM was reported tointeract with THR and LYS, whereas that of the ribose part ofSAM with ASP in the case of rice COMT [5,13,35]. Similarly, inour study THR was also found to interact with the methioninepart of SAM in the case of COMT, whereas that of CCoAOMTinteracted with both THR and LYS, and the ribose part of SAMinteracted with ASP in the case of CCoAOMT, whereas no suchinteraction was found in the case of COMT.

A comparison of conformation of binding of various sub-strates to COMT and CCoAOMT revealed that the majority ofsubstrates bound to COMT and CCoAOMT in a similar con-formation except for the thiol esters (CoA-based substrates).Thus, a comparison of the data presented in Figs. 5 and 6

Biotechnology and Applied Biochemistry 187

Page 19: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

Biotechnology andApplied Biochemistry

clearly indicated that sinapoyl-CoA binds to COMT in a dif-ferent conformation than that of CCoAOMT. On the basis ofthe energy and affinity constant values, it may be suggestedthat binding of sinapoyl-CoA to CCoAOMT is more favoredthan that of COMT. The significance of these data on variousconformations of substrate binding to the enzymes is difficult toexplain at present but a comparison of data on Km and Vmax ofCOMT and CCoAOMT for these substrates from in vivo/in vitrowet laboratory studies suggests that both enzyme–substratescomplexes (COMT+ sinapoyl-CoA and CCoAOMT+ sinapoyl-CoA) are productive [9]. Therefore, it may be inferred that thesubstrates that are so far not considered as the conventionalsubstrates of these enzymes may also bind these enzymes in anefficient manner and are catalytically active even if they bindin a different conformation to enzymes. In some cases, thesechanged conformations are more effective.

4. ConclusionsLignin, which makes up an average 25% of the plant biomass,is considered to be one of the greatest obstacles to the optimalutilization of the plant biomass for various purposes such aspaper manufacturing, production of highly palatable forage,and bagasse utilization. Therefore, there has been a worldwideeffort in the past two decades for the reduction/alternation oflignin in plant biomass using molecular genetic approaches,for their better utilization. In view of alternative methylationpathways reported for more than a decade, it has been sug-gested that COMTs as well as CCoAOMTs both can efficientlycatalyze the methylation of substrates that are traditionallynot considered as their normal substrates. However, there isstill ambiguity with regard to substrate affinities and efficiencyof catalysis for COMTs and CCoAOMTs. Therefore, this needsfurther validation. In this direction, the present work dealswith analysis and comparison of 10 protein sequences, forboth COMT and CCoAOMT, from selected dicots and monocotsfor parameters like molecular weights, pI, domain analysis,their cellular localization, the evolutionary relationship, andsubstrate specificity. All the COMT and CCoAOMT sequencesanalyzed were found to be acidic and soluble in nature.CCoAOMTs were found to be smaller proteins of monomericnature as compared with those of COMTs, which were dimericwith a unique dimerization domain. Phylogenetic analysis ofthe COMT and CCoAOMT sequences revealed their clusteringinto two major groups: one group composed of COMT, and theother group composed of CCoAOMT; these two groups werefurther subgrouped into monocots and dicots. A comparisonof the modeled COMT and CCoAOMT of M. sativa revealeda similarity at the core region of the protein with a uniquedomain in COMT, which might be involved in the dimerization.The docking analysis of COMT and CCoAOMT with the 16putative substrates (intermediates of monolignol biosynthe-sis pathway) revealed that all these substrates bind to bothCOMT and CCoAOMT, with binding to CCoAOMT being moreefficient. Results of the present molecular docking analyses

have suggested newer roles of these enzymes in methylation ofintermediates of monolignol biosynthesis pathway.

5. AcknowledgementsFinancial support from the Department of Biotechnology,Govt. of India, New Delhi, under the BIF programme andDepartment of Higher Education, Govt. of U.P., under theCenter of Excellence programme is gratefully acknowledged.

6. References[1] Eckardt, N. A. (2002) Plant Cell 14, 1185–1189.[2] Ye, Z. H., Zhong, R., Morrison, W. H., and Himmelsbach, D. S. (2001)

Phytochemistry 57(7), 1177–1185[3] Dixon, R. A., Chen, F., Guo, D., and Parvathi, K. (2001) Phytochemistry 57(7),

1069–1084.[4] Fukushima, K. (2001) J. Plant Res. 114, 499–508.[5] Zubieta, C., Kota, P., Ferrer, J. L., Dixon, R. A., and Noel, J. P. (2002) Plant Cell

14, 1265–1277.[6] Humphreys, J. M., and Chapple, C. (2002) Curr. Opin. Plant Biol. 5, 224–229.[7] Bonawitz, N. D., and Chapple, C. (2010) Annu. Rev. Genet. 44, 337–363[8] Barakat, A., Choi, A., Yassin, N. B., Park, J. S., Sun, Z., and Carlson, J. E.

(2011) Gene 479, 37–46.[9] Maury, S., Geoffroy, P., and Legrand, M. (1999) Plant Physiol. 121, 215–223.

[10] Parvathi, K., Chen, F., Guo, D. J., Blount, J. W., and Dixon, R. A. (2001) PlantJ. 25, 193–202.

[11] Pincon, G., Maury, S., Hoffmann, L., Geoffrey, P., Lapierre, C., Pollet, B., andLegrand, M. (2001) Phytochemistry 57, 1167–1176.

[12] Edwards, R., and Dixon, R. A. (1991) Arch. Biochem. Biophys. 287(2), 372–379.

[13] Hoffmann, L., Maury, S., Bergdoll, M., Thion, L., Erard, M., and Legrand, M.(2001) Plant Physiol. 276, 36831–36838.

[14] Ferrer, J. L., Zubieta, C., Dixon, R. A., and Noel, J. P. (2005) Plant Physiol. 137,1009–1017.

[15] Sali, A., and Blundell, T. L. (1993) J. Mol. Biol. 234, 779–815.[16] Eswar, N., Marti-Renom, M. A., Webb, B., Madhusudhan, M. S., Eramian,

D., Shen, M., Pieper, U., and Sali, A. (2006) Curr. Protoc. Bioinform. 15,5.6.1–5.6.30.

[17] Laskowski, R. A., MacArthur, M. W., Moss, D. S., and Thornton, J. M. (1993)J. Appl. Crystallogr. 26, 283–291.

[18] Colovos, C., and Yeates, T. O. (1993) Protein Sci. 2, 1511–1519.[19] Wiederstein, M., and Sippl, M. J. (2007) Nucleic Acids Res. 35, W407–W410.[20] Dundas, J., Ouyang, Z., Tseng, J., Binkowski, A., Turpaz, Y., and Liang, J.

(2006) Nucleic Acids Res. 34, W116–W118.[21] Wheeler, D. L., Barrett, T., Benson, D. A., Bryant, S. H., Canese, K., Chetvernin,

V., Church, D. M., DiCuccio, M., Edgar, R., Federhen, S., Geer, L. Y., Helmberg,W., Kapustin, Y., Kenton, D. L., Khovayko, O., Lipman, D. J., Madden, T. L.,Maglott, D. R., Ostell, J., Pruitt, K. D., Schuler, G. D., Schriml, L. M., Sequeira,E., Sherry, S. T., Sirotkin, K., Souvorov, A., Starchenko, G., Suzek, T. O.,Tatusov, R., Tatusova, T. A., Wagner, L., and Yaschenko, E. (2006) NucleicAcids Res. 34, D173–D180.

[22] Morris, G. M., Huey, R., Lindstrom, W., Sanner, M. F., Belew, R. K.,Goodsell, D. S., and Olson, A. J. (2009) J. Comput. Chem. 16, 2785–2791.

[23] Hong, B. S., Kim, B. G., Jung, N. R., Lee, L., Lim, Y., Chong, Y., and Ahn, J. H.(2009) Bull. Korean Chem. Soc. 30, 2803–2805.

[24] Ruelland, E., Campalans, A., Housein, G. S., Puigdomenech, P., and Rigau, J.(2003) Physiol. Plant 117, 93–99.

[25] Kersey, R., Inoue, K., Schubert, K. R., and Dixon, R. A. (1999) Protoplasma209, 46–57.

[26] Ibrahim, R. K., Bruneau, A., and Bantignies, B., (1998) Plant Mol. Biol. 36,1–10.

188 Overlapping Methylation by COMT and CCoAOMT

Page 20: Structure-function analyses and molecular modeling of caffeic acid-O-methyltransferase and caffeoyl-CoA-O-methyltransferase: Revisiting the basis of alternate methylation pathways

[27] Ghosh, R., Choi, B. S., Jeong Mi-J., Bae D. W., Shin, S. C., Park, S. U., Lim, H.S., Kim, J., and Bae, H. (2012) Plants Omics J. 5, 184–193

[28] Ma, Q. H., and Xu, Y. (2008) Biochimie 90, 515–524.[29] Li, L., Popko, J. L., Umezawa, T., and Chiang, V. L. (2000) J. Biol. Chem. 275,

6537–6545.[30] Lukacin, R., Matern, U., Specker, S., and Vogt, T. (2004) FEBS Lett. 577,

367–370.[31] Day, A., Neutelings, G., Nolin, F., Grec, S., Habrant, A., Cronier, D., Maher, B.,

Rolando, C., David, H., Chabbert, B., and Hawkins, S. (2009) Plant Physiol.Biochem. 47, 9–19.

[32] Pagadala, S. N., Arha, M., Reddy, P. S., Kumar, R., Sirisha, V. L., Prashant, S.,Janardhan, R. K., Khan, B., Rawal, S. K., and Kavi Kishor, P. B. (2009) J. Mol.Model. 15, 203–221.

[33] Kota, P., Guo, D., Zubieta, C., Noel, J. P., and Dixon, R. A. (2004) Phytochemistry65, 837–846.

[34] Gordon, V. L., Bowman, M. E., Tu, Y, Mouradov, A., Span-genberg, G., and Noel, J. P. (2010) Plant Cell 22, 4114–4127.

[35] Lam, K. C., Ibrahim, R. K., Behdad, B., and Dayanandan, S. (2007) Genome50, 1001–1013.

Biotechnology and Applied Biochemistry 189