ntologies in biological chemistry kirill degtyarenko sergio contrino @ ebi

56
ntologie s in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Upload: corey-patterson

Post on 04-Jan-2016

222 views

Category:

Documents


6 download

TRANSCRIPT

Page 1: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

ntologiesin biologicalchemistry

Kirill Degtyarenko Sergio Contrino @ EBI

Page 3: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

• Structure

• Physico-chemical properties

• Biological function

• 2-D, 2.5-D or 3-D structure

• Not deduced from the structure

• Biochemical reactions, mostly

• Structural

Ontologies for “biochemical” compounds

Page 4: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

hysico-chemical roperties

Page 5: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Physico-chemical property

• Molecular property

• Supramolecular property

• System property

• Reaction property

Page 6: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Molecular entity has…

• Mass (“molecular weight”)

• Size

• Shape

• Charge

• Structure One can derive many properties from

known complete structure

• Spectra ?

Page 7: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Molecular property

Heat capacity

Mass

Net charge

Shape

Size

StructureGeometry

Connectivity

Topography

Method

Calorimetry

Centrifugation

Crystallography

Electrophoresis

Isotope method

Mass spectrometry

Microscopy

Spectroscopy

Page 8: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Raman spectroscopy

Chandrasekhara Venkata Raman

Raman effect

1928

Raman spectrum

IS BASED ON

vibrational spectroscopy

YIELDS

ISA

DISCOVERED BY

DISCOVERED IN

protein secondary structure

protein conformation

ISA

Amide bands

FEATURES

AFFECTS

IS USED TO INVESTIGATE

Page 9: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Physico-chemical ontology

• Physico-chemical property

• Physico-chemical method

Page 10: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI
Page 11: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

tructure

Page 13: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

%observable universe (cf. "Lenin's definition of matter")

<energy (has *no rest mass*)

%electromagnetic radiation

<photon

%(other forms of energy, can elaborate later)

<matter (has *rest mass*)

%free elementary particles having non-zero rest mass (*molar!*)

<elementary particle

%electron

%proton

%neutron

%molecular matter (here the chemical ontology starts)

%grouped_by_composition

%compound ; synonym:chemical substance

<formula unit

<molecular entity

%atom

<electron

<nucleus

<proton

<neutron

%element

%atomic ion

%atomic radical

%molecule

<group

%molecular ion

%molecular radical

%noncovalent crystal molecule

%ionic crystal molecule

%metallic crystal molecule

%covalent molecule

%discrete covalent molecule

%giant covalent molecule

%coordination molecule

(continued)

%ion

%atomic ion

%molecular ion

%radical

%atomic radical

%molecular radical

%mixture

<compound

%heterogeneous mixture

%colloidal suspension

%liquid aerosol

%solid aerosol

%foam

%emulsion

%sol

%solid foam

%gel

%solid sol

%homogeneous mixture

%solution

<solute

<solvent

%solid solution

...

%grouped_by_state_of_matter

%plasma

%gas

%liquid

%solid

%heterogeneous mixture

Page 14: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Molecular ontology

• noncovalent crystal• ionic crystal• metallic crystal

• covalent• giant covalent• discrete covalent

•coordination

Page 15: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

• Metalloproteins

• Organic prosthetic group proteins

• Modified amino acid proteins

Complex proteins

• Proteins consisting of more than one polypeptide chain

• Combinations of all groups

Page 16: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Bioinorganic motif (BIM)

• A common structural feature of a class of functionally related, but not necessarily homologous, proteins, that includes the metal atom(s) and first coordination shell ligands

[Degtyarenko K.N. (2000) Bioinformatics 16, 851–864]

Page 17: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Dimensionality of BIM

Page 18: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Fe0

D

Page 19: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Fe1

P-D-x(2)-H-[DE]-[LI]-[LIVMF]-G-H-[LIVMC]-P-x(n)-E

D

Page 20: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

N

N

O

O

H2O

H2OFe

N

N 2

D

Page 21: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

N

N

O

O

H2O

H2OFe

N

N 2.5

D

Page 22: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

3

D

Page 23: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

http://www.ebi.ac.uk/~kirill/come/

• Italian word come (how)

• English word come (not GO)

• Classification Of Metalloproteins

• COfactors and Metals

• COMplex proteins, etc.

• Co-Ordination of Metals in proteins

Page 24: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI
Page 25: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI
Page 26: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

COMe version 2.11

• Controlled vocabulary• No definitions• 1079 protein classes (PRX)• 351 bioinorganic motifs (BIM)• 132 small molecules (MOL)

organised as:• XML version (master)• Oracle version

monthyear

Page 27: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Relationships in COMe

• isKindOf : inherits all attributes PRX to PRX ; BIM to BIM ; MOL to MOL

• isPartOf : no inheritanceBIM to BIM ; MOL to MOL ; MOL to BIM ; BIM to PRX

• isBoundTo : no inheritanceMOL to PRX

Page 28: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

- <term value="rusticyanin" id="PRX000193" dbxref="InterPro:IPR001243">

</term>

<bim coordination="T-4">Cu(ND.His)2(SD.Met)(SG.Cys)</bim>

N

N

Cu

S Cys

S

N

N

- <substructure id="BIM000085">

</substructure>

Page 29: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

- <substructure id="BIM000245">

</substructure>

<bim>heme(OD.Asp)(OE.Glu)(SD.Met)</bim>

<term value="hemediol-L-aspartyl ester-L-glutamyl ester-L-methionine sulfonium" dbxref="RESID:AA0280" />

<term value="heme m" />

- <substructure id="BIM000246">

<bim>(CBB.heme)(SD.Met)</bim>

</substructure>

- <substructure id="BIM000247">

<bim>(CMD.heme)(OD.Asp)</bim>

</substructure>

- <substructure id="BIM000248">

<bim>(CMB.heme)(OE.Glu)</bim>

</substructure>

Asp

O

ON

N

N

N

Fe

OO

OO

S+Met

O

GluO

Page 30: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

- <substructure id="BIM000281">

</substructure>

<bim>MIO</bim>

<!-- originates from cyclization and dehydration of internal Ala-Ser-Gly -->

<term value="3,5-dihydro-5-methylidene-4H-imidazol-4-one" lref=" MEDLINE:21462607" />

<term value="4-methylidene-imidazole-5-one" dbxref="PDB:1B8F" />

N

N

CH2

O

O

NHNH

Page 31: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Verdict on structure representation in COMe

• Metal-containing BIMIntuitively understandable

• Organic prosthetic groupSometimes not

• Modified amino acid(s)Not really… but we can link to RESID @ http://srs.ebi.ac.uk/

Page 32: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

COMe: future work

• More PRX, BIM, MOL

• Building blocks for BIMs

• Representation of inheritance

• 2.5-D representation of BIM / MOL (?)

Page 33: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

iological function

Page 34: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Biochemical reactions (I)

• Enzymatic reactions

• Non-enzymatic reactions

Page 35: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Biochemical reactions (II)• Binding

A + M A—M (A = “small molecule”)

• Biotransformation A + B C + D (A, B, C, D = small molecules)

• Molecular transport A(compartment X) A(compartment Y)

• Electron and exciton transfer reactions

• Conformation change (e.g. folding)

Page 36: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Biotransformation reactions

• Catalytic Catalyst Enzymatic protein Ribozymatic RNA Heterogeneous surface (e.g. metal) Homogeneous solute (e.g. metal)

• Non-catalytic Photoinduced — “Spontaneous” —

Page 37: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Aromatic amino acid hydroxylasesEC 1.14.16.1

L-Phe + H4B + O2 = L-Tyr + H2B + H2O

EC 1.14.16.2

L-Tyr + H4B + O2 = 3,4-dihydroxy-L-Phe + H2B + H2O

EC 1.14.16.4

L-Trp + H4B + O2 = 5-hydroxy-L-Trp + H2B + H2O

I. RH + tetrahydrobiopterin + O2 =

ROH + 4a-hydroxytetrahydrobiopterin

II. 4a-hydroxytetrahydrobiopterin = 6,7-dihydrobiopterin + H2O

III. 6,7-dihydrobiopterin = 7,8-dihydrobiopterin

In fact...

Page 38: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

ABC of enzyme use

• Alcohol (~2500 BC)

• Bread (~2600 BC)

• Cheese (~1000 BC)

Page 39: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Enzyme Nomenclature• Enzyme Commission: established 1956 by

IUB• Now: NC-IUBMB• Assigns EC numbers which were supposed to

serve as unique identifiers of enzymatic reactions

• Classification by overall reaction catalysed (not by the reaction mechanism nor any other specific property of an enzyme)

• EC numbers form a strict hierarchy of ISA relationships

• http://www.ebi.ac.uk/intenz/

Page 40: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Overall transformations in “enzymatic reactions”

EC1 Oxidoreductases

EC2 Transferases

EC3 Hydrolases

EC4 Lyases

EC5 Isomerases

EC6 Ligases

Aox + Dred Ared + Dox

A-X + B-H A-H + B-X

A–B + HOH A–H + B–OH

X–Y–Z X=Y + Z

A B

A + B + XTP A–B + XDP + Pi

A + B + XTP A–B + XMP + PPi

Page 41: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Overall transformations in organic chemistry

• Addition

• Elimination

• Substitution

• Rearrangement

A + B A–B

A–B A + B

A–X + B–Y A–Y + B–X

A B

(after R.B. Grossman, 1999)

Page 42: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Reaction mechanisms in organic chemistry

• Polar Polar acidic Polar basic

• Free-radical

• Pericyclic

• Metal-mediated and -catalysed

(after R.B. Grossman, 1999)

Page 43: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Mechanism of biochemical reaction

• Mechanism of reaction irrespectively of

catalyst

• e.g. homolytic vs heterolytic bond scission

• Mechanism of reaction according to

catalyst nature

• e.g. Cu-containing vs FAD-containing

Page 44: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

EC 1.2.3.4

Anatomy of an EC number

oxalate + O2 = 2 CO2 + H2O2

EC 1.2.3.4

• EC 1 Oxidoreductase

EC 1.2.3.4

• EC 1.2 Acting on the aldehyde or oxo group of donors

EC 1.2.3.4

• EC 1.2.3 With oxygen as acceptor

EC 1.2.3.4

• EC 1.2.3.4 Oxalate oxidase

Page 45: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Redundancy and deficiency

• Overall transformations in “enzymatic reactions”

are mostly based on those of organic chemistry

• EC3 (Hydrolases), many EC1 (Oxidoreductases)

and some EC4 (Lyases) can be considered kind of

EC2 (Transferases)

• One part of EC6 (Ligases) reactions can be

considered kind of EC3 (Hydrolases)

• No classification for some fundamental reaction

types, e.g. addition not to double bonds

Page 46: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

• EC 4.99.1.1 ferrochelataseprotoporphyrin + Fe2+ = protoheme + 2 H+

creates metal–N bond

EC 4.99 Other Lyases

• EC 4.99.1.2 alkylmercury lyaseRHg+ + H+ = RH + Hg2+

breaks metal–C bond

• Both these enzymes should not be classified as lyases

(example thanks to Keith Tipton)

Page 47: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Other problems

• No explicit difference between enzymatic

reactions and enzymes is made, therefore

• Some enzymes were given different EC numbers

on the basis of different cofactor or origin (!)

• The reactions are always written as if they were

reversible, therefore

• Enzymes catalysing the opposite reactions are

given the same EC number

Page 48: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

In fact...These may be different enzymes!

EC 1.18.1.2

• Ferredoxin:NADP+ reductase

[Fe2S2]+ferredoxin + NADP+ [Fe2S2]2+

ferredoxin + NADPH

• Adrenodoxin reductase

[Fe2S2]2+adrenodoxin + NADPH [Fe2S2]+

adrenodoxin + NADP+

Page 49: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Sub-subclasses in EC1.1–1.10

• EC 1.x.1 With NAD or NADP as acceptor• EC 1.x.2 With a heme protein as acceptor• EC 1.x.3 With oxygen as acceptor• EC 1.x.4 With a disulfide as acceptor• EC 1.x.5 With a quinone as acceptor• EC 1.x.7 With an iron–sulfur protein as acceptor• EC 1.x.6 With a nitrogenous group as acceptor• EC 1.x.8 With a flavin as acceptor• EC 1.x.99 With other acceptors

Page 50: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

ISA EC 1.1.1.2 alcohol dehydrogenase (NADP) ISA EC 1.1.1.91 aryl-alcohol dehydrogenase (NADP) ISA EC 1.1.1.97 3-hydroxybenzyl-alcohol dehydrogenase

Current classification

EC 1.1.1 With NAD or NADP as acceptor

Page 51: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

ISA EC 1.1.1.2 alcohol dehydrogenase (NADP) ISA EC 1.1.1.91 aryl-alcohol dehydrogenase (NADP)

ISA EC 1.1.1.97 3-hydroxybenzyl-alcohol dehydrogenase

Alternative classification

EC 1.1.1 With NAD or NADP as acceptor

Page 52: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Other relationships between enzyme entities

• Structural and functional

relationships

• Historical relationships

• Dodgy relationships

Page 53: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Structural and functional relationships

• (reaction) EC A is KindOf (reaction) EC B

• (enzyme) EC A is PartOf (enzyme) EC B

• EC A is involved in metabolic process Z

• product of EC A is a substrate of EC B

• EC A activates (enzyme) EC B

• EC A inactivates (enzyme) EC B

• EC A phosphorylates (enzyme) EC B

• EC A dephosphorylates (enzyme) EC B

Page 54: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Historical relationships

Entry transfer:

• EC A is transferred to EC B

Entry merger:

• EC A is incorporated in EC B

Entry split:

• EC A is part transferred to EC B1

• EC A is part transferred to EC B2

Entry reinstatement:

• (deleted) EC A reinstated as EC B

Page 55: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Dodgy relationships

• EC A is not identical to EC B (never should)

• EC A may be identical to EC B (if proven, one of the entries should be deleted as “identical to”)

• EC A is probably identical to EC B (same)

• EC A is related to EC B (how?)

Page 56: ntologies in biological chemistry Kirill Degtyarenko Sergio Contrino @ EBI

Acknowledgements

IntEnz team:• Michael Darsow• Astrid Fleischmann• Wolfgang Fleischmann

• John Garavelli (RESID)• Rolf Apweiler