The Importance of the InChI Identifier as a Foundation Technology for eScience Platforms at RSC

Antony Williams

Bio-IT, Boston, April 27th 2014

Without the InChI…

• ChemSpider is unlikely to have been built

• It would not have grown into one of the domain's primary online chemistry resources

• The Royal Society of Chemistry would not have it as an online database, would not have a large cheminformatics team and would not be involved in a number of large-scale funded projects around chemistry data

ChemSpider

• ~30 million chemicals and growing

• Data sourced from >500 different sources

• Crowd sourced curation and annotation

• Ongoing deposition of data from our journals and our collaborators

• Structure centric hub for web-searching

• …and a really big dictionary!!!

Experimental/Predicted Properties

Literature references

Patent references

So what is Yohimbine?

[Image search results for “yohimbine”: supplement product photos, plant material and chemical structure drawings]

Of course it is out there…

Drugbox: 3001/5080 with InChIs

Chembox: 5436/7690 with InChIs

Tell me more…

• Where can I find the molfile for Yohimbine?
• Papers/Patents about Yohimbine?
• What are the side effects of Yohimbine?
• Where can I order Yohimbine?
• What are the physicochemical properties?
• Metabolic pathways?
• Different synonyms of Yohimbine?
• Synthesis of Yohimbine?
• Side effects of Yohimbine?
• Etc.…

Quantity!

http://en.wikipedia.org/wiki/Yohimbine





http://www.drugs.com/mtm/yohimbine.html

http://www.mayoclinic.com/health/drug-information/DR601453

http://www.nlm.nih.gov/medlineplus/druginfo/natural/759.html

http://www.amazon.com/Grams-Yohimbine-HCL-Bulk-Powder/dp/B005XOUOA4

Yohimbine on ChemSpider

http://www.chemspider.com/Chemical-Structure.8622.html

http://www.chemspider.com/Molecular-Formula/C21H26N2O3


Downsides of Overall Approach

• Meshing data together based on InChIs worked for simple molecules

• 2D layout errors inherited or limited by algorithm

• Complex molecules that are meant to be the same thing were NOT deduplicated. Records that differ by one stereocenter but share a name, and were meant to be the same compound, get different InChIs and so stay separate
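
The stereocenter problem can be made concrete by looking at the layers of an InChI. What follows is a minimal sketch, assuming standard InChI strings whose layers are separated by "/" and whose stereo layers start with b, t, m or s. The helper names are hypothetical, and VARIANT is a made-up string with one flipped stereo sign, used only for illustration and not claimed to be any real compound.

```python
# Layer prefixes that carry stereochemistry in a standard InChI:
# b = double bond, t = tetrahedral, m and s = stereo type flags.
STEREO_PREFIXES = ("b", "t", "m", "s")

# Yohimbine InChI as searched for later in this talk.
YOHIMBINE = (
    "InChI=1S/C21H26N2O3/c1-26-21(25)19-15-10-17-20-14(13-4-2-3-5-16(13)22-20)"
    "8-9-23(17)11-12(15)6-7-18(19)24/h2-5,12,15,17-19,22,24H,6-11H2,1H3"
    "/t12-,15-,17-,18-,19+/m0/s1"
)
# Hypothetical variant: same skeleton, one stereo sign flipped (illustration only).
VARIANT = YOHIMBINE.replace("19+/m0", "19-/m0")


def split_layers(inchi):
    """Return (version, formula, remaining_layers) for an InChI string."""
    if not inchi.startswith("InChI="):
        raise ValueError("not an InChI string")
    parts = inchi[len("InChI="):].split("/")
    return parts[0], parts[1], parts[2:]


def without_stereo(inchi):
    """Drop the stereo layers, keeping formula, connectivity and hydrogens."""
    version, formula, layers = split_layers(inchi)
    kept = [layer for layer in layers if not layer.startswith(STEREO_PREFIXES)]
    return "InChI=" + "/".join([version, formula] + kept)


# The full strings differ, so an InChI-keyed database keeps two records.
print(YOHIMBINE == VARIANT)
# With the stereo layers removed the two strings collapse to the same value.
print(without_stereo(YOHIMBINE) == without_stereo(VARIANT))
```

Comparing on the stereo-free string is one way to flag records that probably should be reviewed together; it is not a substitute for curation, since genuinely different stereoisomers collapse too.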

Yohimbine on ChemSpider… Quality?

So where can we travel???

http://www.google.com/search?q=InChI=1S/C21H26N2O3/c1-26-21(25)19-15-10-17-20-14(13-4-2-3-5-16(13)22-20)8-9-23(17)11-12(15)6-7-18(19)24/h2-5,12,15,17-19,22,24H,6-11H2,1H3/t12-,15-,17-,18-,19+/m0/s1

http://www.google.com/search?q=BLGXFZZNTVWLAY

http://www.google.com/search?q=BLGXFZZNTVWLAY-SCYLSFHTSA-N
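
The three searches above use the full InChI, the first block of the InChIKey, and the full InChIKey. A small sketch of how such queries could be built, assuming the standard InChIKey shape of three hyphen-separated blocks (14, 10 and 1 characters), where the first block is derived from the connectivity alone. The function names are hypothetical.

```python
from urllib.parse import urlencode

YOHIMBINE_KEY = "BLGXFZZNTVWLAY-SCYLSFHTSA-N"  # InChIKey taken from the search URL above


def inchikey_blocks(key):
    """Split a standard InChIKey into its three hyphen-separated blocks."""
    blocks = key.split("-")
    if [len(block) for block in blocks] != [14, 10, 1]:
        raise ValueError("unexpected InChIKey shape: %r" % key)
    return blocks


def google_search_url(term):
    """Build a Google query URL for an identifier string."""
    return "http://www.google.com/search?" + urlencode({"q": term})


skeleton, _, _ = inchikey_blocks(YOHIMBINE_KEY)
# Searching the first block alone also finds pages about compounds
# with the same skeleton but different stereochemistry.
print(google_search_url(skeleton))
print(google_search_url(YOHIMBINE_KEY))
```

Searching on the first block is the looser query: it trades precision for recall, which is useful when sources disagree on stereochemistry.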

So where can we travel???





http://en.wikipedia.org/wiki/Rauwolscine


http://webbook.nist.gov/cgi/cbook.cgi?ID=C146485&Mask=200

http://www.massbank.jp/jsp/Dispatcher.jsp?type=disp&id=WA002418&site=2

InChI String Search via Google

Give me InChIKeys…

http://en.wikipedia.org/wiki/File:Yohimbine_structure.svg

http://en.wikipedia.org/wiki/International_Union_of_Pure_and_Applied_Chemistry_nomenclature

http://en.wikipedia.org/wiki/File:Rauwolscine.png

http://en.wikipedia.org/wiki/International_Union_of_Pure_and_Applied_Chemistry_nomenclature

And where can we travel???




ChemSpider (Aggregator)

BRENDA (Enzymes): http://www.brenda-enzymes.org/php/ligand_flatfile.php4?brenda_ligand_id=17931

Wikipedia (Encyclopedia)

ChEMBL (Pharmacology): https://www.ebi.ac.uk/chembldb/index.php/compound/inspect/115191

ChEBI (Curated Chemicals): http://www.ebi.ac.uk/chebi/searchId.do?chebiId=CHEBI:10093

DrugBank (Drug-Drug Target): http://www.drugbank.ca/drugs/DB01392


How do we build it?

• We deal in Molfiles or SDF files – with coordinates
• Deposit anything that has an InChI – we support what InChI can handle, good and bad
• Standardization based on “InChI standardization”
• InChIs aggregate (certain) tautomers
• We link out to external sites using their IDs
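
Linking out by external ID can be sketched as a table of URL templates. This is an illustration only: the templates below are inferred from the yohimbine URLs shown in this talk and may not match what those sites use today, and the function name is hypothetical.

```python
# URL templates inferred from links in this talk (assumption, may be outdated).
LINK_TEMPLATES = {
    "ChemSpider": "http://www.chemspider.com/Chemical-Structure.{id}.html",
    "ChEBI": "http://www.ebi.ac.uk/chebi/searchId.do?chebiId=CHEBI:{id}",
    "DrugBank": "http://www.drugbank.ca/drugs/{id}",
}


def link_outs(external_ids):
    """Return {source: url} for every external ID with a known template."""
    return {
        source: LINK_TEMPLATES[source].format(id=ext_id)
        for source, ext_id in external_ids.items()
        if source in LINK_TEMPLATES
    }


# IDs for yohimbine as they appear in the URLs in this talk.
for source, url in link_outs(
    {"ChemSpider": "8622", "ChEBI": "10093", "DrugBank": "DB01392"}
).items():
    print(source, url)
```

Storing only the external ID and a per-source template keeps the record small and lets a changed URL scheme be fixed in one place.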

Downsides of InChI

• InChI was a moving target (multiple versions) but overall worked as planned.

• Good for small molecules – but no polymers, issues with inorganics, organometallics, imperfect stereochemistry. ChemSpider is “small molecules”

• InChI used as the “deduplicator” – FIRST version of a compound into the database becomes THE structure to deduplicate against…
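
The “first in wins” behaviour can be sketched in a few lines. This is an assumed, simplified model of InChI-keyed deduplication, not a description of the actual ChemSpider code; the class and field names are hypothetical.

```python
class Registry:
    """Toy compound registry that deduplicates deposits on the InChI string."""

    def __init__(self):
        self._by_inchi = {}

    def deposit(self, inchi, molfile, source):
        """Return (record, is_new). The first molfile seen for an InChI is kept."""
        record = self._by_inchi.get(inchi)
        if record is None:
            record = {"inchi": inchi, "molfile": molfile, "sources": [source]}
            self._by_inchi[inchi] = record
            return record, True
        # Later deposits only add provenance; their molfile (layout, and any
        # errors it fixes or introduces) is not used for the record.
        if source not in record["sources"]:
            record["sources"].append(source)
        return record, False


registry = Registry()
water = "InChI=1S/H2O/h1H2"
registry.deposit(water, "molfile from source A", "source A")
record, is_new = registry.deposit(water, "molfile from source B", "source B")
print(is_new, record["molfile"], record["sources"])
```

The consequence is the one the slide warns about: the quality of the first deposited structure decides what every later depositor is matched against.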

Side Effects of InChI Usage

SMILES by comparison…

Side Effects of InChI Usage

Standardization Issues

Depiction based on molfile

Standardize

Use the SRS as a guidance document for standardization

Adjust as necessary to our needs

Nitro groups

Salt and Ionic Bonds

Ammonium salts

NPC Browser Set

http://cv.beta.rsc-us.org/Files.aspx?batch=examples%5Cc76d83b5-4875-4aa4-96c7-a81b993b15ac_jysdkayg4x5/passed&file=1573.png

Checks include InChI

• Many SDF files contain InChIs and SMILES – comparing the structure contained within the file with the associated InChI is useful – turned up a number of errors in checking online databases
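
A check of this kind could look like the sketch below. It assumes the usual SDF layout (a molblock ending in "M  END", data items introduced by a "> <FieldName>" header and closed by a blank line, records separated by "$$$$") and a field called InChI, which is an assumption about the file. The InChI generation itself is left to a caller-supplied function, since it needs a cheminformatics toolkit; all function names here are hypothetical.

```python
import re

FIELD_HEADER = re.compile(r"^>.*<([^>]+)>")


def _parse_record(lines):
    """Split one SDF record into (molblock, {field: value})."""
    end = next((i for i, l in enumerate(lines) if l.startswith("M  END")), None)
    if end is None:
        raise ValueError("record has no 'M  END' line")
    molblock = "\n".join(lines[: end + 1])
    fields, name = {}, None
    for line in lines[end + 1:]:
        header = FIELD_HEADER.match(line)
        if header:
            name = header.group(1)
            fields[name] = []
        elif name is not None and line.strip():
            fields[name].append(line)
        else:
            name = None  # a blank line closes the current data item
    return molblock, {k: "\n".join(v) for k, v in fields.items()}


def parse_sdf(text):
    """Return a list of (molblock, fields) for records terminated by $$$$."""
    records, lines = [], []
    for line in text.splitlines():
        if line.strip() == "$$$$":
            records.append(_parse_record(lines))
            lines = []
        else:
            lines.append(line)
    return records


def find_inchi_mismatches(text, compute_inchi, field="InChI"):
    """Return indexes of records whose stored InChI differs from the computed one."""
    mismatches = []
    for index, (molblock, fields) in enumerate(parse_sdf(text)):
        stored = fields.get(field)
        if stored is not None and stored.strip() != compute_inchi(molblock):
            mismatches.append(index)
    return mismatches
```

In practice `compute_inchi` would wrap whichever toolkit generates an InChI from a molblock; records without the field are skipped rather than reported.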

So, I’m writing an article…

With these…I will lose data

But linking with InChI …

Structure Searching the Web

Data in Publications

• This is not new, you know the story…
• So much data of value is contained within a publication and delivered in PDF form
• PDF files, and unclear licensing/copyright, limit access to data that I want to rework, reuse, repurpose, text mine etc.

• “I specialize in XXXX. I want a database of YYYY extracted from publications and made available, for free, with the capabilities I need, and the publishers should just do it”

“Data enable” publications?

• We would LOVE to bring data out of our archive
• What could we do?
• Find chemical names and generate structures
• Find chemical images and generate structures
• Find reactions – and make a database!
• Find data (MP, BP, LogP) and host. Build models!
• Find figures and database them
• Find spectra (and link to structures)
• Validate the data algorithmically

RSC Archive – since 1841

Text Mining

The N-(β-hydroxyethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiadiazol-5-yl)urea prepared in Example 6 , thionyl chloride ( 5 ml ) and benzene ( 50 ml ) were charged into a glass reaction vessel equipped with a mechanical stirrer , thermometer and reflux condenser .

The reaction mixture was heated at reflux with stirring , for a period of about one-half hour .

After this time the benzene and unreacted thionyl chloride were stripped from the reaction mixture under reduced pressure to yield the desired product N-(β-chloroethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiaidazol-5-yl)urea as a solid residue

But names = structures

• Systematic names can be generated FROM chemical structures algorithmically

But names = structures

• …and structures from systematic names

But what of trivial names?

• What about trivial names, trade names, CAS numbers, multilingual names etc.?
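
For names that cannot be converted algorithmically, the fallback is a dictionary lookup. A minimal sketch, assuming a dictionary that maps normalized names and registry numbers to InChIKeys; the class is hypothetical and the two entries are illustrative only.

```python
def normalize(name):
    """Case-fold and collapse whitespace so trivial spelling variants match."""
    return " ".join(name.casefold().split())


class NameDictionary:
    """Maps trivial names, trade names, registry numbers etc. to InChIKeys."""

    def __init__(self):
        self._index = {}

    def add(self, inchikey, *names):
        for name in names:
            self._index.setdefault(normalize(name), set()).add(inchikey)

    def lookup(self, name):
        """Return a sorted list of InChIKeys; more than one means the name is ambiguous."""
        return sorted(self._index.get(normalize(name), ()))


names = NameDictionary()
# Illustrative entries: the yohimbine InChIKey used earlier in this talk,
# registered under its trivial name and its CAS number.
names.add("BLGXFZZNTVWLAY-SCYLSFHTSA-N", "Yohimbine", "146-48-5")
print(names.lookup("  YOHIMBINE "))
print(names.lookup("unknown trade name"))
```

Returning a list rather than a single key keeps ambiguous names visible instead of silently picking one structure.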

Searching that lipid in patents

Aspirin on ChemSpider

Work in Progress

But Context Gives Reactions

The N-(β-hydroxyethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiadiazol-5-yl)urea prepared in Example 6 , thionyl chloride ( 5 ml ) and benzene ( 50 ml ) were charged into a glass reaction vessel equipped with a mechanical stirrer , thermometer and reflux condenser .

The reaction mixture was heated at reflux with stirring , for a period of about one-half hour .

After this time the benzene and unreacted thionyl chloride were stripped from the reaction mixture under reduced pressure to yield the desired product N-(β-chloroethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiaidazol-5-yl)urea as a solid residue

ChemSpider Reactions

ChemSpider as a Foundation

• >30 million chemicals (and growing)

• ChemSpider is free to access for everyone – and the API means people program against it

• What projects can we benefit?

Support grant-based services

• Multiple European consortium-based grants

• PharmaSea (FP7 funded)

• Open PHACTS (IMI funded)

• UK National Chemical Database Service (http://cds.rsc.org) – developing data repository for lab data, integrate Electronic Lab Notebooks

• Open Drug Discovery projects

PharmaSea

Open PHACTS

• 3-year Innovative Medicines Initiative project

• Integrating chemistry and biology data using semantic web technologies

• Open code, open data, open standards

• Academics, Pharmas, Publishers…

• To put medicines in the pipeline…

All Databases We Generate…

• All databases and systems we build now include generated InChIs

• InChIs are facilitating discoverability via searching on Google (see Chris’ talk) but also for querying and linking
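
Linking on InChI amounts to a join across databases on a shared key. A minimal sketch, assuming each source can export (InChIKey, record ID) pairs; the function name is hypothetical, and the record IDs are the ones that appear in the yohimbine URLs earlier in this talk, paired with its InChIKey for illustration.

```python
from collections import defaultdict


def link_by_inchikey(datasets):
    """Group record IDs from several sources under their shared InChIKey.

    datasets: iterable of (source_name, [(inchikey, record_id), ...]).
    Returns {inchikey: {source_name: [record_id, ...]}}.
    """
    linked = defaultdict(dict)
    for source, records in datasets:
        for inchikey, record_id in records:
            linked[inchikey].setdefault(source, []).append(record_id)
    return dict(linked)


KEY = "BLGXFZZNTVWLAY-SCYLSFHTSA-N"
linked = link_by_inchikey([
    ("ChemSpider", [(KEY, "8622")]),
    ("ChEBI", [(KEY, "10093")]),
    ("DrugBank", [(KEY, "DB01392")]),
])
print(linked[KEY])
```

No source needs to know about any other: as long as each one generates the identifier from its own structures, the links fall out of the shared key.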

But we are still VERY LIMITED

• RSC deals with way more than organics, inorganics, organometallics – we are building a data repository to include materials, polymers, ambiguous materials etc.

• There are many plans for InChI moving forward – Markush, polymers, organometallics etc.

The great promise should be obvious

• InChIs are here to stay
• They will evolve, they will encompass, we will adopt and adapt
• Public and private databases will federate & build a linked environment of validated data!
• Data validation and standardization are needed
• Open Data will continue to proliferate
• InChIs are in the “Semantic Web” already

If InChI never existed …

• ChemSpider would never have been built

• Database linking would suffer dramatically

• The web would not be “structure searchable”

• Cheminformatics tools would likely not be linking to public domain databases in the same way

Thank you

Email: [email protected]
ORCID: 0000-0002-2668-4821
Twitter: @ChemConnector
Personal Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams