open notebook science web services - acs spring 2011

55
Rapid dissemination of chemical information for people and machines using Open Notebook Science March 30, 2011 American Chemical Society Meeting Jean-Claude Bradley Department of Chemistry Drexel University Andrew Lang Department of Mathematics Oral Roberts University

Upload: jean-claude-bradley

Post on 18-Dec-2014

1.693 views

Category:

Education


4 download

DESCRIPTION

Jean-Claude Bradley presents on March 30, 2011 at the American Chemical Society on Rapid Dissemination of Chemical Information for people and machines using Open Notebook Science.

TRANSCRIPT

Page 1: Open Notebook Science Web Services - ACS Spring 2011

Rapid dissemination of chemical information for people and machines using

Open Notebook Science

March 30, 2011

American Chemical Society Meeting

Jean-Claude Bradley

Department of ChemistryDrexel University

Andrew Lang

Department of MathematicsOral Roberts University

Page 2: Open Notebook Science Web Services - ACS Spring 2011

Motivation: Faster Science, Better Science

Page 3: Open Notebook Science Web Services - ACS Spring 2011

There are NO FACTS, only measurements embedded

within assumptions

Open Notebook Science maintains the integrity of data

provenance by making assumptions explicit

Page 4: Open Notebook Science Web Services - ACS Spring 2011

TRUST

PROOF

Page 5: Open Notebook Science Web Services - ACS Spring 2011

First record then abstract structure

In order to be discoverable use Google friendly formats (simple HTML, no login)

In order to be replicable use free hosted tools (Wikispaces, Google Spreadsheets)

Strategy for an Open Notebook:

Page 6: Open Notebook Science Web Services - ACS Spring 2011

Crowdsourcing Solubility Data

Page 7: Open Notebook Science Web Services - ACS Spring 2011

ONS Challenge Judges

Page 8: Open Notebook Science Web Services - ACS Spring 2011

ONS Challenge Award Winners

Page 9: Open Notebook Science Web Services - ACS Spring 2011

Data provenance: From Wikipedia to…

Page 10: Open Notebook Science Web Services - ACS Spring 2011

…the lab notebook and raw data

Page 11: Open Notebook Science Web Services - ACS Spring 2011

Calculations Made Public on Google Spreadsheets

Page 12: Open Notebook Science Web Services - ACS Spring 2011

Interactive NMR spectra using JSpecView and JCAMP-DX

Page 13: Open Notebook Science Web Services - ACS Spring 2011

Raw Data As Images

Splatter?

Some liquid

Page 14: Open Notebook Science Web Services - ACS Spring 2011

YouTube for demonstrating experimental set-up

Page 15: Open Notebook Science Web Services - ACS Spring 2011

The importance of raw data availability

Missed in a prior publication on solubility

for this compound

Page 16: Open Notebook Science Web Services - ACS Spring 2011

Solubilities collected in a Google Spreadsheet

Page 17: Open Notebook Science Web Services - ACS Spring 2011

Rajarshi Guha’s Live Web Query using Google Viz API

Page 18: Open Notebook Science Web Services - ACS Spring 2011

Web services for summary data

(Andrew Lang)

Page 19: Open Notebook Science Web Services - ACS Spring 2011

Web service calls from within a Google Spreadsheet for solubility measurement and

prediction

(Andrew Lang)

Page 20: Open Notebook Science Web Services - ACS Spring 2011

Integration of Multiple Web Services to Recommend Solvents for Reactions

(Andrew Lang)

Page 21: Open Notebook Science Web Services - ACS Spring 2011
Page 22: Open Notebook Science Web Services - ACS Spring 2011
Page 23: Open Notebook Science Web Services - ACS Spring 2011
Page 24: Open Notebook Science Web Services - ACS Spring 2011

Reaction Attempts Book

Page 25: Open Notebook Science Web Services - ACS Spring 2011

Reaction Attempts Book: Reactants listed Alphabetically

Page 26: Open Notebook Science Web Services - ACS Spring 2011

ONS Challenge Solubility Book cited for nanotechnology application

Page 27: Open Notebook Science Web Services - ACS Spring 2011

Lulu.com Data Disks

Page 28: Open Notebook Science Web Services - ACS Spring 2011

Visualizing molecule-researcher connection maps reveals link between 2 Open Notebooks (Todd and

Bradley)

(Don Pellegrino)

Page 29: Open Notebook Science Web Services - ACS Spring 2011

The Intersection of Open Notebooks (Bradley/Todd) and IP implications

Open Notebook could have blocked patent if

done earlier

Page 30: Open Notebook Science Web Services - ACS Spring 2011

The Chemical Information Validation Sheet

567 curated and referenced measurements from Fall 2010 Chemical Information Retrieval course

Page 31: Open Notebook Science Web Services - ACS Spring 2011

The Chemical Information Validation Explorer

(Andrew Lang)

Page 32: Open Notebook Science Web Services - ACS Spring 2011

Discovering outliers for melting points (stdev/average)

Page 33: Open Notebook Science Web Services - ACS Spring 2011

Investigating the m.p. inconsistencies of EGCG

Page 34: Open Notebook Science Web Services - ACS Spring 2011

Investigating the m.p. inconsistencies of cyclohexanone

Page 35: Open Notebook Science Web Services - ACS Spring 2011

Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for melting

points

Page 36: Open Notebook Science Web Services - ACS Spring 2011

Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for boiling

points

Page 37: Open Notebook Science Web Services - ACS Spring 2011

Sigma-Aldrich, Acros and Wolfram Alpha apparently

DO NOT use the same sources for flash points

Page 38: Open Notebook Science Web Services - ACS Spring 2011

Most popular data sources

Page 39: Open Notebook Science Web Services - ACS Spring 2011

Alfa Aesar donates melting points to the public

Page 40: Open Notebook Science Web Services - ACS Spring 2011

Open Melting Point Explorer

Page 41: Open Notebook Science Web Services - ACS Spring 2011

Outliers

MDPI dataset

EPI (via ChemSpider)

Page 42: Open Notebook Science Web Services - ACS Spring 2011

Outliers

Alfa Aesar

Page 43: Open Notebook Science Web Services - ACS Spring 2011

Inconsistencies and SMILES problems within MDPI dataset

Page 44: Open Notebook Science Web Services - ACS Spring 2011

MDPI Dataset labeled with High Trust Level

Page 45: Open Notebook Science Web Services - ACS Spring 2011

Open Melting Point Datasets

Page 46: Open Notebook Science Web Services - ACS Spring 2011

Open Random Forest modeling of Open Melting Point data using CDK descriptors

(Andrew Lang)

R2 = 0.78, TPSA and nHdon most important

Page 47: Open Notebook Science Web Services - ACS Spring 2011

Melting point prediction service

Page 48: Open Notebook Science Web Services - ACS Spring 2011

Using melting point for temperature dependent solubility prediction

Page 49: Open Notebook Science Web Services - ACS Spring 2011

Decanoic acid

WaterNaCl

Page 50: Open Notebook Science Web Services - ACS Spring 2011

Phrase searching for useful solubility applications

Page 51: Open Notebook Science Web Services - ACS Spring 2011

The challenge of modeling inorganic solubility openly

Page 52: Open Notebook Science Web Services - ACS Spring 2011

All ONS web services

Page 53: Open Notebook Science Web Services - ACS Spring 2011

Dynamic links to private tagged Mendeley collections

(Andrew Lang)

Page 54: Open Notebook Science Web Services - ACS Spring 2011

For all Formats of ONS Projects

Page 55: Open Notebook Science Web Services - ACS Spring 2011

Conclusions

•Abstraction of ONS experimental data into a clear semantic format can be done quickly and easily and greatly leverages the utility of the data

•No information is lost as long as a link to the Open Notebook is captured