the epa icss chemistry dashboard to support compound identification using high resolution mass...

Post on 09-Jan-2017

176 Views

Category:

Science

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High

Resolution Mass Spectrometry Data

Antony J. Williams†, Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,

Jordan Foster and Jeff Edwards

National Center for Computational ToxicologyU.S. Environmental Protection Agency, RTP, NC

August 21-25, 2016ACS Fall Meeting, Philadelphia, PA

http://orcid.org/0000-0003-1423-330X

The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA

Who is NCCT?

• National Center for Computational Toxicology – part of EPA’s Office of Research and Development

• Research driven by EPA’s Chemical Safety for Sustainability Research Program– Develop new approaches to evaluate the safety of chemicals– Integrate advances in biology, biotechnology, chemistry, exposure

science and computer science

• Goal - To identify chemical exposures that may disrupt biological processes and cause adverse outcomes.

2

Our Dashboard Applications

• Some of our Web-based Applications

3

Introducing Our Latest Dashboard https://comptox.epa.gov

4

• >720,000 chemicals• >10 years assembling data

Bisphenol A

5

Physicochemical Properties

6

Bioassay Screening Data

7

Functional Use and Composition

8

Advanced MS Searches

9

Monoisotopic Mass Search

10

Monoisotopic Mass Search

11

Found 344 results for '215.096 ± 0.005 amu'

Formula Search

12

Formula Search

13

Found 8 results for 'C8H14ClN5'

Formula SearchingFormulae matching Bisphenol A

14

Formula Search Results

15

Download to Excel

16

Download as SDF file

17

SDF file downloaded to desktop

18

Rank-Ordering of “Known-Unknowns” using ChemSpider

19

Comparing Performance

20

721k structures

Does the Dashboard Add Value?

• Remember:– Focus on high quality data and curation– Data sources include EPA data sources and a focus on

environmental chemistry

• No “dilution” by chemical vendors

21

Dilution Example…Morphine Skeleton

22

Bisphenol A as an exampleChemSpider: 1564 Structures

23

Bisphenol A as an exampleDashboard: 215 Structures

24

Chemical Identification Dashboard vs ChemSpider

Sorted by number of references (ChemSpider) or data sources (Dashboard)

Monoisotopic Mass (+/- 0.005 amu) Search

Position of compound sorted

Source of List # of Compounds

Search Tool Mean Position

Median Position #1 #2 #3 #4 #5+

McEachran et al Wastewater

34 ChemSpider 1.8 1 28 5 0 0 1

Dashboard 1.3 1 31 2 0 0 1

Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1

Dashboard 1.7 1 10 2 0 0 1

Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1Dashboard 1.6 1 12 3 3 1 0

Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4 Dashboard 1.08 1 22 2 0 0 0

Dashboard vs ChemSpiderRanking Summary

Mass-based Searching Formula Based SearchingDashboard ChemSpider Dashboard ChemSpider

Cumulative Average Position 1.3 2.2 1.2 1.4% in #1 Position 85% 70% 88% 80%

• Selected peer-reviewed publications• 162 total individual chemicals in search

ChemSpider 6926 Results!!!

27

Tacedinaline

Methyl Red

C.I Disperse Yellow 3

Using Functional Use to Sort Candidates

28

Anti-cancer Drug

Microbiological Indicator Dye

Textile/Product Dye

Same top hits – different ranking90 hits only versus 6926 hits

29

18

17

4 Tacedinaline

Methyl Red

C.I Disperse Yellow 3

Dashboard: External Links to Analytical Methods

30

National Environmental Methods Index

31

RSC Analytical Abstracts

32

Integrated Google Chemical Searches

33

Google Chemical Searches Enhanced with Query Terms

34

Non-Targeted Analysis Research

- 1 Dust Sample- Negative Ionization Mode- 300 Extracted “Molecular Features”

1) Prioritize “Molecular Features”

2) Correctly assign formulas

3) Correctly assign structures

4) Determine chemical sources

5) Predict chemical concentrations

C17H19NO3 12 µg/g

(1)

(2) (3) (4) (5)

What is contained in house dust, waste streams etc???

Previous Work with Suspect-Screening

The dashboard is being enhanced to support Non-targeted Analysis

Future Work

• Presently researching rank-ordering based on other criteria – Pubmed

• Additional links to methods – CDC NIOSH• Links to Mass Spec databases – Thermo’s

mzCloud, Massbank. Metlin etc. • Consider predicting metabolites and

degradants• Searching based on “MS-ready” structures

37

“MS Ready” structures

• Many compounds are salts – searches should be on the “neutral form”

• Need to search for adducts (+Na, +K, +NH4), decarboxylation, loss of water etc.

38

Conclusions

• Dashboard support for MS is focused on NTA research – related to chemical exposure

• Dashboard outperforms ChemSpider for ranking chemicals of environmental concern

• New searches developed with Non-targeted Analysis in mind - new rank-ordering approaches in development

39

Acknowledgements

EPA NCCTChris GrulkeJeff EdwardsAnn RichardJordan FosterJennifer SmithAndrew McEachran*Michelle Krzyzanowski

EPA NERLJon Sobus

* = ORISE Participant

top related