Accessing Data to Support Pesticide Residue and Emerging Contaminant Analysis from US-EPA Online Dashboards

AntonyWilliams 468 views 82 slides Jul 15, 2024
Slide 1
Slide 1 of 82
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80
Slide 81
81
Slide 82
82

About This Presentation

In recent years, the growth of scientific data and the increasing need for data sharing and collaboration in the field of environmental chemistry has led to the creation of various software and databases that facilitate research and development into the safety and toxicity of chemicals. The US-EPA C...


Slide Content

The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA Accessing Data to Support Pesticide Residue and Emerging Contaminant Analysis from US-EPA Online Dashboards Antony Williams North American Chemical Residue Workshop: July 2024

US EPA: Office of Research and Development Office of Research and Development (ORD) is the research arm of EPA Public health and environmental assessment Computational toxicology, exposure & modeling I work for the Center for Computational Toxicology and Exposure in the Computational Chemistry and Cheminformatics Branch

Free-Access Cheminformatics Tools The Center for Computational Toxicology and Exposure has delivered many tools including CompTox Chemicals Dashboard (primary tool from the center) Proof-of-Concept cheminformatics modules Chemicals Hazard Profiling Chemical Transformations Database Analytical Methods and Spectra Chemical Safety Profiling 2

Research Projects we apply them to

4 Research Projects we apply them to

Research Projects we apply them to

6 Research Projects we apply them to

Curating Chemistry into the DSSTox Database 7 Chemistry underpins all of our tools Data assembly and curation is critical DSSTox assembled over 25 years

CompTox Chemicals Dashboard https://comptox.epa.gov/dashboard/

The Charge for the Dashboard Develop a “first-stop-shop” for environmental chemical data to support EPA and partner decision making: Centralized location for relevant chemical data Chemistry, exposure, hazard and dosimetry Combination of existing data and predictive models Publicly accessible, periodically updated, curated Easy access to data improves efficiency and ultimately a ccelerates chemical risk assessment

Detailed Chemical Pages

Experimental and Predicted Data Physchem and Fate & Transport experimental and predicted data Data can be downloaded as Excel, TSV and CSV files

Chemical Hazard Data

Hazard Data for Atrazine 3723 rows of human/eco hazard data harvested with 3 clicks

Safety Data

Sources of Exposure to Chemicals

Chemical Lists (not all lists are created equal…)

Chemical Lists Chemical lists are focused on regulations, specific research efforts and categories 501 lists and growing TSCA Inventory Clean Water Act Hazardous Substances Consumer Products database Chemicals of Emerging Concern PFAS lists Extractables and Leachables …lists are versioned and updated and new lists added 17

Remember those Research Projects?

Some Research Projects…

Some Research Projects…

Some Research Projects…

What about Pesticides? 16 out of 501 lists 22

EPA OPPIN list of >4000 chemicals 23

Harvesting Data en masse How easy is it to harvest data for 4000 pesticides Physicochemical properties Fate and transport Toxicity values Bioactivity data in 100s of in vitro data Exposure data Chemical identifiers Links to regulatory assessments

Batch Searching

Batch Searching is a big enabler https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273 26

Batch Search

Batch Search – Excel, CSV, SDF file

Batch Search

Filter chemicals and send to batch search 30

There is so much more… In vitro bioactivity data exposing our ToxCast/Tox21 data Exposure data with functional use information and chemical weight fractions in products Real-time QSAR predictions – draw your chemicals to predict 16 toxicity and physicochemical property endpoints Integration to realtime literature searches using Google Scholar and Pubmed Linking to >40 public domain databases 31

So now you know the Dashboard… 32

Lots of “proof-of-concept” tools in development PoCs are research software builds to prove approaches before moving into production software environments PoCs are to figure out how to address specific questions Assemble data, develop data model(s), test user interface approaches , work with test user base to garner feedback Since PoCs are internal access data refreshes and application updates can be more Underlying APIs are being used in our research 33

PoCs have been rebuilt for production Examples of PoCs integrated into production apps WebTEST predictions on the Dashboard Structure/substructure/similarity search 34

How to compare Hazard Data? 35

How to compare Hazard Data? NOT Easy to interpret… 36

Hazard Profile 37 Hazard Comparison module profiles toxicity across chemicals https://www.epa.gov/chemical-research/cheminformatics

Hazard Profile On-click view of underlying data 38

Data to Excel in <60s 39

Simple Analog “read-across” Suppose a chemical has limited data – perform an analog search to find related chemicals with data 40

Simple Analog “read-across” Similarity 41

Hazard Comparison Profiling 42

Hazard Comparison Profiling

Where can our tools be applied Emergency Response utility is obvious… Consider East Palestine 44 https://www.cleveland19.com/2023/02/14/ntsb-announces-preliminary-malfunction-that-caused-east-palestine-train-derailment/ POLYPROPYLENE POLYETHYLENE Residue lube oil VINYL CHLORIDE DIPROPYLENE GLYCOL PROPYLENE GLYCOL DIETHYLENE GLYCOL COMBUSTIBLE LIQ., NOS (ETHYLENE GLYCOL MONOBUTYL ETHER) SEMOLINA COMBUSTIBLE LIQ., NOS (ETHYLHEXYL ACRYLATE) POLYVINYL PETROLEUM LUBEOIL POLYPROPYL GLYCOL ISOBUTYLENE BUTYL ACRYLATES, STABILIZED PETRO OIL, NEC ADDITIVES, FUEL BALLS,CTN,M EDCL SHEET STEEL VEGTABLE, FROZEN BENZENE PARAFFIN WAX FLAKES, POWDER HYDRAULIC CEMENT AUTOS PASSENGER MALT LIQUORS

Cheminformatics Safety Module (NOT PUBLIC) Integrate multiple data streams… 45

EPA Measurement Data 46 Measurement data are needed to ensure chemical safety Characterize risk Regulate use & disposal Manage human & ecological exposures Ensure compliance under federal statutes Chemical Monitoring Needs Exposure Assessment Dose- Response Assessment Risk Characterization Hazard Identification

Applications of Exposomics at EPA Ongoing efforts applying NTA to exposomics challenges including PFAS identification Pesticides in various matrices CECs in water Biosolids Examples include… 47

Example 1: Consumer Product Analysis 48

Example 2: Recycled Product Analysis 49

Example 3: Placental Tissue Analysis 50

Applications of Exposomics at EPA Ongoing efforts applying NTA to exposomics challenges including PFAS identification Pesticides in various matrices CECs in water Biosolids Cheminformatics is a key component of NTA analysis Structure standardization (MS-Ready structure forms) Predictive models (LCMS amenability, retention time prediction) in silico mass spectrometry prediction Chemical Space Mapping Chemical Transformation database Analytical Methods and Open Spectral database 51

AMOS: Analytical Methods and Open Spectra (NOT PUBLIC yet) Simple Vision: I want to find the best method(s) associated with a chemical and/or class of chemicals Answer the question “I cannot find a method for my chemical” - HELP The Approach: Aggregate MS method documents (and adjust the definition of “what is a useful method”) Extract chemistry (mostly CASRN and Names) Map CASRN and Names to structures Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys and ultimately structure 52

AMOS: Analytical Methods and Open Spectra (NOT PUBLIC yet) Three types of data in the database: Methods (regulatory, lab manuals and SOPs, publications, tech notes) Spectra (from public domain and our own laboratories) Monographs (harvested from SWGDRUG and other sites) Some methods have associated spectra Some data are just externally linked Currently contains around 673k spectra, 600k external links, >27k “Fact Sheets” and >5600 methods Spectra – LC-MS, GC-MS, NMR ALL data are growing in number with weekly releases 53

AMOS database

General Search – Mass Spectra 55

General Search – Mass Spectrum Metadata 56

345 Pesticide Methods for 2705 substances 57

Functional Use Classifications – Directed Graph Visualization 58

…to find methods for those chemicals 59

Mass Spectral Searching 60

Method Search based on chemical similarity 61

Batch Search 62 Search a set of DTXSIDs, download info on spectra and methods and links to original data

Chemical Classification Search: ClassyFire 63

Adding new methods We add methods based on agency needs and feedback Yesterday’s training “What do you have for Milneb”? We search PubMed, Google Scholar, application notes from vendors, government agencies (Domestic and International) 64

Added overnight 65

How do we do it? 66

9 of 10 chemicals found – no “ Manneb ” 67

Chemical Transformation Simulator Database 68

ChET : Chemical Transformations Database

ChET Reaction Map Lists 70

ChET Visual Reaction Maps Compare and overlap maps Load all maps containing a particular chemical Prune and filter maps 71

Our cheminformatics work supports the “NTA WebApp” 72

Our cheminformatics work supports the “NTA WebApp” 73

Full presentation https://t.ly/4MxFe 74

The CompTox API is now public https://api-ccte.epa.gov/docs/index.html 75

Conclusions Underpinning chemistry data is from the DSSTox database CompTox Chemicals Dashboard is public access to DSSTox and other related databases Proof-of-Concept (PoC) tools are built to prove approaches Everything is increasingly API driven and APIs are now public 76

Some Related Publications of Interest

You want to know more… Lots of resources available Presentations: https://tinyurl.com/w5hqs55 Communities of Practice Videos: https://rb.gy/qsbno1 Manual: https://rb.gy/4fgydc Latest News: https://comptox.epa.gov/dashboard/news_info 78

This talk is an overview This talk is a high-level overview only. We can provide trainings into the individual modules and data as required LOTS of training materials are available https://www.epa.gov/chemical-research/new-approach-methods-nams-training

Acknowledgments Our DSSTox curation team SCDCD software development and DevOps teams Scientists and students across CCTE Non-targeted analysis and mass spectrometry team Dashboard project team – Nisha Sipes & Phuc Do Cheminformatics Modules and Modeling Team – Valery Tkachenko, Todd Martin, Nate Charest, Charlie Lowe ChET – Adam Edelman-Munoz, Caroline Stevens and team ChemSTER – Nate Charest and Adam Edelman-Munoz External collaborators and contributors of data 80

Contact Information Contact info: [email protected] Slides available at: https://www.slideshare.net/AntonyWilliams/ Obtain articles from Google Scholar Profile 81