Accessing data to support pesticide residue and emerging contaminant analysis from US-EPA online dashboards

AntonyWilliams 262 views 57 slides Aug 21, 2024
Slide 1
Slide 1 of 57
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57

About This Presentation

The US-EPA Center for Computational Toxicology and Exposure has been developing software and databases that serve the chemistry community for many years. This presentation will focus on several web-based software applications which have been developed at the USEPA and made available to the community...


Slide Content

The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA Accessing Data to Support Pesticide Residue and Emerging Contaminant Analysis from US-EPA Online Dashboards Antony Williams ACS National Meeting, Denver, CO: August 2024

US EPA: Office of Research and Development Office of Research and Development (ORD) is the research arm of EPA Public health and environmental assessment Computational toxicology, exposure & modeling I work for the Center for Computational Toxicology and Exposure in the Computational Chemistry and Cheminformatics Branch

Free-Access Cheminformatics Tools The Center for Computational Toxicology and Exposure has delivered many tools including CompTox Chemicals Dashboard (primary tool from the center) Proof-of-Concept cheminformatics modules Chemicals Hazard Profiling Chemical Transformations Database Analytical Methods and Spectra Chemical Safety Profiling 2

Curating Chemistry into the DSSTox Database 3 Chemistry underpins all of our tools Data assembly and curation is critical DSSTox assembled over 25 years

CompTox Chemicals Dashboard https://comptox.epa.gov/dashboard/

The Charge for the Dashboard Develop a “first-stop-shop” for environmental chemical data to support EPA and partner decision making: Centralized location for relevant chemical data Chemistry, exposure, hazard and dosimetry Combination of existing data and predictive models Publicly accessible, periodically updated, curated Easy access to data improves efficiency and ultimately a ccelerates chemical risk assessment

Detailed Chemical Pages

Experimental and Predicted Data Physchem and Fate & Transport experimental and predicted data Data can be downloaded as Excel, TSV and CSV files

Chemical Hazard Data

Safety Data

Sources of Exposure to Chemicals

Chemical Lists (not all lists are created equal…)

Chemical Lists Chemical lists are focused on regulations, specific research efforts and categories 501 lists and growing TSCA Inventory Clean Water Act Hazardous Substances Consumer Products database Chemicals of Emerging Concern PFAS lists Extractables and Leachables …lists are versioned and updated and new lists added 12

What about Pesticides? 16 out of 501 lists 13

EPA OPPIN list of >4000 chemicals 14

Harvesting Data en masse How easy is it to harvest data for 4000 pesticides Physicochemical properties Fate and transport Toxicity values Bioactivity data in 100s of in vitro data Exposure data Chemical identifiers Links to regulatory assessments

Batch Searching

Batch Searching is a big enabler https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273 17

Batch Search

Batch Search – Excel, CSV, SDF file

Batch Search

Filter chemicals and send to batch search 21

There is so much more… In vitro bioactivity data exposing our ToxCast/Tox21 data Exposure data with functional use information and chemical weight fractions in products Real-time QSAR predictions – draw your chemicals to predict 16 toxicity and physicochemical property endpoints Integration to realtime literature searches using Google Scholar and Pubmed Linking to >40 public domain databases 22

So now you know the Dashboard… 23

Lots of “proof-of-concept” tools in development PoCs are research software builds to prove approaches before moving into production software environments PoCs are to figure out how to address specific questions Assemble data, develop data model(s), test user interface approaches , work with test user base to garner feedback Since PoCs are internal access data refreshes and application updates can be more Underlying APIs are being used in our research 24

PoCs have been rebuilt for production Examples of PoCs integrated into production apps WebTEST predictions on the Dashboard Structure/substructure/similarity search 25

How to compare Hazard Data? 26

How to compare Hazard Data? NOT Easy to interpret… 27

Hazard Profile 28 Hazard Comparison module profiles toxicity across chemicals https://www.epa.gov/chemical-research/cheminformatics

Hazard Profile On-click view of underlying data 29

Data to Excel in <60s 30

Simple Analog “read-across” Suppose a chemical has limited data – perform an analog search to find related chemicals with data 31

Simple Analog “read-across” Similarity 32

Hazard Comparison Profiling 33

Hazard Comparison Profiling

Where can our tools be applied Emergency Response utility is obvious… Consider East Palestine 35 https://www.cleveland19.com/2023/02/14/ntsb-announces-preliminary-malfunction-that-caused-east-palestine-train-derailment/ POLYPROPYLENE POLYETHYLENE Residue lube oil VINYL CHLORIDE DIPROPYLENE GLYCOL PROPYLENE GLYCOL DIETHYLENE GLYCOL COMBUSTIBLE LIQ., NOS (ETHYLENE GLYCOL MONOBUTYL ETHER) SEMOLINA COMBUSTIBLE LIQ., NOS (ETHYLHEXYL ACRYLATE) POLYVINYL PETROLEUM LUBEOIL POLYPROPYL GLYCOL ISOBUTYLENE BUTYL ACRYLATES, STABILIZED PETRO OIL, NEC ADDITIVES, FUEL BALLS,CTN,M EDCL SHEET STEEL VEGTABLE, FROZEN BENZENE PARAFFIN WAX FLAKES, POWDER HYDRAULIC CEMENT AUTOS PASSENGER MALT LIQUORS

Cheminformatics Safety Module (NOT PUBLIC) Integrate multiple data streams… 36

EPA Measurement Data 37 Measurement data are needed to ensure chemical safety Characterize risk Regulate use & disposal Manage human & ecological exposures Ensure compliance under federal statutes Chemical Monitoring Needs Exposure Assessment Dose- Response Assessment Risk Characterization Hazard Identification

Applications of Exposomics at EPA Ongoing efforts applying NTA to exposomics challenges including PFAS identification Pesticides in various matrices CECs in water Biosolids Examples include… 38

Example 1: Consumer Product Analysis 39

Example 2: Recycled Product Analysis 40

Example 3: Placental Tissue Analysis 41

Applications of Exposomics at EPA Ongoing efforts applying NTA to exposomics challenges including PFAS identification Pesticides in various matrices CECs in water Biosolids Cheminformatics is a key component of NTA analysis Structure standardization (MS-Ready structure forms) Predictive models (LCMS amenability, retention time prediction) in silico mass spectrometry prediction Chemical Space Mapping Chemical Transformation database Analytical Methods and Open Spectral database 42

AMOS: Analytical Methods and Open Spectra (NOT PUBLIC yet) Simple Vision: I want to find the best method(s) associated with a chemical and/or class of chemicals Answer the question “I cannot find a method for my chemical” - HELP The Approach: Aggregate MS method documents (and adjust the definition of “what is a useful method”) Extract chemistry (mostly CASRN and Names) Map CASRN and Names to structures Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys and ultimately structure 43

AMOS: Analytical Methods and Open Spectra (NOT PUBLIC yet) Three types of data in the database: Methods (regulatory, lab manuals and SOPs, publications, tech notes) Spectra (from public domain and our own laboratories) Monographs (harvested from SWGDRUG and other sites) Some methods have associated spectra Some data are just externally linked Currently contains around 673k spectra, 600k external links, >27k “Fact Sheets” and >5600 methods Spectra – LC-MS, GC-MS, NMR ALL data are growing in number with weekly releases 44

AMOS database

General Search – Mass Spectra 46

General Search – Mass Spectrum Metadata 47

345 Pesticide Methods for 2705 substances 48

Functional Use Classifications – Directed Graph Visualization 49

…to find methods for those chemicals 50

Chemical Transformation Simulator Database 51

ChET : Chemical Transformations Database

ChET Reaction Map Lists 53

ChET Visual Reaction Maps Compare and overlap maps Load all maps containing a particular chemical Prune and filter maps 54

Conclusions Underpinning chemistry data is from the DSSTox database CompTox Chemicals Dashboard is public access to DSSTox and other related databases Proof-of-Concept (PoC) tools are built to prove approaches Everything is increasingly API driven and APIs are now public 55

Contact Information Contact info: [email protected] Slides available at: https://www.slideshare.net/AntonyWilliams/ Obtain articles from Google Scholar Profile 56