PubChem as a resource for chemical information education

SunghwanKim95 383 views 56 slides Aug 11, 2020
Slide 1
Slide 1 of 56
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56

About This Presentation

Presented at the Fall 2020 American Chemical Society (ACS) National Meeting (Virtual) on August 20, 2020.

Sunghwan Kim & Evan Bolton
National Library of Medicine, National Institutes of Health, Rockville, Maryland, United States


==== Abstract ====

PubChem (https://pubchem.ncbi.nlm.nih.gov) i...


Slide Content

PubChem as a Resource for Chemical Information Education ACS Fall 2020 Virtual Meeting August 20, 2020 Sunghwan Kim, Ph.D., M.Sc.

PubChem ( https://pubchem.ncbi.nlm.nih.gov ) Public chemical database. Developed and maintained by the U.S. National Institutes of Health. Contains various chemical entities: Small molecules siRNAs & miRNAs Carbohydrates Lipids Peptides Chemically modified macromolecules ……

PubChem ( https://pubchem.ncbi.nlm.nih.gov ) Collects chemical information from 750+ data sources and disseminates it to the public free of charge. 103 million unique chemical structures. Crosslinks to many other databases. Search, analysis, download and visualization tools. A key resource in many areas: Cheminformatics Chemical biology Medicinal chemistry Drug discovery

PubChem Usage Statistics 2016 2017 2018 >4.3 million unique users per month (Apr. 2020) 2019 2020 Source: Google Analytics

Top 5 Chemistry Websites acs.org rsc.org sigmaaldrich.com pubchem.ncbi.nlm.nih.gov cas.org Source: https://www.alexa.com/topsites/category/Top/Science/Chemistry PubChem is the only public website among them. PubChem Usage Statistics

~36% of PubChem users are between 18-24 . (likely to be college students) PubChem Usage Statistics

PubChem as an online resource for chemical education Popularity: Many young people are already using PubChem. Sustainability: It is sixteen years old and not going away soon. Zero-cost (to students): U.S. taxpayers have already paid for it.

Popularity: Many young people are already using PubChem. Sustainability: It is sixteen years old and not going away soon. Zero-cost (to students): U.S. taxpayers have already paid for it. A strong potential as an education resource, especially for small organizations like: primarily undergraduate institutions (PUIs) community colleges (CCs) PubChem as an online resource for chemical education

How about R1 universities with large endowments? Likely to have access to proprietary databases. Primarily used for research. Inconvenient off-campus access. Students will lose access when they graduate. Most students will eventually rely on public resources.  Need for training/education opportunities while in school. PubChem as an online resource for chemical education

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

He Ne Ar Kr Xe Rn Li Na K Rb Cs Fr Kim et al., Chem. Teacher International , 2020. doi: 10.1515/cti-2020-0006

Exploring Chemical Information in PubChem Search by chemical name Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Why should students learn programmatic access? PubChem users have very diverse backgrounds/interests. PubChem’s web interfaces are optimized to perform commonly requested tasks interactively. Everything you can do with PubChem through the web browser can be automated through PubChem’s programmatic interfaces . Programmatic access enables one to do much more complicated and specialized tasks that cannot be done through the web browser.

Why should students learn programmatic access? Programming skills are essential for: automating routine tasks and processing/analyzing a large data set  Important skills for students pursuing STEM careers in the age of big data.

Programmatic Access to PubChem Multiple programmatic access routes. Two major programmatic access methods. PUG-REST (primarily for computed properties). Kim et al., Nucleic Acids Res. 2018, 46(W1):W563-570. https://pubchemdocs.ncbi.nlm.nih.gov/pug-rest PUG-View (primarily for text information). Kim et al., J. Cheminform . 2019, 11:56. https://pubchemdocs.ncbi.nlm.nih.gov/pug-view Jupyter Notebooks containing sample codes (in python/R) are freely available at LibreTexts: https://chem.libretexts.org/link?143689

Cheminformatics On-Line Chemistry Course (OLCC) Kim et al., J. Chem. Educ. , 2020, submitted.

Cheminformatics OLCC Unique challenges to teaching cheminformatics Cheminformatics is not an established chemistry field.  Chemistry + Informatics + Computer Science + Library Science + Pharmaceutical Science + …… Not so many faculty members with Cheminformatics expertise. No textbook suitable for undergraduate chemistry students.

The Cheminformatics OLCC addresses these issues ! Cheminformatics OLCC Unique challenges to teaching cheminformatics Cheminformatics is not an established chemistry field.  Chemistry + Informatics + Computer Science + Library Science + Pharmaceutical Science + …… Not so many faculty members with Cheminformatics expertise. No textbook suitable for undergraduate chemistry students.

Course website Cheminformatics experts Prepare online reading materials & homework problem sets Cheminformatics OLCC

Course website Cheminformatics experts Prepare online reading materials & homework problem sets Course Instructor Students Run the course using the course materials at multiple schools Cheminformatics OLCC

Course website Cheminformatics experts Prepare online reading materials & homework problem sets Course Instructor Students Run the course using the course materials at multiple schools Face-to-face meeting Online discussion among experts, instructors, & students through the website Cheminformatics OLCC

It was offered three times: Fall 2015: 36 students from 4 schools Spring 2017: 47 students from 9 schools Fall 2019: 23 students from 5 schools All course materials are available at: CCCE website ( http://olcc.ccce.divched.org ) LibreTexts ( https://libretexts.org ) (free online textbook site) Many of the course materials cover PubChem data, tools and services . Cheminformatics OLCC

PubChem-related topics in Cheminformatics OLCC Critical assessment of chemical information Chemical representations (e.g., InChI and SMILES) As alternatives to chemical name queries For chemical data exchange/integration/sharing Search by chemical name Search by chemical structure Identity search 2-D/3-D similarity search Substructure/superstructure search Molecular formula search Structure clustering and structure-activity relationship analysis Automation of chemical data retrieval through a computer code Cheminformatics OLCC

Many PubChem users are likely to be college students. Summary PubChem has a strong potential as a resource for chemical information training because of its: popularity sustainability low cost

Summary PubChem supports various use cases beyond simple chemical name search. Search by chemical structure Search by gene/protein name PubChem Periodic Table and Element pages Programmatic access

Summary PubChem works with the chemical education community to provide chemical information training for students. Please reach out to us for collaboration if you are interested.

Acknowledgements Evan Bolton Jie Chen Tiejun Cheng Asta Gindulyte Jia He Siqian He Qingliang (Leon) Li Benjamin Shoemaker Thiessen Paul Olga Pujolras Bo Yu Leonid Zaslavsky Jian (Jeff) Zhang Zhi (Leon ) Sun The PubChem Team PubChem users, depositors, and collaborators Funded by the National Library of Medicine

Thank you! Questions? Sunghwan Kim, Ph.D., M.Sc. Email: [email protected] SlideShare: https://www.slideshare.net/SunghwanKim95/presentations