Powered by OpenAIRE graph
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite

This Research product is the result of merged Research products in OpenAIRE.

PubChemLite for Exposomics + predicted CCS from CCSbase - 5 Sept. 2025

Authors: Schymanski, Emma; Kondic, Todor; Elapavalore, Anjana; Bolton, Evan; Thiessen, Paul; Zhang, Jian; Kim, Sunghwan; +3 Authors

Abstract

PubChemLite is a subset of PubChem (https://pubchem.ncbi.nlm.nih.gov/) selected from major categories of the Table of Contents (TOC) page at the PubChem Classification Browser (https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72). This version of PubChemLite for Exposomics has predicted collision cross section (CCS) values for 11 adducts, provided by Libin Xu and team at CCSbase (https://ccsbase.net/) and calculated from the latest c3sdb code.

PubChemLite for Exposomics is compiled from 11 categories: AgroChemInfo, BioPathway, DrugMedicInfo, FoodRelated, PharmacoInfo, SafetyInfo, ToxicityInfo, KnownUse, DisorderDisease, Identification, ChemClass.

CCS adducts provided are: [M+H]+, [M-H]-, [M+Na]+, [M+K]+, [M+NH4]+, [M+H-H2O]+, [M+HCOO]-, [M+CH3COO]-, [M+Na-2H]-, [M]+, [M]-.

Details on the CCS prediction are given in Ross et al. (2020) Analytical Chemistry, DOI: 10.1021/acs.analchem.9b05772. PubChemLite is described in Schymanski et al. (2021) J. Cheminformatics, DOI: 10.1186/s13321-021-00489-0. An article describing these joint efforts is available: Elapavalore et al. (2025) ES&T Letters, DOI: 10.1021/acs.estlett.4c01003.

PubChem CIDs have been collapsed by InChIKey first block, reporting the structure from the most annotated CID, plus related CIDs. Entries that will be ignored by MetFrag (salts, disconnected substances) or cause errors (e.g. transition metals) have been removed. The Patent and PubMed ID counts are extracted from files on the PubChem FTP site. The "AnnoTypeCount" term counts how many of the categories are represented; each subsequent column (named per category) counts the number of annotation categories available in the next sub-category of the TOC entry.

These files can be used "as is" as localCSV for MetFrag Command Line (https://ipb-halle.github.io/MetFrag/). Please do NOT upload these files directly to the web interface: they are too large and will be available in a drop-down menu. Further details are described in Schymanski et al. (2021) DOI: 10.1186/s13321-021-00489-0 and Elapavalore et al. (2025) DOI: 10.1021/acs.estlett.4c01003.

NOTE: The latest PubChemLite for Exposomics version can be downloaded at DOI: 10.5281/zenodo.5995885 (currently updating monthly). This file will be updated shortly after. Please cite this data source and Elapavalore et al. (2025) DOI: 10.1021/acs.estlett.4c01003 when using this dataset.
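The collapsing and counting steps described in the abstract can be illustrated with a minimal Python sketch. It is not the code used to build PubChemLite: the record keys ("cid", "inchikey", "related_cids" and so on), the tie-breaking rule, and the reading of "most annotated" as "highest number of represented categories" are all assumptions made for illustration. Only the 11 category names are taken from the abstract.

```python
from collections import defaultdict

# The 11 category names are from the abstract; using them as dict keys
# is an assumption of this sketch, not the file's documented layout.
CATEGORIES = [
    "AgroChemInfo", "BioPathway", "DrugMedicInfo", "FoodRelated",
    "PharmacoInfo", "SafetyInfo", "ToxicityInfo", "KnownUse",
    "DisorderDisease", "Identification", "ChemClass",
]


def anno_type_count(record):
    """Count how many of the categories have at least one annotation."""
    return sum(1 for name in CATEGORIES if record.get(name, 0) > 0)


def collapse_by_first_block(records):
    """Keep one record per InChIKey first block.

    Each record is a dict with hypothetical keys "cid" and "inchikey"
    plus optional per-category counts. The remaining CIDs that share
    the first block are listed under "related_cids".
    """
    groups = defaultdict(list)
    for record in records:
        # The first block is the part of the InChIKey before the first hyphen.
        first_block = record["inchikey"].split("-")[0]
        groups[first_block].append(record)

    collapsed = []
    for first_block, members in groups.items():
        # Assumption: "most annotated" = most categories represented;
        # ties go to the lowest CID.
        best = max(members, key=lambda r: (anno_type_count(r), -r["cid"]))
        entry = dict(best)
        entry["first_block"] = first_block
        entry["anno_type_count"] = anno_type_count(best)
        entry["related_cids"] = sorted(
            r["cid"] for r in members if r is not best
        )
        collapsed.append(entry)
    return collapsed
```

Grouping on the first block merges entries that share the same connectivity but differ in the later InChIKey blocks, which is why the related CIDs are kept alongside the reported structure.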

Please cite this data source, the CCSbase and PubChemLite papers when using this data! More details under DOI: 10.1021/acs.estlett.4c01003
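As a rough illustration of how a file with masses and predicted CCS values might be queried, the sketch below selects candidates within a ppm window around a measured m/z and, where a measured CCS is supplied, reports the relative deviation of the predicted value. This is not MetFrag's implementation. The row keys ("cid", "mono_mass", "ccs") are hypothetical and do not reflect the actual column names of the files, and only two of the 11 adducts are handled to keep the example short.

```python
PROTON_MASS = 1.007276  # Da, used to convert [M+H]+ and [M-H]- to neutral mass


def neutral_mass_from_mz(mz, adduct):
    """Convert a measured m/z to a neutral monoisotopic mass."""
    if adduct == "[M+H]+":
        return mz - PROTON_MASS
    if adduct == "[M-H]-":
        return mz + PROTON_MASS
    raise ValueError(f"adduct not covered by this sketch: {adduct}")


def find_candidates(rows, mz, adduct, ppm=5.0, measured_ccs=None):
    """Return rows whose neutral mass lies within +/- ppm of the target.

    Each row is a dict with hypothetical keys "cid", "mono_mass" and
    "ccs" (a dict mapping adduct string to predicted CCS).
    """
    target = neutral_mass_from_mz(mz, adduct)
    tolerance = target * ppm / 1e6
    hits = []
    for row in rows:
        if abs(row["mono_mass"] - target) > tolerance:
            continue
        hit = {"cid": row["cid"], "mono_mass": row["mono_mass"]}
        predicted = row.get("ccs", {}).get(adduct)
        if measured_ccs is not None and predicted is not None:
            # Signed relative deviation of predicted from measured CCS, in percent.
            hit["ccs_error_pct"] = 100.0 * (predicted - measured_ccs) / measured_ccs
        hits.append(hit)
    return hits
```

A call such as find_candidates(rows, mz=195.08765, adduct="[M+H]+", measured_ccs=140.0) would then return the matching rows, each with a "ccs_error_pct" field where a prediction for that adduct exists.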

  • BIP!
    Impact by BIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average