
A 400-record sample of the USDA Dr. Duke's Phytochemical and Ethnobotanical Database, denormalized into a flat 8-column schema and enriched with quantitative signals from four sources: - pubmed_mentions_2026: PubMed publication count per compound (NCBI E-utilities)- clinical_trials_count_2026: ClinicalTrials.gov v2 study count per compound- chembl_bioactivity_count: ChEMBL v35 bioassay data points (CC BY-SA 3.0)- patent_count_since_2020: USPTO patents since 2020-01-01 (PatentsView REST API) Schema: chemical, plant_species, application, dosage, pubmed_mentions_2026, clinical_trials_count_2026, chembl_bioactivity_count, patent_count_since_2020 Records: 400 (top compounds by PubMed mentions)Total dataset: 76,907 records across 24,746 compounds and 2,313 species.Full dataset: https://ethno-api.com Formats: JSON (16 MB) + Parquet (800 KB, Snappy compression).Methodology: https://github.com/wirthal1990-tech/USDA-Phytochemical-Database-JSON/blob/main/METHODOLOGY.md
ethnobotany, PubMed, natural products, JSON, ChEMBL, USPTO, ClinicalTrials, phytochemical, Parquet, USDA, drug discovery
ethnobotany, PubMed, natural products, JSON, ChEMBL, USPTO, ClinicalTrials, phytochemical, Parquet, USDA, drug discovery
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
