
This dataset contains patent-to-paper citations through 2022 as well as patent-paper pairs (through 2021).

If you use the data, please cite these two articles:

1. M. Marx & A. Fuegi, "Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations." forthcoming in Journal of Economics and Management Strategy. (http://doi.org/10.1111/jems.12455)
2. M. Marx & A. Fuegi, "Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles" (2020), Strategic Management Journal 41(9):1572-1594. (https://onlinelibrary.wiley.com/doi/full/10.1002/smj.3145)

The datafile containing the citations is _pcs_oa.csv. Each citation has the applicant/examiner flag, confidence score (1-10), whether the reference was a) only on the front page, b) only in the body text, or c) in both, and an indicator for a self-citation (i.e., one of the authors is an inventor on the patent). There are two "shorthand" files, _pcs_countsbypatent.csv and _pcs_countsbypaper.csv, which collapse these to the paper and patent level by citation type.

The datafile containing the patent-paper pairs (PPPs) is _patent_paper_pairs.tsv. These are USPTO only, through 2021. Each PPP has a confidence score and the count of days between the publication of the paper and the filing of the patent. (If the patent is a continuation of another patent, the filing date of the original patent is used.) Also, when a paper is paired with multiple patents, an indicator variable reports whether those patents are continuations or otherwise identical.

(The redistribution of OpenAlex is temporarily removed, but we hope to re-add it soon.)

The above is documented in greater detail in __reliance_on_science.pdf.

These data are provided under a Creative Commons Attribution Non-Commercial license. Please contact us regarding commercial use. Questions & feedback to support@relianceonscience.org.

This work is sponsored by the Alfred P. Sloan Foundation grant #G-2021-16822.
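
As a rough illustration of how the citation-level file relates to the per-patent "shorthand" counts described above, the sketch below collapses citation rows to the patent level by citation type, optionally keeping only citations at or above a confidence threshold. The column names (patent, paper_id, wherefound, confscore) and the sample rows are hypothetical placeholders made up for this example, not the actual schema or contents of _pcs_oa.csv; consult __reliance_on_science.pdf for the real field names.

```python
import csv
import io
from collections import Counter, defaultdict

# Made-up rows with hypothetical column names, for illustration only.
# The real _pcs_oa.csv schema is documented in __reliance_on_science.pdf.
SAMPLE = """patent,paper_id,wherefound,confscore
US-1000001,W1,frontonly,10
US-1000001,W2,bodyonly,4
US-1000001,W3,both,9
US-1000002,W1,frontonly,8
"""


def counts_by_patent(rows, min_conf=1):
    """Count citations per patent, split by where the citation was found.

    Rows whose confidence score is below min_conf are skipped.
    """
    counts = defaultdict(Counter)
    for row in rows:
        if int(row["confscore"]) >= min_conf:
            counts[row["patent"]][row["wherefound"]] += 1
    return counts


# Read the in-memory sample; for a real file, pass an open file handle
# to csv.DictReader instead of io.StringIO.
rows = list(csv.DictReader(io.StringIO(SAMPLE)))
by_patent = counts_by_patent(rows, min_conf=5)

for patent, by_type in sorted(by_patent.items()):
    print(patent, dict(by_type))
```

Swapping the grouping key from the patent column to the paper column would give the analogous per-paper counts. Reading rows one at a time with csv.DictReader, rather than loading the whole file into memory, is a reasonable choice if the citation file is large.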
innovation, patenting, science, citation
