
This dataset is a modification of a snapshot taken on August 15, 2025, from the Wikicite subclass hierarchy Google spreadsheet, which contained mostly machine-generated content. Column headers were rewritten for clarity. This table contains results of queries in the Wikidata Query Service at some unknown point previous to the Wikidata graph split, which occurred May 9, 2025. It shows a list of P31 ("instance of") values that mark items considered "scholarly" for the purposes of the split. Items that have one of these items as values for the P31 property have all their triples directed to the graph queryable in the alternative endpoint (https://query-scholarly.wikidata.org). The table also shows other entities considered, but not included, in the final rule for the split. The presence of sitelinks (links from Wikidata to other Wikimedia projects, such as English Wikipedia) was also considered in the analysis, as the team tried to minimize the number of items with sitelinks in the scholarly graph.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
