Downloads provided by UsageCounts
This dataset consists of two distinct scholarly knowledge graphs (KGs) created from two publicly available bibliographic datasets: 1) a triplestore covering information about the journal Scientometrics, provided by OpenCitations, and 2) the AMiner AND benchmark from 2018. The KGs were extracted for a research project on knowledge graph embeddings (KGEs) for author name disambiguation (AND). The structural triples of each knowledge graph are split into training, validation and test sets for use with representation learning methods. Textual and numeric literals are stored separately so that multimodal KGE approaches can be implemented (see arXiv:1802.00934). For the same reason, and to simplify the representation learning task, the textual literals are already encoded as sentence embeddings in textual_literals.npy and the numeric literals as a numeric matrix in numeric_literals.npy. The file and_eval.json of each KG contains the dataset used to evaluate our AND architecture. The scripts used to gather the dataset are available at https://github.com/sntcristian/and-kge/tree/main/src/AMiner-534K and https://github.com/sntcristian/and-kge/tree/main/src/OC-782K.
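As an illustration of how the precomputed files might be read, the following Python sketch loads textual_literals.npy, numeric_literals.npy and and_eval.json from the directory of one KG. Only the three file names come from the description; the helper name `load_kg_literals`, the directory layout, the array shapes and the JSON structure are assumptions made for this example, so the demo runs on small placeholder files instead of the real data.

```python
import json
import tempfile
from pathlib import Path

import numpy as np


def load_kg_literals(kg_dir):
    """Load literal features and the AND evaluation set for one KG.

    Hypothetical helper: assumes the three files sit directly in kg_dir.
    """
    kg_dir = Path(kg_dir)
    # Sentence embeddings of the textual literals.
    textual = np.load(kg_dir / "textual_literals.npy")
    # Matrix of numeric literals.
    numeric = np.load(kg_dir / "numeric_literals.npy")
    # Evaluation dataset; its internal structure is not specified here.
    with open(kg_dir / "and_eval.json", encoding="utf-8") as f:
        and_eval = json.load(f)
    return textual, numeric, and_eval


# Demo on placeholder files (shapes and JSON content are made up).
with tempfile.TemporaryDirectory() as tmp:
    tmp = Path(tmp)
    np.save(tmp / "textual_literals.npy", np.zeros((3, 8), dtype=np.float32))
    np.save(tmp / "numeric_literals.npy", np.zeros((3, 2), dtype=np.float32))
    (tmp / "and_eval.json").write_text(
        json.dumps({"placeholder": []}), encoding="utf-8"
    )
    textual, numeric, and_eval = load_kg_literals(tmp)
    print(textual.shape, numeric.shape, type(and_eval).__name__)
```

With the real dataset, the placeholder directory would be replaced by the path of the downloaded KG folder. Whether the rows of the two arrays follow the same entity order is not stated in the description and should be checked against the gathering scripts.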
scholarly data, knowledge graph, author disambiguation, linked data, scientometrics, knowledge graph embeddings
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 19 |
| downloads | | 4 |

Views provided by UsageCounts