
Sample Graph Datasets in CSV Format

Note: none of the datasets published here contain real data; they are for testing purposes only.

Description

This data repository contains graph datasets. Each graph is represented by two CSV files: one with node information and one with edge details. The two files of a graph share a common identifier in their names, based on the number of nodes. For example:

- dataset_30_nodes_interactions.csv: contains 30 rows (nodes).
- dataset_30_edges_interactions.csv: contains 47 rows (edges).

Here the common identifier dataset_30 indicates that both files describe the same graph.

CSV nodes

Each nodes file contains the following columns:

| Name of the Column | Type | Description |
|---|---|---|
| UniProt ID | string | protein identification |
| label | string | protein label (type of node) |
| properties | string | a dictionary containing properties related to the protein |

CSV edges

Each edges file contains the following columns:

| Name of the Column | Type | Description |
|---|---|---|
| Relationship ID | string | relationship identification |
| Source ID | string | identification of the source protein in the relationship |
| Target ID | string | identification of the target protein in the relationship |
| label | string | relationship label (type of relationship) |
| properties | string | a dictionary containing properties related to the relationship |

Metadata

| Graph | Number of Nodes | Number of Edges | Sparse graph |
|---|---|---|---|
| dataset_30* | 30 | 47 | Y |
| dataset_60* | 60 | 181 | Y |
| dataset_120* | 120 | 689 | Y |
| dataset_240* | 240 | 2819 | Y |
| dataset_300* | 300 | 4658 | Y |
| dataset_600* | 600 | 18004 | Y |
| dataset_1200* | 1200 | 71785 | Y |
| dataset_2400* | 2400 | 288600 | Y |
| dataset_3000* | 3000 | 449727 | Y |
| dataset_6000* | 6000 | 1799413 | Y |
| dataset_12000* | 12000 | 7199863 | Y |
| dataset_24000* | 24000 | 28792361 | Y |
| dataset_30000* | 30000 | 44991744 | Y |

This repository includes two (2) additional tiny graph datasets for experimenting before working with the larger datasets.
CSV nodes (tiny graphs)

Each dataset contains the following columns:

| Name of the Column | Type | Description |
|---|---|---|
| ID | string | node identification |
| label | string | node label (type of node) |
| properties | string | a dictionary containing properties related to the node |

CSV edges (tiny graphs)

Each dataset contains the following columns:

| Name of the Column | Type | Description |
|---|---|---|
| ID | string | relationship identification |
| source | string | identification of the source node in the relationship |
| target | string | identification of the target node in the relationship |
| label | string | relationship label (type of relationship) |
| properties | string | a dictionary containing properties related to the relationship |

Metadata (tiny graphs)

| Graph | Number of Nodes | Number of Edges | Sparse graph |
|---|---|---|---|
| dataset_dummy* | 3 | 6 | N |
| dataset_dummy2* | 3 | 6 | N |
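Loading a graph

The file naming and column layout described above can be sketched in Python as follows. This is a minimal illustration, not code shipped with the repository. It rests on several assumptions: the CSV header row uses the column names listed in the tables; the properties string is either JSON or a Python dict literal (the description does not state the encoding); edges are directed from source to target; and the helper names (`dataset_paths`, `parse_properties`, `read_rows`, `density`) are hypothetical. The file name pattern is taken from the dataset_30 example, and whether the tiny datasets follow it is not stated.

```python
import ast
import csv
import json
from pathlib import Path


def dataset_paths(identifier, directory="."):
    """Return (nodes_path, edges_path) for a common identifier such as 'dataset_30'.

    The suffixes follow the dataset_30 example in the description.
    """
    base = Path(directory)
    return (
        base / f"{identifier}_nodes_interactions.csv",
        base / f"{identifier}_edges_interactions.csv",
    )


def parse_properties(text):
    """Decode a 'properties' cell into a dict.

    Assumption: the cell holds JSON or a Python dict literal.
    """
    if not text:
        return {}
    try:
        value = json.loads(text)
    except json.JSONDecodeError:
        # Fall back to a Python literal such as {'name': 'b'}.
        value = ast.literal_eval(text)
    if not isinstance(value, dict):
        raise ValueError(f"properties is not a dictionary: {text!r}")
    return value


def read_rows(path):
    """Read one CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        rows = list(csv.DictReader(handle))
    for row in rows:
        row["properties"] = parse_properties(row.get("properties", ""))
    return rows


def density(n_nodes, n_edges):
    """Fraction of possible directed edges (no self-loops) that are present."""
    if n_nodes < 2:
        return 0.0
    return n_edges / (n_nodes * (n_nodes - 1))
```

With the files in the working directory, `nodes_path, edges_path = dataset_paths("dataset_30")` followed by `read_rows(nodes_path)` and `read_rows(edges_path)` would load one graph. Under the directed-edge assumption, `density(3, 6)` evaluates to 1.0 for the tiny graphs (every possible edge is present, matching their N in the sparse column), while `density(30, 47)` is roughly 0.054.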
