Downloads provided by UsageCounts
The code and comment data are a compilation of code blocks and their related comments. Doxygen successfully ran on 106,304 different GitHub projects. A total of 16,115,540 code-comment pairs were obtained by running Doxygen on C, C++, Java, and Python projects. The source code in these pairs can be of various granularities: classes, methods, functions, and variables. These data provide an association between source code and a description of that code. The data directory contains one directory for each project downloaded from GitHub. These project directories are named with the GraphQL ID from GitHub's GraphQL API. In each of these GraphQL-ID labeled directories, there is a license.txt, a url.txt, and a derivatives directory. The license.txt contains the license for the original project, the url.txt contains a link to the original project on GitHub, and the derivatives directory contains the output of running Doxygen on the project. The Doxygen output is a json file, structured as a dictionary with a "contents" field, where the value of that field is a list of lists containing 3 elements each. The following is a mock example of that structure: {"contents": [[path1, snippet1, comment1], [path2, snippet2, comment2], ...]}. The "path" is a filepath relative to the original project from which the code and comment were obtained. The "snippet" is the actual body of the source code. The "comment" is the corresponding comment. For convenience, there is also an initialize.py python script that iterates through all of the json files in the data directory and stores them in an SQLite database called "all_data.db".
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 154 | |
| downloads | 51 |

Views provided by UsageCounts
Downloads provided by UsageCounts