
A Dataset of Bot and Human Contributors' Names in GitHub

This repository provides a dataset of 2,150 GitHub contributors (1,035 bots and 1,115 humans) that were sufficiently active, meaning each had performed at least 5 events on GitHub as of 3 May 2024.

This dataset accompanies the paper "A Bot Identification Model and Tool Based on GitHub Activity Sequences", published in the Journal of Systems and Software (JSS); see https://doi.org/10.1016/j.jss.2024.112287. The paper is co-authored by Natarajan Chidambaram, Alexandre Decan and Tom Mens (Software Engineering Lab, University of Mons, Belgium).

This work is supported by Service Public de Wallonie Recherche under grant number 2010235 - ARIAC by DigitalWallonia4.AI, and by the Fonds de la Recherche Scientifique – FNRS under grant numbers J.0147.24, T.0149.22, and F.4515.23.

Files description

bots.txt - contains the login names of bots, one per line
humans.txt - contains the login names of humans, one per line
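
Since both files store one login name per line, they can be loaded with a few lines of Python. The following is a minimal sketch rather than part of the dataset or the accompanying tool: the function names `read_logins` and `load_labels` are hypothetical, and it assumes the files are UTF-8 text located in the working directory. If a login name were to appear in both files, this sketch would label it as a bot.

```python
from pathlib import Path


def read_logins(path):
    """Return the login names stored one per line in a text file."""
    text = Path(path).read_text(encoding="utf-8")
    # Strip surrounding whitespace and skip blank lines.
    return [line.strip() for line in text.splitlines() if line.strip()]


def load_labels(bots_path="bots.txt", humans_path="humans.txt"):
    """Map each login name to the label "bot" or "human".

    The default paths are an assumption: both files are expected
    in the current working directory.
    """
    labels = {login: "human" for login in read_logins(humans_path)}
    # Bots are added last, so a name listed in both files ends up as "bot".
    labels.update({login: "bot" for login in read_logins(bots_path)})
    return labels
```

Calling `load_labels()` from the directory containing the two files should yield a dictionary that can be used as ground truth when evaluating a bot classifier.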
