Downloads provided by UsageCounts
This dataset contains the scripts and data used in the study reported in the paper "Mining the Technical Roles of GitHub Users". The files are described in more detail below:
processed_ground_truth.csv: A CSV file with information on the developers considered in the study. For privacy reasons, the dataset has already been preprocessed to remove identifying information. Please contact the authors if you need the original.
processed_ground_truth_fullstack.csv: The same CSV file, but with fullstack developers.
script.ipynb, utils.py: Source code of the script used in the study.
Dockerfile, docker-compose.yml, requirements.txt: Files to replicate the code environment used in the study.
BoW-tuning.csv: Classification results for different bag-of-words parameters.
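Since the column layout of the CSV files is not documented here, a first step before running the notebook may be to inspect them. The following is a minimal sketch using only the Python standard library. The function name `summarize_csv` and the `label_column` parameter are hypothetical and not part of the published scripts, and no column name is assumed.

```python
import csv
from collections import Counter


def summarize_csv(path, label_column=None):
    """Return (column names, row count, label counts) for a CSV file.

    `label_column` is optional; pass a column name taken from the file's
    own header to count how often each value occurs in that column.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # fieldnames is None for an empty file, so fall back to an empty list.
        columns = list(reader.fieldnames or [])
        labels = Counter()
        n_rows = 0
        for row in reader:
            n_rows += 1
            if label_column is not None:
                labels[row.get(label_column)] += 1
    return columns, n_rows, labels
```

Calling `summarize_csv("processed_ground_truth.csv")` from the directory holding the downloaded files would return the header and row count. Once the header is known, a second call with `label_column` set to the role column would give the class distribution.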
technical role, machine learning, github, developer profile, expertise, technical recruiters
| selected citations | 0 | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). |
| popularity | Average | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. |
| influence | Average | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). |
| impulse | Average | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. |
| views | 31 | |
| downloads | 40 | |

Views provided by UsageCounts