
Project Directory Structure

Analysis: Contains the results of the smell distribution per file and per KLOC.
classification: Holds the classification of smelly files based on their content.
criteria: Python package defining the criteria used to filter ML and non-ML projects, including toy projects.
csv:
    Holds all the files used to analyze the type of operations carried out in each specific stage.
    Holds the statistics for ML and non-ML projects.
    Holds the report of the dataset collection.
    Holds the CSV file used to evaluate the accuracy, precision, and recall of our classifier model.
    Holds the CSV files used to assess the performance of the RIdiom tool.
dataset: Contains the list of projects collected per domain for analysis.
images: Contains the heatmaps, boxplots, and histograms generated from the analysis.
repo_mining: Code for querying projects by topic from GitHub.
source: All scripts used for the analysis.
utils: Contains various helper functions to reduce code duplication.
Zero_shot_classification: Code for our zero-shot classification model.
