Views provided by UsageCounts
This repository contains data and analysis code for the paper "Garbage In, Garbage Out? Do Machine Learning Application Papers in Social Computing Report Where Human-Labeled Training Data Comes From?", which is to appear in the Proceedings of ACM FAT* 2020. The Jupyter notebook data_analysis_viz.ipynb loads gigo_noscores_dataset_anon.csv, which has the final labels for each of the papers, including metadata about where they were published. Annotation information scores are calculated, and results are calculated and plotted. The notebook then exports gigo_final_dataset_anon.csv, which contains the same columns as gigo_noscores_dataset_anon.csv, but also includes information scores for each paper and some imputed metadata categories about publication type. We have chosen to de-identify the papers presented in this publicly-released dataset, and so papers are only referred to with a unique id. If you are interested in obtaining the identifying information for research purposes, please contact Stuart Geiger.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 9 |

Views provided by UsageCounts