
Supplementary material to the paper:"The Trade-off Between Data Volume and Quality in Predicting User Satisfaction in Software Projects"accepted to Euromicro Conference on Software Engineering and Advanced Applications (SEAA) 2024 The contents of this material is as follows: README -> a file with this information additional_figures.pdf -> a file containing additional figures for which there was no space in the manuscript general.preprocessing.R -> R script for general data preprocessing dq.on.effort.prediction.preparation.R -> R script for detailed data preparation dq.on.effort.prediction.R -> R script for training models and running predictions additional.functions.R -> additional functions used in other scripts res.rds -> raw prediction results for all models, all data variants in all data splits, in R's "rds" format. To open this file run readRDS("res.rds").
data volume, software projects, machine learning, user satisfaction, data quality, prediction, ISBSG
data volume, software projects, machine learning, user satisfaction, data quality, prediction, ISBSG
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
