DMP: NFL Stadium Attendance Prediction with Random Forest Regression (NFL-Attendance-RF)

Data Management Plan — (28 Apr 2025) for the NFL-Attendance-RF projectSubmitted for Part 2 “Data Management” — Data Stewardship UE 2025S, TU Wien 📂 Input & derived data – Kaggle source tables and the four curated splits (train, validation, test, merged baseline) are published in DBRepo (DOIs P1–P4). 🤖 Model & results – The final Random-Forest regressor plus five evaluation/diagnostic artefacts live in TUWRD(DOIs O1–O6). 🧩 Rich metadata – A FAIR4ML record is embedded in the model landing page, and an external CodeMeta file connects author, dependencies and every PID. 🔒 Preservation & security – Redundant storage (GitHub → Zenodo, DBRepo, TUWRD) and TU Wien’s ten-year retention guarantee long-term accessibility. 💻 Full code — notebook, helper scripts, requirements.txt, README, licence and ... — is on GitHub:https://github.com/emilp-tuwien/nfl-attendance-prediction (DOI: https://doi.org/10.5281/zenodo.15292895; clone the repo to rerun the entire pipeline).

Keywords

Machine Learning, Random Forest, Prediction, Hyperparameter Tuning, NFL

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green