How to Overfit

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2025 Denmark Publisher:Elsevier BVJournal:Chemometrics and Intelligent Laboratory Systems, volume 264, page 105,461 (issn: 0169-7439,

Copyright policy )

Authors: Rasmus Bro;

doi: 10.2139/ssrn.5243604 , 10.1016/j.chemolab.2025.105461

How to Overfit

- Summary
- Metrics

Abstract

Overfitting remains a central challenge in modern data science, particularly as complex analytical tools become more accessible and widely applied in fields like chemometrics. This communication outlines a series of common pitfalls that lead to misleading and non-generalizable models-ranging from poor data quality and insufficient sample sizes to misuse of validation strategies and overly complex modeling choices. By illustrating a caricatured protocol for generating bad models, the paper emphasizes the importance of domain knowledge, appropriate experimental design, and rigorous validation. It advocates for "validity by design" as a proactive strategy to ensure robust, interpretable, and scientifically sound results.

Country

Denmark

Related Organizations

View all View all

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Top 10%

Average

Green

hybrid

Related to Research communities

UArctic