A Survey of Sampling Methods for Hyperspectral Remote Sensing: Addressing Bias Induced by Random Sampling

Keywords: remote sensing, correlation, Science, model assessment, Q, sampling algorithm, generalization

Kevin T. Decker; Brett J. Borghetti

Remote Sensing

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref, DOAJ

Publication: Article, 11 Apr 2025, English. Publisher: MDPI AG. Journal: Remote Sensing, volume 17, page 1373 (eISSN: 2072-4292)

doi: 10.3390/rs17081373

Abstract

Identified as early as 2000, the challenges involved in developing and assessing remote sensing models with small datasets remain, with one key issue persisting: the misuse of random sampling to generate training and testing data. This practice often introduces a high degree of correlation between the sets, leading to an overestimation of model generalizability. Despite the early recognition of this problem, few researchers have investigated its nuances or developed effective sampling techniques to address it. Our survey highlights that mitigation strategies to reduce this bias remain underutilized in practice, distorting the interpretation and comparison of results across the field. In this work, we introduce a set of desirable characteristics to evaluate sampling algorithms, with a primary focus on their tendency to induce correlation between training and test data, while also accounting for other relevant factors. Using these characteristics, we survey 146 articles, identify 16 unique sampling algorithms, and evaluate them. Our evaluation reveals two broad archetypes of sampling techniques that effectively mitigate correlation and are suitable for model development.

Related Organizations

Air Force Institute of Technology
United States
