descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 13 May 2023 English Publisher:MDPI AGJournal:Algorithms, volume 16, page 253 (eissn: 1999-4893,

Authors: Leslie R. Pendrill; Jeanette Melin; Anne Stavelin; Gunnar Nordin;

doi: 10.3390/a16050253

Modernising Receiver Operating Characteristic (ROC) Curves

- Summary
- Subjects
- Metrics

Abstract

The justification for making a measurement can be sought in asking what decisions are based on measurement, such as in assessing the compliance of a quality characteristic of an entity in relation to a specification limit, SL. The relative performance of testing devices and classification algorithms used in assessing compliance is often evaluated using the venerable and ever popular receiver operating characteristic (ROC). However, the ROC tool has potentially all the limitations of classic test theory (CTT) such as the non-linearity, effects of ordinality and confounding task difficulty and instrument ability. These limitations, inherent and often unacknowledged when using the ROC tool, are tackled here for the first time with a modernised approach combining measurement system analysis (MSA) and item response theory (IRT), using data from pregnancy testing as an example. The new method of assessing device ability from separate Rasch IRT regressions for each axis of ROC curves is found to perform significantly better, with correlation coefficients with traditional area-under-curve metrics of at least 0.92 which exceeds that of linearised ROC plots, such as Linacre’s, and is recommended to replace other approaches for device assessment. The resulting improved measurement quality of each ROC curve achieved with this original approach should enable more reliable decision-making in conformity assessment in many scenarios, including machine learning, where its use as a metric for assessing classification algorithms has become almost indispensable.

Related Organizations

Swedish Defence University
Sweden
RISE Research Institutes of Sweden
Sweden
RISE Research Institute of Sweden
Sweden

Keywords

measurement system analysis; rating ability; ordinality; receiver operating characteristic; decision risks, receiver operating characteristic, decision risks, Industrial engineering. Management engineering, measurement system analysis, Electronic computers. Computer science, rating ability, ordinality, QA75.5-76.95, T55.4-60.8

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	11
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

Top 10%

Average

Top 10%

gold

Fields of Science (3) View all

medical and health sciences

basic medicine

Fields of Science

medical and health sciences

basic medicine

View all