A Comparative Study of Item Response Theory Models for Mixed Discrete-Continuous Responses

Article, Other literature type. 25 Feb 2024. English. Publisher: MDPI AG. Journal: Journal of Intelligence, volume 12, page 26 (eISSN: 2079-3200)

Authors: Cengiz Zopluoglu; J. R. Lockwood

doi: 10.3390/jintelligence12030026

pmid: 38535160

pmc: PMC10970766


Abstract

Language proficiency assessments are pivotal in educational and professional decision-making. With the integration of AI-driven technologies, these assessments can more frequently use item types, such as dictation tasks, producing response features with a mixture of discrete and continuous distributions. This study evaluates novel measurement models tailored to these unique response features. Specifically, we evaluated the performance of the zero-and-one-inflated extensions of the Beta, Simplex, and Samejima’s Continuous item response models and incorporated collateral information into the estimation using latent regression. Our findings highlight that while all models provided highly correlated results regarding item and person parameters, the Beta item response model showcased superior out-of-sample predictive accuracy. However, a significant challenge was the absence of established benchmarks for evaluating model and item fit for these novel item response models. There is a need for further research to establish benchmarks for evaluating the fit of these innovative models to ensure their reliability and validity in real-world applications.


Keywords

item response theory, continuous response model, bounded continuous data, zero-and-one inflated data, dictation task, natural language processing, language assessment, Social sciences (General), H1-99

