Automatic Short Math Answer Grading via In-context Meta-learning

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Preprint 01 Jan 2022Embargo end date: 01 Jan 2022Publisher:International Educational Data Mining SocietyJournal:CoRR, volume abs/2205.15219Funded by:NSF | Collaborative Research: C...

Authors: Mengxue Zhang; Sami Baral; Neil T. Heffernan; Andrew S. Lan;

doi: 10.48550/arxiv.2205.15219 , 10.5281/zenodo.6853031 , 10.5281/zenodo.6853032

arXiv: 2205.15219

Automatic Short Math Answer Grading via In-context Meta-learning

- Summary
- Subjects
- Metrics

Abstract

Automatic short answer grading is an important research direction in the exploration of how to use artificial intelligence (AI)-based tools to improve education. Current state-of-the-art approaches use neural language models to create vectorized representations of students responses, followed by classifiers to predict the score. However, these approaches have several key limitations, including i) they use pre-trained language models that are not well-adapted to educational subject domains and/or student-generated text and ii) they almost always train one model per question, ignoring the linkage across a question and result in a significant model storage problem due to the size of advanced language models. In this paper, we study the problem of automatic short answer grading for students' responses to math questions and propose a novel framework for this task. First, we use MathBERT, a variant of the popular language model BERT adapted to mathematical content, as our base model and fine-tune it for the downstream task of student response grading. Second, we use an in-context learning approach that provides scoring examples as input to the language model to provide additional context information and promote generalization to previously unseen questions. We evaluate our framework on a real-world dataset of student responses to open-ended math questions and show that our framework (often significantly) outperforms existing approaches, especially for new questions that are not seen during training.

To appear EDM 2022

Related Organizations

University of Massachusetts Amherst
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computation and Language (cs.CL), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average