Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Article Status

Published

Authors/contributors

Zhang, Mengxue (Author)
Heffernan, Neil (Author)
Lan, Andrew (Author)

Title

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Abstract

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score labels. However, since scoring is a subjective process, these human scores are noisy and can be highly variable, depending on the scorer. In this paper, we investigate a collection of models that account for the individual preferences and tendencies of each human scorer in the automated scoring task. We apply these models to a short-answer math response dataset where each response is scored (often differently) by multiple different human scorers. We conduct quantitative experiments to show that our scorer models lead to improved automated scoring accuracy. We also conduct quantitative experiments and case studies to analyze the individual preferences and tendencies of scorers. We found that scorers can be grouped into several obvious clusters, with each cluster having distinct features, and analyzed them in detail.

Repository

arXiv

Archive ID

arXiv:2306.00791

Date

2023-06-01

Citation Key

zhang2023a

URL

http://arxiv.org/abs/2306.00791

Accessed

18/01/2024, 23:30

Library Catalogue

arXiv.org

Extra

arXiv:2306.00791 [cs] <标题>: 短答数学题评分者偏好建模与分析 Read_Status: New Read_Status_Date: 2026-01-26T11:32:29.807Z

Citation

Zhang, M., Heffernan, N., & Lan, A. (2023). Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions (arXiv:2306.00791). arXiv. http://arxiv.org/abs/2306.00791

Link to this record

https://aievidencehub.org/lib/77SQVE6Y