Determinants of grader agreement: an analysis of multiple short answer corpora
Article Status
Published
Authors/contributors
- Padó, Ulrike (Author)
- Padó, Sebastian (Author)
Title
Determinants of grader agreement: an analysis of multiple short answer corpora
Abstract
The ’short answer’ question format is a widely used tool in educational assessment, in which students write one to three sentences in response to an open question. The answers are subsequently rated by expert graders. The agreement between these graders is crucial for reliable analysis, both in terms of educational strategies and in terms of developing automatic models for short answer grading (SAG), an active research topic in NLP. This makes it important to understand the properties that influence grader agreement (such as question difficulty, answer length, and answer correctness). However, the twin challenges towards such an understanding are the wide range of SAG corpora in use (which differ along a number of dimensions) and the hierarchical structure of potentially relevant properties (which can be located at the corpus, answer, or question levels). This article uses generalized mixed effects models to analyze the effect of various such properties on grader agreement in six major SAG corpora for two main assessment tasks (language and content assessment). Overall, we find broad agreement among corpora, with a number of properties behaving similarly across corpora (e.g., shorter answers and correct answers are easier to grade). Some properties show more corpus-specific behavior (e.g., the question difficulty level), and some corpora are more in line with general tendencies than others. In sum, we obtain a nuanced picture of how the major short answer grading corpora are similar and dissimilar from which we derive suggestions for corpus development and analysis.
Publication
Language Resources and Evaluation
Date
2021-6-12
Volume
56
Issue
2
Pages
387-416
Journal Abbr
Lang. Resour. Eval.
Citation Key
pado2022
Accessed
07/10/2024, 20:38
ISSN
1574-020X
Short Title
Determinants of grader agreement
Language
en
Library Catalogue
DOI.org (Crossref)
Extra
<标题>: 评分员一致性的决定因素:多篇简答语料库分析
<AI Smry>: A nuanced picture is obtained of how the major short answer grading corpora are similar and dissimilar from which suggestions for corpus development and analysis are derived.
Read_Status: New
Read_Status_Date: 2026-01-26T11:33:53.529Z
Citation
Padó, U., & Padó, S. (2021). Determinants of grader agreement: an analysis of multiple short answer corpora. Language Resources and Evaluation, 56(2), 387–416. https://doi.org/10.1007/s10579-021-09547-3
Link to this record