Short-answer scoring with ensembles of pretrained language models
Article Status
Published
Author/contributor
- Ormerod, Christopher (Author)
Title
Short-answer scoring with ensembles of pretrained language models
Abstract
We investigate the effectiveness of ensembles of pretrained transformer-based language models on short-answer questions using the Kaggle Automated Short Answer Scoring dataset. We fine-tune a collection of popular small, base, and large pretrained transformer-based language models, and train one feature-based model on the dataset, with the aim of testing ensembles of these models. We used an early-stopping mechanism and hyperparameter optimization in training. We observe that the larger models generally perform slightly better; however, they still fall short of state-of-the-art results on their own. Once we consider ensembles of models, there are ensembles of a number of large networks that do produce state-of-the-art results; however, these ensembles are too large to realistically be put in a production environment.
Repository
arXiv
Archive ID
arXiv:2202.11558
Date
2022-02-23
Accessed
10/05/2024, 02:15
Extra
arXiv:2202.11558 [cs]
Citation Key: ormerod2022
Citation
Ormerod, C. (2022). Short-answer scoring with ensembles of pretrained language models (arXiv:2202.11558). arXiv. http://arxiv.org/abs/2202.11558