The use of semantic similarity tools in automated content scoring of fact-based essays written by EFL learners

Article Status
Published
Author/contributor
Title
The use of semantic similarity tools in automated content scoring of fact-based essays written by EFL learners
Abstract
This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard was produced by a native expert. A shortlist of carefully selected tools, including InferSent, spaCy, DKPro, ADW, SEMILAR and Latent Semantic Analysis, generated semantic similarity scores between student writing samples and the expert sample. Three teachers who were lecturers of the course manually graded the student samples on content. To ensure validity of human grades, samples with discrepant agreement were excluded and an inter-rater reliability test was conducted on remaining samples with quadratic weighted kappa. After the grades of the remaining samples were proven valid, a Pearson correlation analysis between semantic similarity scores and human grades was conducted and results showed that InferSent was the most effective tool in predicting the human grades. The study further pointed to the limitations of the six tools and suggested three alternatives to traditional methods in turning semantic similarity scores into reporting grades on content.
Publication
Education and Information Technologies
Volume
27
Issue
9
Pages
13021-13049
Date
2022-6-21
Journal Abbr
Educ Inf Technol
Language
en
ISSN
1360-2357
Accessed
15/02/2023, 23:07
Library Catalogue
DOI.org (Crossref)
Extra
Citation Key: wang2022a <标题>: 在自动评分以英语为外语的学习者所写事实性文章时使用语义相似性工具 <AI Smry>: This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners and found InferSent was the most effective tool in predicting the human grades.
Citation
Wang, Q. (2022). The use of semantic similarity tools in automated content scoring of fact-based essays written by EFL learners. Education and Information Technologies, 27(9), 13021–13049. https://doi.org/10.1007/s10639-022-11179-1
Powered by Zotero and Kerko.