In authors or contributors

4 resources

  • Jill Burstein, Kevin Yancey, Klinton Bic...
    |
    Apr 12th, 2023
    |
    document
    Jill Burstein, Kevin Yancey, Klinton Bic...
    Apr 12th, 2023
  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Apr 12th, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Apr 12th, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Apr 12th, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Apr 12th, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Jul 12th, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Jul 12th, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

Last update from database: 12/04/2025, 18:15 (UTC)
Powered by Zotero and Kerko.