In authors or contributors

4 resources

  • Jill Burstein, Kevin Yancey, Klinton Bic...
    |
    Sep 3rd, 2023
    |
    document
    Jill Burstein, Kevin Yancey, Klinton Bic...
    Sep 3rd, 2023
  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Sep 3rd, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Sep 3rd, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Sep 3rd, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Sep 3rd, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

  • Kevin P. Yancey, Geoffrey Laflair, Antho...
    |
    Jul 3rd, 2023
    |
    conferencePaper
    Kevin P. Yancey, Geoffrey Laflair, Antho...
    Jul 3rd, 2023

    Essay scoring is a critical task used to evaluate second-language (L2) writing proficiency on high-stakes language assessments. While automated scoring approaches are mature and have been around for decades, human scoring is still considered the gold standard, despite its high costs and well-known issues such as human rater fatigue and bias. The recent introduction of large language models (LLMs) brings new opportunities for automated scoring. In this paper, we evaluate how well GPT-3.5 and...

Last update from database: 03/09/2025, 11:15 (UTC)
Powered by Zotero and Kerko.