2 resources

  • Yunting Liu, Shreya Bhandari, Zachary A....
    |
    May 22nd, 2025
    |
    journalArticle
    Yunting Liu, Shreya Bhandari, Zachary A....
    May 22nd, 2025

    Effective educational measurement relies heavily on the curation of well‐designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT‐3.5, GPT‐4, Llama 2, Llama 3, Gemini‐Pro and Cohere Command R Plus) to generate responses with psychometric properties comparable to those of human respondents....

  • Yunting Liu, Shreya Bhandari, Zachary A....
    |
    May 22nd, 2025
    |
    journalArticle
    Yunting Liu, Shreya Bhandari, Zachary A....
    May 22nd, 2025

    Effective educational measurement relies heavily on the curation of well‐designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT‐3.5, GPT‐4, Llama 2, Llama 3, Gemini‐Pro and Cohere Command R Plus) to generate responses with psychometric properties comparable to those of human respondents....

Last update from database: 22/10/2025, 10:15 (UTC)
Powered by Zotero and Kerko.