In authors or contributors

3 resources

  • Elizabeth Clark, Asli Celikyilmaz, Noah ...
    Jan 31st, 2019 | conferencePaper
  • Elizabeth Clark, Tal August, Sofia Serra...
    Jan 31st, 2021 | preprint

    Human evaluations are typically considered the gold standard in natural language generation, but as models' fluency improves, how well can evaluators detect and judge machine-generated text? We run a study assessing non-experts' ability to distinguish between human- and machine-authored text (GPT2 and GPT3) in three domains (stories, news articles, and recipes). We find that, without training, evaluators distinguished between GPT3- and human-authored text at random chance level. We explore...

  • Elizabeth Clark, Tal August, Sofia Serra...
    Jan 31st, 2021 | preprint

    Human evaluations are typically considered the gold standard in natural language generation, but as models' fluency improves, how well can evaluators detect and judge machine-generated text? We run a study assessing non-experts' ability to distinguish between human- and machine-authored text (GPT2 and GPT3) in three domains (stories, news articles, and recipes). We find that, without training, evaluators distinguished between GPT3- and human-authored text at random chance level. We explore...

Last update from database: 31/01/2026, 17:15 (UTC)