GPT detectors are biased against non-native English writers

Article Status

Published

Authors/contributors

Liang, Weixin (Author)
Yuksekgonul, Mert (Author)
Mao, Yining (Author)
Wu, Eric (Author)
Zou, James (Author)

Title

GPT detectors are biased against non-native English writers

Abstract

The rapid adoption of generative language models has brought about substantial advancements in digital communication, while simultaneously raising concerns regarding the potential misuse of AI-generated content. Although numerous detection methods have been proposed to differentiate between AI and human-generated content, the fairness and robustness of these detectors remain underexplored. In this study, we evaluate the performance of several widely-used GPT detectors using writing samples from native and non-native English writers. Our findings reveal that these detectors consistently misclassify non-native English writing samples as AI-generated, whereas native writing samples are accurately identified. Furthermore, we demonstrate that simple prompting strategies can not only mitigate this bias but also effectively bypass GPT detectors, suggesting that GPT detectors may unintentionally penalize writers with constrained linguistic expressions. Our results call for a broader conversation about the ethical implications of deploying ChatGPT content detectors and caution against their use in evaluative or educational settings, particularly when they may inadvertently penalize or exclude non-native English speakers from the global discourse. The published version of this study can be accessed at: www.cell.com/patterns/fulltext/S2666-3899(23)00130-7

Repository

arXiv

Archive ID

arXiv:2304.02819

Date

2023-07-10

DOI

10.1016/j.patter.2023.100779

URL

http://arxiv.org/abs/2304.02819

Accessed

04/12/2023, 13:54

Library Catalogue

arXiv.org

Extra

arXiv:2304.02819 [cs] <AI Smry>: GPT detectors frequently misclassify non-native English writing as AI-generated, raising concerns about fairness and robustness, and addressing the biases in these detectors is crucial to prevent the marginalization of non-natives in evaluative and educational settings. PMID: 37521038

Citation

Liang, W., Yuksekgonul, M., Mao, Y., Wu, E., & Zou, J. (2023). GPT detectors are biased against non-native English writers (arXiv:2304.02819). arXiv. https://doi.org/10.1016/j.patter.2023.100779

Empirical studies

b - key references

Link to this record

https://aievidencehub.org/lib/GBVIPHDT