Question Generation by Transformers

Article Status
Published
Authors/contributors
Kriangchaivech, K.; Wangperawong, A.
Title
Question Generation by Transformers
Abstract
A machine learning model was developed to automatically generate questions from Wikipedia passages using transformers, an attention-based architecture that eschews the recurrent paradigm of existing recurrent neural networks (RNNs). The model was trained on an inverted form of the Stanford Question Answering Dataset (SQuAD), a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles. After training, the question generation model is able to produce simple questions relevant to unseen passages and answers, averaging eight words per question. The word error rate (WER) was used as a metric to compare the similarity between SQuAD questions and the model-generated questions. Although the high average WER suggests that the generated questions differ from the original SQuAD questions, they are mostly grammatically correct and plausible in their own right.
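The abstract uses word error rate to compare generated questions against the SQuAD originals. As an illustration only (the paper's own evaluation code is not shown here), WER is conventionally defined as the word-level Levenshtein edit distance between hypothesis and reference, normalized by the reference length. A minimal sketch:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference length."""
    r, h = reference.split(), hypothesis.split()
    # Dynamic-programming table of edit distances between prefixes.
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(h) + 1):
        d[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            sub = d[i - 1][j - 1] + (r[i - 1] != h[j - 1])  # match/substitute
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(r)][len(h)] / len(r)
```

For example, a generated question that differs from the reference in one word out of six gives a WER of about 0.17; identical questions give 0.0. A high average WER across the test set therefore indicates the model paraphrases rather than reproduces the crowdworkers' questions.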
Repository
arXiv
Archive ID
arXiv:1909.05017
Date
2019-09-14
Citation Key
kriangchaivech2019
Accessed
08/10/2025, 23:11
Library Catalogue
Extra
arXiv:1909.05017 [cs]
Citation
Kriangchaivech, K., & Wangperawong, A. (2019). Question Generation by Transformers (arXiv:1909.05017). arXiv. https://doi.org/10.48550/arXiv.1909.05017