Search

Publication year

Between 2000 and 2026
- Between 2020 and 2026
  - 2022

Reset search

64 resources

Abstracts

Towards Human-Like Educational Question Generation with Large Language Models

Zichao Wang, Jakob Valdez, Debshila Basu...
|
Apr 24th, 2022
|
bookSection

Zichao Wang, Jakob Valdez, Debshila Basu...

Apr 24th, 2022
Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension

Ying Xu, Dakuo Wang, Mo Yu
|
Apr 24th, 2022
|
journalArticle

Ying Xu, Dakuo Wang, Mo Yu

Apr 24th, 2022

Question answering (QA) is a fundamental means to facilitate assessment and training of narrative comprehension skills for both machines and young children, yet there is scarcity of high-quality QA datasets carefully designed to serve this purpose. In particular, existing datasets rarely distinguish fine-grained reading skills, such as the understanding of varying narrative elements. Drawing on the reading education research, we introduce FairytaleQA, a dataset focusing on narrative...
Towards a Unified Multi-Dimensional Evaluator for Text Generation

Ming Zhong, Yang Liu, Da Yin
|
Apr 24th, 2022
|
preprint

Ming Zhong, Yang Liu, Da Yin

Apr 24th, 2022

Multi-dimensional evaluation is the dominant paradigm for human evaluation in Natural Language Generation (NLG), i.e., evaluating the generated text from multiple explainable dimensions, such as coherence and fluency. However, automatic evaluation in NLG is still dominated by similarity-based metrics, and we lack a reliable framework for a more comprehensive evaluation of advanced models. In this paper, we propose a unified multi-dimensional evaluator UniEval for NLG. We re-frame NLG...
Towards a Unified Multi-Dimensional Evaluator for Text Generation

Ming Zhong, Yang Liu, Da Yin
|
Apr 24th, 2022
|
preprint

Ming Zhong, Yang Liu, Da Yin

Apr 24th, 2022

Multi-dimensional evaluation is the dominant paradigm for human evaluation in Natural Language Generation (NLG), i.e., evaluating the generated text from multiple explainable dimensions, such as coherence and fluency. However, automatic evaluation in NLG is still dominated by similarity-based metrics, and we lack a reliable framework for a more comprehensive evaluation of advanced models. In this paper, we propose a unified multi-dimensional evaluator UniEval for NLG. We re-frame NLG...

Custom feed

Last update from database: 24/04/2026, 16:15 (UTC)