Bipartite-play Dialogue Collection for Practical Automatic Evaluation of Dialogue Systems

Article Status
Published
Authors/contributors
Title
Bipartite-play Dialogue Collection for Practical Automatic Evaluation of Dialogue Systems
Abstract
Automation of dialogue system evaluation is a driving force for the efficient development of dialogue systems. This paper introduces the bipartite-play method, a dialogue collection method for automating dialogue system evaluation. It addresses the limitations of existing dialogue collection methods: (i) inability to compare with systems that are not publicly available, and (ii) vulnerability to cheating by intentionally selecting systems to be compared. Experimental results show that the automatic evaluation using the bipartite-play method mitigates these two drawbacks and correlates as strongly with human subjectivity as existing methods.
Date
2022-11
Proceedings Title
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing: Student Research Workshop
Place
Online
Publisher
Association for Computational Linguistics
Pages
8–16
Citation
Sato, S., Kishinami, Y., Sugiyama, H., Akama, R., Tokuhisa, R., & Suzuki, J. (2022). Bipartite-play Dialogue Collection for Practical Automatic Evaluation of Dialogue Systems. In Y. Hanqi, Y. Zonghan, S. Ruder, & W. Xiaojun (Eds.), Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing: Student Research Workshop (pp. 8–16). Association for Computational Linguistics. https://aclanthology.org/2022.aacl-srw.2
Technical methods
Powered by Zotero and Kerko.