2 resources

  • Shiki Sato, Yosuke Kishinami, Hiroaki Su...
    |
    Nov 3rd, 2022
    |
    conferencePaper
    Shiki Sato, Yosuke Kishinami, Hiroaki Su...
    Nov 3rd, 2022

    Automation of dialogue system evaluation is a driving force for the efficient development of dialogue systems. This paper introduces the bipartite-play method, a dialogue collection method for automating dialogue system evaluation. It addresses the limitations of existing dialogue collection methods: (i) inability to compare with systems that are not publicly available, and (ii) vulnerability to cheating by intentionally selecting systems to be compared. Experimental results show that the...

  • Rohan Anil, Andrew M. Dai, Orhan Firat
    |
    May 17th, 2023
    |
    preprint
    Rohan Anil, Andrew M. Dai, Orhan Firat
    May 17th, 2023

    We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on downstream tasks across different model sizes, while simultaneously exhibiting faster and more...

Last update from database: 03/04/2025, 21:15 (UTC)
Powered by Zotero and Kerko.