In authors or contributors

2 resources

  • Yejin Bang, Samuel Cahyawijaya, Nayeon L...
    |
    Feb 28th, 2023
    |
    preprint
    Yejin Bang, Samuel Cahyawijaya, Nayeon L...
    Feb 28th, 2023

    This paper proposes a framework for quantitatively evaluating interactive LLMs such as ChatGPT using publicly available data sets. We carry out an extensive technical evaluation of ChatGPT using 23 data sets covering 8 different common NLP application tasks. We evaluate the multitask, multilingual and multi-modal aspects of ChatGPT based on these data sets and a newly designed multimodal dataset. We find that ChatGPT outperforms LLMs with zero-shot learning on most tasks and even outperforms...

  • Yejin Bang, Samuel Cahyawijaya, Nayeon L...
    |
    Feb 28th, 2023
    |
    preprint
    Yejin Bang, Samuel Cahyawijaya, Nayeon L...
    Feb 28th, 2023

    This paper proposes a framework for quantitatively evaluating interactive LLMs such as ChatGPT using publicly available data sets. We carry out an extensive technical evaluation of ChatGPT using 23 data sets covering 8 different common NLP application tasks. We evaluate the multitask, multilingual and multi-modal aspects of ChatGPT based on these data sets and a newly designed multimodal dataset. We find that ChatGPT outperforms LLMs with zero-shot learning on most tasks and even outperforms...

Last update from database: 28/12/2024, 22:15 (UTC)
Powered by Zotero and Kerko.