Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Article Status
Published
Title
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Date
2025-03-06 19:37:35
Citation Key
zotero-12387
Accessed
06/03/2025, 19:37
Extra
<Title>: Vibe-Eval: A hard evaluation suite for measuring the progress of multimodal language models
Read_Status: New
Read_Status_Date: 2026-01-26T11:33:01.187Z
Citation
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models. (2025, March 6). https://arxiv.org/html/2405.02287v1