Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Article Status
Published
Title
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Date
2025-03-06 19:37:35
Accessed
06/03/2025, 19:37
Extra
Citation Key: zotero-12387 <Title>: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Citation
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models. (2025, March 6). https://arxiv.org/html/2405.02287v1