Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Article Status
Published
Title
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Accessed
March 6, 2025, 19:37
Extra
Citation Key: zotero-12387 <Title>: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Citation
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models. (n.d.). Retrieved March 6, 2025, from https://arxiv.org/html/2405.02287v1