Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Article Status
Published
Title
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Accessed
March 6, 2025, 19:37
Extra
Citation Key: zotero-12387
Citation
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models. (n.d.). Retrieved March 6, 2025, from https://arxiv.org/html/2405.02287v1