Paper Hugging Face Published: 2026-05-13 HF ↑46

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Authors: Minghao Guo, Qingyue Jiao, Zeru Shi, Yihao Quan, Boxuan Zhang, and 12 others

Abstract

Long-term agent memory is increasingly multimodal, yet existing evaluations rarely test whether agents preserve the visual evidence needed for later reasoning. In prior work, many visually grounded questions can be answered using only captions or textual traces, allowing answers to be inferred witho…

#multimodal #agent #benchmark
