論文 Hugging Face 発表: 2026-05-26 HF ↑11

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

著者: HuiMing Fan, Xiao Wang, Zheng Chu, Qianyu Wang, Zhuoyao Wang ほか3名

要約

Are LLM-based search agents genuinely searching, or using the web to verify what they already know? We study this question on BrowseComp with three diagnostics. Our analysis reveals Intrinsic Knowledge Dependence (IKD): even with tool access, agents often rely on intrinsic knowledge — information e…

#agent#benchmark#llm

同じカテゴリの記事