論文 深掘り Hugging Face 発表: 2026-05-19 HF ↑33

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

著者: Rongbin Tan, Fangfang Lin, Zhenlong Yuan, Min Qiu, Kejin Cui ほか8名

要約

Multimodal large language models (MLLMs) have shown remarkable capability in bridging visual perception and textual reasoning, enabling zero-shot understanding across diverse industrial scenarios. However, their performance in open-vocabulary industrial anomaly detection (IAD) is often limited by do…

#agent#llm#rl#multimodal#fine-tuning

同じカテゴリの記事