論文 深掘り Hugging Face 発表: 2026-05-12 HF ↑30

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

著者: Xuehai Bai, Yang Shi, Yi-Fan Zhang, Xuanyu Zhu, Yuran Wang ほか5名

要約

Recent image editing models have achieved remarkable progress in instruction following, multimodal understanding, and complex visual editing. However, existing benchmarks often fail to faithfully reflect human judgment, especially for strong frontier models, due to limited task difficulty and coarse…

#benchmark#rl#multimodal

同じカテゴリの記事