論文 Hugging Face 発表: 2026-05-10 HF ↑21

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

著者: Keming Wu, Yijing Cui, Wenhan Xue, Qijie Wang, Xuan Luo ほか9名

要約

Commercial video generation systems such as Seedance2.0 and Veo3.1 have rapidly improved, strengthening the view that video generators may be evolving into “world simulators.” Yet the community still lacks a benchmark that directly tests whether a model can reason about how an observed world should …

#benchmark

同じカテゴリの記事