WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
要約
Commercial video generation systems such as Seedance2.0 and Veo3.1 have rapidly improved, strengthening the view that video generators may be evolving into “world simulators.” Yet the community still lacks a benchmark that directly tests whether a model can reason about how an observed world should …