論文 Hugging Face 発表: 2026-05-13 HF ↑44

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

著者: Haoyi Zhu, Haozhe Liu, Yuyang Zhao, Tian Ye, Junsong Chen ほか4名

要約

We introduce SANA-WM, an efficient 2.6B-parameter open-source world model natively trained for one-minute generation, synthesizing high-fidelity, 720p, minute-scale videos with precise camera control. SANA-WM achieves visual quality comparable to large-scale industrial baselines such as LingBot-Worl…

#diffusion#benchmark

同じカテゴリの記事