論文 深掘り Hugging Face 発表: 2026-05-27 HF ↑35

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

著者: Min Zhao, Hongzhou Zhu, Bokai Yan, Zihan Zhou, Yimin Chen ほか7名

要約

Recent video diffusion foundation models have achieved remarkable progress in high-quality video generation, yet turning them into real-time interactive video world models remains challenging. Interactive world models require controllable, causal, and low-latency rollout, which in practice demands a…

#diffusion#fine-tuning

同じカテゴリの記事