論文 arXiv 発表: 2026-05-25

Looped Diffusion Language Models

Looped Diffusion Language Models

著者: Sanghyun Lee, Chunsan Hong, Seungryong Kim, Jonghyun Lee, Jongho Park ほか1名

要約

Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models for language modeling, yet the effective design of transformer architectures for MDMs remains underexplored. In this paper, we show that selectively looping the early-middle transformer layers significant…

#diffusion#benchmark

同じカテゴリの記事