論文 Hugging Face 発表: 2026-06-10 HF ↑22

On Subquadratic Architectures: From Applications to Principles

On Subquadratic Architectures: From Applications to Principles

著者: Anamaria-Roberta Hartl, Levente Zólyomi, David Stap, Pieter-Jan Hoedt, Niklas Schmidinger ほか4名

要約

Transformers dominate modern sequence modeling, but their quadratic attention incurs substantial computational cost. Subquadratic architectures offer a scalable alternative. However, it remains unclear which designs yield the most effective sequence models. We compare three leading approaches: xLSTM…

#llm#benchmark

同じカテゴリの記事