論文深掘り arXiv 発表: 2026-05-21

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

著者: Ali Hatamizadeh, Yejin Choi, Jan Kautz

要約

Linear attention replaces the unbounded cache of softmax attention with a fixed-size recurrent state, reducing sequence mixing to linear time and decoding to constant memory. The hard part is not just what to forget, but how to edit this compressed memory without scrambling existing associations. De…

#coding#benchmark

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

要約

同じカテゴリの記事

Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

World-R1: テキストから動画生成における3D制約の強化学習による整合