AURA: Action-Gated Memory for Robot Policies at Constant VRAM
Abstract
The KV-cache is the right memory for datacenters but the wrong memory for robots. Datacenter inference batches many short requests and resets them, amortizing an attention cache across a crowd. Embodied agents instead run one long, non-resetting episode on bandwidth-limited edge hardware, where high…