AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents
AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents
要約
Vision-language model (VLM) agents increasingly rely on memory-augmented reinforcement learning to reuse experience across long-horizon tasks, yet most existing frameworks store memory as text and depend on proprietary teacher models to summarize or refine it. This design is poorly matched to spatia…