When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems
When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems
要約
The design space of agentic AI inference spans two extremes: frontier large language models (LLMs), typically hosted in the cloud and offering strong performance across a wide range of tasks at substantially high cost, and more cost-efficient small language models (SLMs), which are amenable to on-de…