ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
要約
Computer Use Agents (CUAs) can act through both atomic GUI actions, such as click and type, and high-level tool calls, such as API-based file operations, but this hybrid action space often leaves them uncertain about when to continue with GUI actions or switch to tools, leading to suboptimal executi…