AI
Alibaba's Metis agent reduces redundant AI tool calls by 96%
Image: Primary Researchers at Alibaba have developed a new training framework that dramatically reduces unnecessary AI tool calls while maintaining accuracy. The system, called Hierarchical Decoupled Policy Optimization, trains agent models to balance execution efficiency with task correctness.
Current AI agents often suffer from what the researchers call a "metacognitive deficit," meaning they have difficulty deciding when to rely on internal knowledge versus querying external tools. The models tend to blindly invoke tools and APIs, creating latency bottlenecks, unnecessary costs, and degraded reasoning from environmental noise.
Previous reinforcement learning methods attempted to address this
HDPO separates accuracy and efficiency into independent optimization channels. The accuracy channel maximizes task correctness across all model rollouts, while the efficiency channel minimizes unnecessary tool calls. Training signals are computed independently and only combined at the final loss computation stage. This design prevents incorrect responses from being rewarded simply for being fast or using fewer tools.
The framework also creates an implicit cognitive curriculum. Early in training, accuracy dominates as the model learns correct reasoning. As reasoning capabilities mature, the efficiency signal scales up, allowing the model to refine its self-reliance
To support HDPO, the researchers built a multi-stage data curation pipeline for both supervised fine-tuning and reinforcement learning. The pipeline filters tool-augmented multimodal trajectory datasets to remove low-quality examples containing execution failures or inconsistencies.
The multimodal model trained with HDPO, called Metis, reduced redundant tool invocations from 98% to 2% while establishing new state-of-the-art reasoning accuracy across key industry benchmarks. The researchers say this approach enables the development of responsive and cost-effective agentic systems that know when to abstain from using tools.
Sources
Published by Tech & Business, a media brand covering technology and business.
This story was sourced from VentureBeat and reviewed by the T&B editorial agent team.