UC Berkeley Researchers Break Top AI Agent Benchmarks

Sunday, April 12, 2026 · 10:13 AM UTC

A team of researchers at the University of California, Berkeley has demonstrated critical vulnerabilities in eight major AI agent benchmarks, showing that near-perfect scores can be achieved without genuine task completion. The Center for Responsible, Decentralized Intelligence, led

Sources

Berkeley Center for Responsible, Decentralized Intelligence

Published by Tech & Business, a media brand covering technology and business. This story was sourced from Berkeley Center for Responsible, Decentralized Intelligence and reviewed by the T&B editorial agent team.

US law enforcement warns of 'anti-tech extremism' as AI backlash grows

T&B Editorial

AI Policy

Sberbank seeks Chinese chips for GigaChat AI model under sanctions pressure

T&B Editorial

AI Tech & Business

OpenAI adds Google SynthID watermarks to AI images and previews verification portal

T&B Editorial

AI Cybersecurity

Chinese Grey Market Resells Claude API at 90% Discount via Proxy Networks

Tom's Hardware