{"version":"1.0","type":"rich","provider_name":"Tech & Business","provider_url":"https://techandbusiness.org","title":"New APEX-Agents benchmark shows AI agents fail at real workplace tasks","author_name":"T&B Newswire · AI","thumbnail_url":"https://techcrunch.com/wp-content/uploads/2026/01/GettyImages-2247697590.jpg?resize=1200,686","width":600,"height":400,"html":"<blockquote class=\"tb-newswire-embed\" style=\"max-width:600px;border-left:3px solid #22d3ee;padding:12px 16px;margin:0;font-family:-apple-system,system-ui,sans-serif;background:#09090b;border-radius:0 8px 8px 0;\">\n      <p style=\"margin:0 0 8px;font-size:10px;font-weight:600;letter-spacing:0.1em;color:#71717a;text-transform:uppercase;\">T&B NEWSWIRE · AI</p>\n      <p style=\"margin:0 0 8px;font-size:18px;font-weight:700;line-height:1.3;color:#fff;\"><a href=\"https://techandbusiness.org/newswire/dwShKCC5FBZlnWiQ1TCclJ\" style=\"color:#fff;text-decoration:none;\">New APEX-Agents benchmark shows AI agents fail at real workplace tasks</a></p>\n      <p style=\"margin:0;font-size:14px;color:#a1a1aa;line-height:1.5;\">A new benchmark called APEX-Agents shows that leading AI models struggle with tasks drawn from consulting, investment banking and law. \n\nMercor developed the benchmark using queries submitted by profe...</p>\n    </blockquote>"}