{"version":"1.0","type":"rich","provider_name":"Tech & Business","provider_url":"https://techandbusiness.org","title":"Fast LLM inference techniques detailed for Anthropic and OpenAI models","author_name":"T&B Newswire · AI","thumbnail_url":"https://www.seangoedecke.com/og-image.jpg","width":600,"height":400,"html":"<blockquote class=\"tb-newswire-embed\" style=\"max-width:600px;border-left:3px solid #22d3ee;padding:12px 16px;margin:0;font-family:-apple-system,system-ui,sans-serif;background:#09090b;border-radius:0 8px 8px 0;\">\n      <p style=\"margin:0 0 8px;font-size:10px;font-weight:600;letter-spacing:0.1em;color:#71717a;text-transform:uppercase;\">T&B NEWSWIRE · AI</p>\n      <p style=\"margin:0 0 8px;font-size:18px;font-weight:700;line-height:1.3;color:#fff;\"><a href=\"https://techandbusiness.org/newswire/WMYow9Ig064KslncDOfOUW\" style=\"color:#fff;text-decoration:none;\">Fast LLM inference techniques detailed for Anthropic and OpenAI models</a></p>\n      <p style=\"margin:0;font-size:14px;color:#a1a1aa;line-height:1.5;\">Anthropic and OpenAI recently announced fast mode options that let users interact with their top coding models at higher speeds. The two versions differ in speed, model quality and underlying techniqu...</p>\n    </blockquote>"}