# Scaling Karpathy's Autoresearch with GPU cluster

_Friday, June 26, 2026 at 6:15 PM EDT · AI · Latest · Tier 2 — Notable_

![Scaling Karpathy's Autoresearch with GPU cluster — Primary](https://blog.skypilot.co/scaling-autoresearch/assets/banner.png)

A team directed the coding agent Claude Code at Andrej Karpathy's autoresearch project and provided access to 16 GPUs on a Kubernetes cluster. The agent employed SkyPilot to launch and manage jobs across the infrastructure. It utilized a combination of H100 and H200 GPUs as they became available.

Over eight hours the agent submitted about 910 experiments. Scaling model width emerged as more impactful than changes to any other single hyperparameter. The agent lowered val_bpb from 1.003 to 0.974 for a 2.87 percent improvement over the baseline.

Parallel operation permitted the agent to run factorial grids of 10 to 13 experiments in each wave. This capability revealed interaction effects among parameters that sequential testing would miss. The agent learned to screen ideas on H100 GPUs and promote successful ones to H200 GPUs for validation.

The session produced roughly 90 experiments per hour. Throughput increased by a factor of nine relative to single-GPU runs. About 700 experiments yielded valid results.

## Sources

- [SkyPilot Blog](https://blog.skypilot.co/scaling-autoresearch/)

---
Canonical: https://techandbusiness.org/newswire/dwShKCC5FBZlnWiQ1QUNzP
Retrieved: 2026-06-27T05:34:01.862Z
Publisher: Tech & Business (techandbusiness.org)
