Karpathy's AutoResearch runs 700 experiments in 2 days
REDDIT // 20d ago // OPEN-SOURCE RELEASE


Karpathy's autoresearch repo lets an agent edit a single training file, run fixed 5-minute experiments on one GPU, and keep or discard each change, iterating overnight on a small LLM. Fortune reports the loop ran 700 experiments in two days, surfaced 20 optimizations, and that those optimizations translated into an 11% training speedup on a larger model.

// ANALYSIS

This is less self-improving AGI than agentic AutoML with tighter ergonomics, and that's exactly why it matters: once you can define a metric and a time box, the search loop becomes software.

  • The throughput is the point: 700 experiments in 2 days. Fortune reports Tobias Lütke tried the same loop on internal data overnight and got a 19% gain after 37 experiments. [Fortune](https://fortune.com/2026/03/17/andrej-karpathy-loop-autonomous-ai-agents-future/)
  • The repo is intentionally tiny: `program.md` sets the mission, `train.py` is the only editable code path, and each trial is capped at 5 minutes. [GitHub](https://github.com/karpathy/autoresearch)
  • That makes the pattern easy to port to other measurable workflows, from model tuning to data prep to CI-style optimization.
  • It is still narrow, not recursive intelligence, but the narrowness is the whole value proposition because teams can ship it today.
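The "metric plus time box" pattern above can be sketched in a few lines. This is a hypothetical illustration, not code from the repo: `run_trial`, the `lr` knob, and the hill-climbing proposal step are all stand-ins for the real setup, where an LLM agent edits `train.py` and each trial is a wall-clock-capped training run.

```python
import random  # stand-in for the agent's proposal step

TRIAL_BUDGET_SECONDS = 300  # fixed time box per experiment (5 minutes)

def run_trial(params):
    """Stand-in for one capped training run; returns the metric to optimize.
    Faked here as a noisy function peaking at lr = 0.3."""
    lr = params["lr"]
    return -(lr - 0.3) ** 2 + random.gauss(0, 0.001)

def search(n_experiments):
    """Keep-or-discard loop: propose an edit, measure it, keep it only if
    the metric improves, otherwise revert to the last good version."""
    best = {"lr": 0.1}
    best_score = run_trial(best)
    for _ in range(n_experiments):
        # propose a small change to the current best configuration
        candidate = {"lr": max(1e-4, best["lr"] + random.gauss(0, 0.05))}
        score = run_trial(candidate)  # would be time-boxed in practice
        if score > best_score:
            best, best_score = candidate, score  # keep the edit
        # else: discard (revert the file to the last good version)
    return best, best_score
```

Everything that makes the pattern portable is visible here: a scalar metric, a fixed per-trial budget, and an accept/revert rule. Swapping `run_trial` for a subprocess with a timeout turns this toy into the shape of the real loop.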
// TAGS
autoresearch · agent · research-automation · open-source · llm · mlops

DISCOVERED

20d ago

2026-03-23

PUBLISHED

20d ago

2026-03-23

RELEVANCE

9/10

AUTHOR

tekz