disco-torch brings DeepMind Disco103 to PyTorch

// 125d agoOPENSOURCE RELEASE

disco-torch brings DeepMind Disco103 to PyTorch

disco-torch is a new open-source PyTorch port of DeepMind's Disco103 meta-learned reinforcement learning update rule from the 2025 Nature paper, packaged with a pip install, Colab notebook, pretrained weights, and a higher-level DiscoTrainer API. The repo claims numerical parity with the JAX reference and a 99% catch rate on the reference Catch benchmark, making a research-heavy result much easier to experiment with.

// ANALYSIS

The Claude Code angle is the hook, but the durable story is reproducibility: a state-of-the-art RL update rule just got turned into something ordinary PyTorch users can actually run. That matters more than the Reddit post itself because ports like this shorten the gap between reading a paper and testing whether it survives outside the original lab stack.

–Packaging Disco103 as a pip-installable PyTorch library lowers the barrier for RL researchers who do not want to work inside JAX-first research code
–The included Colab notebook, pretrained weights, and DiscoTrainer wrapper make this feel closer to a usable research toolkit than a one-off code dump
–If the validation numbers hold up beyond the Catch benchmark, this could become a convenient baseline for testing learned update rules against hand-designed PPO- and GRPO-style training
–The repo is still very early, so the real signal will be independent reproduction and whether the port works cleanly in larger custom agent pipelines

// TAGS

disco-torchresearchopen-sourceagentsdk

DISCOVERED

125d ago

2026-03-09

PUBLISHED

125d ago

2026-03-08

RELEVANCE

7/ 10

AUTHOR

Far-Respect-4827

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

VIDEO44m ago

Higgsfield drops developer CLI and MCP server

Higgsfield has launched a developer CLI and MCP server, allowing programmers and autonomous agents to programmatically trigger, customize, and edit marketing ads and cinematic videos directly through terminal commands. Demonstrated by developer Cole Medin using Anthropic's Claude Code and the Archon workflow engine, the toolkit enables fully automated video production pipelines.

OPEN SOURCE44m ago

AI Content Factory automates video ads

AI Content Factory is an open-source workflow that automates bulk marketing video generation from a product catalog. Built on the Archon agentic engine and Higgsfield CLI, it reduces costs by gating expensive video rendering behind cheap image exploration and human approval.

NEWS2h ago

George Hotz shares his enthusiasm for LLMs and open-source coding agents while criticizing doom-mongering and the overinflated valuations of frontier AI labs.

George Hotz (geohot) details his excitement for the practical applications of AI—such as LLMs, self-driving cars, video generation models, and AI coding agents—highlighting his successful setup of the open-source agent OpenCode on a local GLM-5.2 model. However, he strongly criticizes the prevailing industry hype, safety-related doom-mongering, and the multibillion-dollar valuations of frontier AI labs. Hotz argues that frontier labs will fail to capture most of the AI value because AI is a commodity driven by Moore's law and general computing progress. He also frames coding models not as autonomous creators, but as valuable productivity tools analogous to compilers, find-and-replace, or Stack Overflow that are changing the nature of programming.