HRM-Text slashes pretraining compute, data

// 45d ago | MODEL RELEASE

HRM-Text is a 1B-parameter text-generation model and training framework that replaces the standard Transformer with a hierarchical recurrent architecture. The paper claims competitive benchmark results after training from scratch on 40B tokens with about $1,500 in compute.
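To put the claimed budget in perspective, here is a back-of-the-envelope sketch using only the figures quoted above (1B parameters, 40B tokens, about $1,500). The FLOPs estimate relies on the common 6 × parameters × tokens rule of thumb for dense Transformers, which is an assumption here: a recurrent model that reuses its weights across several steps would likely spend more compute per parameter, so treat that number as a rough lower bound, not a figure from the paper.

```python
# Figures quoted in the summary; function names are illustrative only.
PARAMS = 1_000_000_000      # "1B" model
TOKENS = 40_000_000_000     # 40B training tokens
COST_USD = 1_500            # "about $1,500" in compute


def usd_per_billion_tokens(cost_usd: float, tokens: int) -> float:
    """Claimed training cost normalized per billion tokens."""
    return cost_usd / (tokens / 1_000_000_000)


def dense_transformer_flops(params: int, tokens: int) -> int:
    """Rule-of-thumb training FLOPs for a dense Transformer (6 * N * D).

    Assumption: this heuristic may undercount a recurrent architecture
    that applies the same weights multiple times per token.
    """
    return 6 * params * tokens


if __name__ == "__main__":
    print(f"USD per billion tokens: {usd_per_billion_tokens(COST_USD, TOKENS)}")
    print(f"Rule-of-thumb training FLOPs: {dense_transformer_flops(PARAMS, TOKENS):.2e}")
```

Under these assumptions the claim works out to $37.50 per billion training tokens.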

// ANALYSIS

This is a serious efficiency claim, not just another architecture tweak: HRM-Text argues you can buy down pretraining cost by changing both the model and the objective, not by scaling harder.

– The headline numbers the paper reports: 60.7% MMLU, 81.9% ARC-C, 84.5% GSM8K, and 56.2% MATH from a 1B-parameter model trained on 40B tokens for about $1,500
– The repo packages the work as an end-to-end pretraining stack, with data sampling, FSDP2 training, evaluation, and checkpoint conversion, so it is meant to be reproduced, not just admired
– The main novelty is the co-design of architecture and training objective: hierarchical recurrence, MagicNorm, deep credit-assignment warmup, and PrefixLM masking
– If the results hold up outside the authors’ setup, this is a useful signal for labs that want controlled, cheaper pretraining runs rather than internet-scale brute force
– The obvious caveat is that this remains a research result until broader replication and independent evals confirm the compute savings and quality tradeoffs
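For readers unfamiliar with PrefixLM masking, the sketch below shows the generic idea: tokens in the prefix (for example, a prompt) attend to each other in both directions, while the remaining tokens attend causally. This is a minimal illustration of the general technique, not HRM-Text's implementation; the function name `prefix_lm_mask` and its parameters are hypothetical.

```python
def prefix_lm_mask(seq_len: int, prefix_len: int) -> list[list[bool]]:
    """Build a PrefixLM attention mask.

    mask[i][j] is True when query position i may attend to key position j.
    Positions before prefix_len see the whole prefix (bidirectional);
    later positions see the prefix plus earlier or equal positions (causal).
    """
    if not 0 <= prefix_len <= seq_len:
        raise ValueError("prefix_len must be between 0 and seq_len")
    return [
        [j < prefix_len or j <= i for j in range(seq_len)]
        for i in range(seq_len)
    ]


def render(mask: list[list[bool]]) -> str:
    """Render the mask as rows of 1s (visible) and 0s (masked)."""
    return "\n".join("".join("1" if ok else "0" for ok in row) for row in mask)


if __name__ == "__main__":
    # 5 tokens, the first 2 of which form the bidirectional prefix.
    print(render(prefix_lm_mask(seq_len=5, prefix_len=2)))
```

With `prefix_len=0` the mask reduces to an ordinary causal mask, which is one way to see PrefixLM as a generalization of standard left-to-right language modeling.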

// TAGS

llm, reasoning, training, open-source, research, hrm-text

DISCOVERED

45d ago

2026-05-21

PUBLISHED

45d ago

2026-05-21

RELEVANCE

9/10

AUTHOR

AlphaSignalAI
