HRM-Text-1B shows architecture still beats scale
HRM-Text-1B is Sapient Intelligence’s open-source 1B-parameter language model built on the Hierarchical Reasoning Model architecture. The release includes weights and code, and the model card describes it as trained from scratch on structured public datasets; the paper reports roughly 40B unique tokens and about $1,500 of compute, with benchmark results that compete with much larger 2B-7B open models on reasoning-heavy tasks.
This is less about a tiny model beating a bigger one and more about proving that architecture and training objective still matter when you optimize for reasoning efficiency. The release is genuinely open, with weights, code, and a reproducible pipeline, and the reported cost/performance ratio is strong on math and reasoning benchmarks. The main caveat is that the model card describes it as a pre-alignment PrefixLM checkpoint, so it is not a drop-in general-purpose chat assistant, and the benchmark claims should be read as task-specific rather than universal. The "thinks internally" framing is shorthand for the HRM design, not evidence of human-like reasoning.
DISCOVERED
1h ago
2026-05-22
PUBLISHED
2h ago
2026-05-22
RELEVANCE
AUTHOR
AlphaSignalAI