State Flow Machine tops transformers on length extrapolation
OPEN_SOURCE
REDDIT · 27d ago · BENCHMARK RESULT


A solo researcher has open-sourced State Flow Machine (SFM), a non-transformer architecture that replaces attention heads with explicit memory "state slots." On a synthetic state-tracking benchmark, it reaches 62% accuracy when extrapolating to sequences 4x the training length, versus ~2% for transformers of any size.

// ANALYSIS

A clean experimental result on a narrow synthetic task. The real test comes when Mamba, RWKV, and other SSMs are added to the comparison table: those recurrent architectures, not transformers, are the natural competitors at this kind of state tracking.

  • State slots replace attention with 16 named memory registers updated via gated DeltaNet recurrent cells — an explicit, directly addressable alternative to attention's implicit token-history compression
  • The transformer collapse at 4x length (~2%) is theoretically expected: TC⁰ circuit-complexity limits make constant-depth vanilla attention provably weak at algorithmic state-tracking tasks
  • Important caveat: SFM uses intermediate state supervision (auxiliary loss at every operation step), giving it significantly more gradient signal than transformer baselines — disclosed but not equalized
  • No comparison to Mamba, RWKV, or other SSMs yet, which the author acknowledges — those architectures are designed for exactly this kind of recurrent state tracking
  • Built with AI assistance (Claude Opus 4.6 as co-author), runs only on Huawei Ascend NPUs, and has zero stars on a 3-day-old repo — reproducibility for most researchers is limited until a CUDA port appears
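SFM's code isn't quoted in the post, but the gated delta-rule update that DeltaNet-style recurrent cells use for a single memory slot can be sketched as follows. All names, dimensions, and the 16-slot layout here are illustrative assumptions, not SFM's actual implementation:

```python
import numpy as np

def gated_delta_update(S, k, v, beta, alpha):
    """One gated delta-rule step for a single memory slot (illustrative).

    S     : (d, d) slot state matrix (an associative key->value memory)
    k, v  : (d,) key / value vectors (k assumed unit-norm)
    beta  : write strength in [0, 1]
    alpha : retention gate in [0, 1] (1 = keep old state fully)
    """
    # Erase the component of S stored along k, decay the rest by alpha...
    S = alpha * (S - beta * np.outer(S @ k, k))
    # ...then write the new association v @ k^T with strength beta.
    return S + beta * np.outer(v, k)

# 16 named slots, each a (d x d) associative memory -- assumed layout.
d, n_slots = 8, 16
slots = np.zeros((n_slots, d, d))

rng = np.random.default_rng(0)
k = rng.standard_normal(d)
k /= np.linalg.norm(k)          # unit-norm key
v = rng.standard_normal(d)

# Write (k -> v) into slot 0 at full strength, then read it back.
slots[0] = gated_delta_update(slots[0], k, v, beta=1.0, alpha=1.0)
recalled = slots[0] @ k          # recovers v (exact here: empty state, beta=1)
```

The erase-then-write structure is what makes the memory directly addressable: a slot can overwrite a specific key's value without disturbing associations stored along other keys, which is the contrast the bullet above draws with attention's implicit compression of token history.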
// TAGS
state-flow-machine · llm · open-source · benchmark · research · reasoning

DISCOVERED

2026-03-16

PUBLISHED

2026-03-16

RELEVANCE

5/10

AUTHOR

Own-Albatross868