OPEN_SOURCE
REDDIT // 4h ago // RESEARCH PAPER
Mamba-3 Weights Stay Missing After Release
A LocalLLaMA thread flags an awkward gap in the Mamba-3 release: the paper and Together AI blog report pretrained benchmark results, and the GitHub repo exposes Mamba-3 code/kernels, but the official Hugging Face listings still appear to cover Mamba and Mamba-2 weights rather than the benchmarked Mamba-3 checkpoints.
// ANALYSIS
Mamba-3 looks technically important, but the release lands in an uncomfortable middle ground: enough code to study the architecture, not enough weights to reproduce the headline claims cleanly.
- The paper claims Mamba-3 improves downstream accuracy and inference efficiency at the 1.5B scale, including a stronger MIMO variant with no added decode latency.
- Together AI says the kernels are open-sourced, but that is not the same as releasing the trained checkpoints behind the benchmark tables.
- The state-spaces GitHub README lists pretrained Hugging Face models for Mamba and Mamba-2, while its Mamba-3 demo uses random tensors for block-level usage.
- For developers, this makes Mamba-3 more of a research architecture drop than a usable model release until official weights or reproducible training configs appear.
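The gap is easy to check programmatically. Below is a minimal, stdlib-only sketch that groups Hugging Face repo names by Mamba generation using the repo naming pattern (`mamba-2.8b` is a generation-1 model of 2.8B parameters, while `mamba2-2.7b` is generation 2). The `weight_coverage` helper and the `listed` names are illustrative assumptions, not a live API call against the Hub.

```python
import re

def weight_coverage(repo_names):
    """Group model repo names by Mamba generation.

    Hypothetical helper for illustration: the generation digit is the
    one fused to "mamba" before the hyphen (mamba2-...), while the
    number after the hyphen is the parameter count (mamba-2.8b).
    """
    coverage = {}
    for name in repo_names:
        m = re.search(r"mamba(\d*)-", name.lower())
        if m:
            gen = m.group(1) or "1"  # bare "mamba-" means generation 1
            coverage.setdefault(gen, []).append(name)
    return coverage

# Illustrative listing mirroring the thread's observation: weights for
# Mamba and Mamba-2 are published, but no Mamba-3 checkpoint appears.
listed = ["state-spaces/mamba-2.8b", "state-spaces/mamba2-2.7b"]
cov = weight_coverage(listed)
print("3" in cov)  # prints False — no Mamba-3 checkpoints in the listing
```

The same check against the real `state-spaces` listing (e.g. via the Hub's model-listing API) would confirm or refute the thread's claim as weights get uploaded.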
// TAGS
mamba-3 · llm · inference · research · benchmark
DISCOVERED
4h ago
2026-04-22
PUBLISHED
6h ago
2026-04-22
RELEVANCE
7/10
AUTHOR
Designer_Win6465