Mnemic steers MoE routers at inference time
OPEN_SOURCE
REDDIT // 5h ago · OPEN SOURCE RELEASE


Mnemic is an early-alpha open-source package that claims to add or surface new knowledge in frozen mixture-of-experts models by steering expert routing at inference time, without weight updates, LoRA, or RAG. The project packages the approach as "Adaptive Cognitive Intelligence," built from Engram, MRE, and Guardrails components, and says testing so far has mostly been on Gemma 4 26B.

// ANALYSIS

Hot take: if even a fraction of this holds up outside the author’s own benchmarks, it’s a legitimately new inference-time control surface for MoE models rather than just another memory/RAG wrapper.

  • The repo explicitly positions Mnemic as zero-training, zero-weight-modification runtime knowledge assimilation for MoE systems, with `mnemic-mre` published on GitHub alongside PyPI-style install instructions.
  • The strongest claim is not “better retrieval,” but “route the model into the right experts,” which is interesting because it leans on architecture already present in the base model.
  • The current evidence base is thin: the README says alpha, the tests are mostly on Gemma 4 26B, and the demo claims are self-reported, so replication by third parties matters a lot.
  • The product feels most plausible as a research-to-tooling bridge for MoE experimentation, not as a drop-in universal knowledge layer yet.
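Mnemic's actual API is not shown in the post, but the core idea the repo describes is mechanically simple: an MoE layer's frozen router produces a logit per expert, and adding a steering bias to those logits before top-k selection changes which experts receive a token without touching any weights. A minimal toy sketch of that mechanism (illustrative only; the function names and numbers here are assumptions, not Mnemic's code):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def route(logits, k=2, bias=None):
    """Top-k expert selection with an optional inference-time bias.

    `logits` stands in for a frozen router's per-expert scores for one
    token. `bias` is the steering vector: added before top-k, it can
    redirect the token to different experts without any weight update.
    """
    if bias is not None:
        logits = [l + b for l, b in zip(logits, bias)]
    topk = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    weights = softmax([logits[i] for i in topk])
    return list(zip(topk, weights))  # (expert index, gate weight) pairs

# Hypothetical router logits for one token over 4 experts.
logits = [1.2, 0.3, 0.9, 0.1]
print(route(logits))                       # experts 0 and 2 win by default
print(route(logits, bias=[0, 2.0, 0, 0]))  # steering pushes expert 1 ahead
```

In a real MoE checkpoint this bias would be injected into the router's forward pass (e.g. via a hook on the gating module) rather than into a standalone function; the open question the analysis raises is how such a bias is derived so that it reliably maps new knowledge onto useful experts.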
// TAGS
moe · expert-routing · inference-time · knowledge-assimilation · llm · gemma · guardrails · open-source · alpha

DISCOVERED

5h ago

2026-04-18

PUBLISHED

6h ago

2026-04-18

RELEVANCE

9/10

AUTHOR

superman_27