DeepSeek's Engram adds conditional memory to LLMs
YT · YOUTUBE // 18d ago // RESEARCH PAPER

DeepSeek's Engram is an open-source conditional-memory module for LLMs, using hashed n-gram lookup tables to fetch static patterns in O(1) time instead of routing every recall step through dense transformer compute. The paper and official repo report gains over an iso-parameter, iso-FLOPs MoE baseline on knowledge, reasoning, code, math, and long-context benchmarks.
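To make the O(1) claim concrete, here is a minimal sketch of a hashed n-gram lookup table. All names and sizes (`TABLE_SIZE`, `EMBED_DIM`, `ngram_slot`) are illustrative assumptions, not Engram's actual implementation: the point is only that addressing a static pattern costs one hash and one array index, with no attention or expert routing involved.

```python
import hashlib

# Illustrative sizes, not Engram's real configuration.
TABLE_SIZE = 2 ** 16          # number of memory slots
EMBED_DIM = 8                 # width of each stored pattern vector

# The memory plane: a flat table of vectors, addressed by hash.
memory_table = [[0.0] * EMBED_DIM for _ in range(TABLE_SIZE)]

def ngram_slot(tokens, n=3):
    """Deterministically map the trailing n tokens to a table slot.

    Hashing makes addressing O(1): the slot depends only on the token
    ids themselves, never on model state.
    """
    key = "\x00".join(str(t) for t in tokens[-n:]).encode()
    digest = hashlib.blake2b(key, digest_size=8).digest()
    return int.from_bytes(digest, "little") % TABLE_SIZE

def lookup(tokens, n=3):
    """Fetch the stored pattern vector for the current n-gram context."""
    return memory_table[ngram_slot(tokens, n)]

# Only the last n tokens matter, so a longer prefix hits the same slot.
assert ngram_slot([17, 42, 99]) == ngram_slot([5, 17, 42, 99])
```

The dense-transformer alternative this replaces would spend a full forward pass to "recall" the same static association; the table turns that recall into a constant-time gather.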

// ANALYSIS

This feels less like a clever tweak and more like an attempt to give sparse LLMs a real memory plane: let the model store reusable facts cheaply, and spend neural depth on actual reasoning. If those results generalize, Engram is the kind of architectural idea that could outlive a single model family.

  • It cleanly splits conditional compute from conditional memory, which is a more interesting scaling axis than simply adding more experts.
  • The strongest signal is breadth: the paper claims wins on knowledge, reasoning, code, math, and long-context tasks, not just factual recall.
  • Deterministic addressing and host-memory prefetching make the efficiency story compelling for serving, especially when long-context throughput matters.
  • The big risk is operational: memory collisions, table growth, and retrieval quality will decide whether Engram stays a research win or becomes a production primitive.
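The deterministic-addressing point above has a concrete serving consequence, sketched below under assumed design choices (the rolling hash, table layout, and function names are hypothetical, not the official pipeline): because a slot index depends only on token ids, slots for an entire sequence can be computed before the forward pass begins, and the needed rows prefetched from host memory in one batch.

```python
def precompute_slots(token_ids, n=3, table_size=1024):
    """Return one table slot per position via a simple rolling hash.

    No model state is needed, so this can run ahead of (and overlap
    with) the transformer forward pass.
    """
    slots = []
    for i in range(len(token_ids)):
        window = token_ids[max(0, i - n + 1): i + 1]
        h = 0
        for t in window:
            h = (h * 1000003 + t) % table_size  # deterministic, no RNG
        slots.append(h)
    return slots

def prefetch(table, slots):
    """Gather the needed rows up front from a host-memory table."""
    return [table[s] for s in slots]

# Toy 1-dimensional memory table standing in for host RAM.
table = [[float(i)] for i in range(1024)]
tokens = [3, 14, 15, 92, 65]
rows = prefetch(table, precompute_slots(tokens))
```

This is also where the operational risk in the last bullet lives: hash collisions merge unrelated n-grams into one slot, and growing the table to reduce collisions grows the host-memory footprint.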
// TAGS
engram · llm · embedding · reasoning · benchmark · inference · research · open-source

DISCOVERED

2026-03-24 (18d ago)

PUBLISHED

2026-03-24 (18d ago)

RELEVANCE

9/10

AUTHOR

Two Minute Papers