llama.cpp adds activations, control-vector steering

// 114d ago INFRASTRUCTURE

llama.cpp now exposes `/activations` endpoints in `llama-server`, letting users capture per-layer activations, stream per-token vectors to disk, and feed them into a sparse autoencoder workflow. The companion pipeline turns those features into GGUF control vectors for real-time steering and interpretability work.
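
To make the steering idea concrete, here is a minimal sketch of how a control vector can act on one layer's hidden state. It assumes the common difference-of-means construction, which the announcement does not confirm as the pipeline's method. The function names and the toy 4-dimensional activations are hypothetical, and no llama.cpp API is called.

```python
from statistics import fmean

def mean_vector(rows):
    """Column-wise mean of a list of equal-length activation vectors."""
    return [fmean(col) for col in zip(*rows)]

def control_vector(positive, negative):
    """Assumed construction: mean activation on 'positive' prompts
    minus mean activation on 'negative' prompts."""
    pos, neg = mean_vector(positive), mean_vector(negative)
    return [p - n for p, n in zip(pos, neg)]

def steer(hidden, direction, scale):
    """Add the scaled direction to a single hidden-state vector."""
    return [h + scale * d for h, d in zip(hidden, direction)]

# Toy activations for one layer (hypothetical values, 4 dimensions).
positive = [[1.0, 0.0, 2.0, 0.0], [3.0, 0.0, 2.0, 0.0]]
negative = [[0.0, 0.0, 2.0, 0.0], [0.0, 0.0, 2.0, 0.0]]

direction = control_vector(positive, negative)
steered = steer([0.5, 0.5, 0.5, 0.5], direction, scale=0.5)
```

Only the first dimension differs between the two prompt sets, so only that dimension moves when the vector is applied. The `scale` argument is the knob that would need per-model calibration.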

// ANALYSIS

This is a genuinely useful bridge between mechanistic interpretability and a production-adjacent local serving stack, not just another notebook experiment.

– The live capture API makes activation analysis practical inside the server you already run, instead of forcing a separate instrumentation stack.
– The binary collection format and top-K mean view keep the data path simple enough to automate and scale.
– The inter-cluster differential scoring is the smartest part here: it targets behavior-specific features, not just whatever lights up on a single phrase set.
– The MoE scale caveat matters: control vectors are powerful, but their effective strength depends on the model and its embedding dimension, so the scale needs calibration.
– For local model users, this opens a clean path from observability to intervention: collect, train, probe, export, steer.
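
The differential-scoring idea in the list above can be sketched as follows. This is an illustration under assumptions, not the project's algorithm: each feature is scored by its mean activation in a target cluster minus its highest mean in any other cluster, and the top K scores are kept. The cluster names, feature values, and helper names are hypothetical.

```python
from statistics import fmean

def feature_means(samples):
    """Mean activation per feature across a cluster's samples."""
    return [fmean(col) for col in zip(*samples)]

def differential_scores(clusters, target):
    """Score = target-cluster mean minus the strongest mean in any other cluster.
    Features that fire everywhere score near zero."""
    target_means = feature_means(clusters[target])
    other_means = [feature_means(rows)
                   for name, rows in clusters.items() if name != target]
    return [t - max(m[i] for m in other_means)
            for i, t in enumerate(target_means)]

def top_k(scores, k):
    """Indices of the k highest-scoring features."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]

# Two hypothetical prompt clusters, three features each.
clusters = {
    "formal": [[4.0, 1.0, 0.0], [4.0, 3.0, 0.0]],
    "casual": [[4.0, 0.0, 1.0], [4.0, 0.0, 3.0]],
}

scores = differential_scores(clusters, "formal")
best = top_k(scores, k=1)
```

Feature 0 is strongly active in both clusters and scores zero, while feature 1 is active only for the target cluster and ranks first. That is the distinction between behavior-specific features and features that simply light up on any input.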

// TAGS

llama-cpp llm open-source inference research

DISCOVERED

114d ago

2026-03-20

PUBLISHED

114d ago

2026-03-20

RELEVANCE

8/10

AUTHOR

wattswrites
