llama.cpp mmap path enables live tampering

// SECURITY INCIDENT · 123d ago

A new proof of concept shows a running llama-server can start reading modified GGUF weights mid-inference when the model file is memory-mapped and another process still has write access to it. That turns shared volumes and weak file isolation into a real integrity risk for local LLM deployments, even without restarting the server or injecting code.
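The underlying mechanism can be illustrated without llama.cpp at all. The following is a minimal sketch, not the PoC itself: it assumes a Linux-style shared file mapping, uses a throwaway temp file in place of a GGUF model, and uses a second file handle in the same process to stand in for the other process with write access. The function name `demo_live_change` is hypothetical.

```python
import mmap
import os
import tempfile


def demo_live_change() -> tuple[bytes, bytes]:
    """Map a file read-only, overwrite it through a second handle,
    and return the first 8 mapped bytes before and after the write."""
    fd, path = tempfile.mkstemp()
    try:
        # Stand-in for the model file: 4 KiB of filler bytes.
        os.write(fd, b"A" * 4096)
        os.close(fd)

        with open(path, "rb") as reader:
            # Read-only mapping, analogous to a server mapping its weights.
            view = mmap.mmap(reader.fileno(), 0, access=mmap.ACCESS_READ)
            before = view[:8]

            # A separate writable handle plays the role of the second
            # process (assumption: it has write access to the same file).
            with open(path, "r+b") as writer:
                writer.write(b"PWNED!!!")
                writer.flush()

            # The mapping was never re-opened, yet it reflects the write.
            after = view[:8]
            view.close()
        return before, after
    finally:
        os.remove(path)


if __name__ == "__main__":
    before, after = demo_live_change()
    print("before:", before)
    print("after: ", after)
```

The reader never reopens or remaps the file, which is why a long-running server would have no obvious signal that the bytes behind its mapping changed.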

// ANALYSIS

This is the kind of LLM security issue developers underestimate because nothing “crashes” and the server still looks healthy. It is less a model bug than an ops-layer failure mode where mmap, shared storage, and permissive mounts quietly become part of the attack surface.

– The PoC targets output.weight in a GGUF file and shows token logits can be biased live, forcing responses like “Pwned” across both completion and chat endpoints.
– The attack needs write access to the model artifact, not root, ptrace, or code injection, which makes sloppy Docker and local dev setups the real problem zone.
– --no-mmap, read-only model mounts, dedicated serving users, and runtime integrity checks look a lot less optional after this.
– For teams shipping local copilots or on-prem inference, model files need to be treated like executable assets, not passive data blobs.
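As a rough illustration of what a runtime integrity check could look like, here is a minimal sketch that assumes a trusted SHA-256 digest of the model file was recorded at deploy time. It hashes the file on disk, not the server's in-memory mapping, so it only detects changes to the artifact itself. All function names are hypothetical, and the permission check assumes POSIX mode bits.

```python
import hashlib
import os
import stat


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def model_unchanged(path: str, expected_hex: str) -> bool:
    """Compare the file's current digest with one recorded earlier
    (assumption: expected_hex comes from a trusted source)."""
    return sha256_of(path) == expected_hex


def writable_by_group_or_others(path: str) -> bool:
    """Flag model files whose mode bits let non-owners write to them."""
    mode = os.stat(path).st_mode
    return bool(mode & (stat.S_IWGRP | stat.S_IWOTH))
```

A periodic job could call `model_unchanged` against the recorded digest and alert on a mismatch. Rehashing a multi-gigabyte file is not free, so the interval is a trade-off, and a check like this complements read-only mounts instead of replacing them.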

// TAGS

llama-cpp · llm · open-source · inference · safety

DISCOVERED

123d ago

2026-03-11

PUBLISHED

124d ago

2026-03-10

RELEVANCE

8/10

AUTHOR

Acanthisitta-Sea
