Prompt Injection Scanner flags hidden skill attacks

// 113d agoOPENSOURCE RELEASE

Prompt Injection Scanner flags hidden skill attacks

MikeVeerman’s proof of concept scans `SKILL.md` files for hidden `!` directives using a local, non-tool-calling model at install time. The goal is to catch prompt injection before a skill ever reaches a live agent.

// ANALYSIS

This is less a polished product than a timely security pattern, and that’s exactly why it matters: the risky part of third-party skills is not the markdown itself, but the execution boundary hidden inside it.

–The core insight is strong: keep the main agent out of the loop and hand only extracted directives to a separate classifier.
–Using `mistral-small:latest` locally makes the check cheap enough to run at install time, which is where this defense belongs.
–The benchmark result is promising for a narrow threat model, but the repo is explicit that it does not yet cover multi-file payloads, obfuscation, or network-fetched content.
–This feels more like an early antivirus-style guardrail for AI tools than a full security system, which is probably the right mental model.

// TAGS

prompt-injection-scannersafetyopen-sourceself-hostedllmprompt-engineering

DISCOVERED

113d ago

2026-03-20

PUBLISHED

113d ago

2026-03-20

RELEVANCE

7/ 10

AUTHOR

MikeNonect

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE42m ago

Lightpanda merges IndexedDB support for automation

Lightpanda, the open-source headless browser engine written in Zig for web automation and AI agents, has added base implementation support for IndexedDB to its main branch. This update allows scripts that depend on IndexedDB for client-side storage to execute successfully, removing a significant barrier for automation and scraping workflows on modern web applications.

OPEN SOURCE50m ago

LangChain-Chatchat builds local private RAG pipelines

LangChain-Chatchat is an open-source, local knowledge-based QA application and RAG framework built on LangChain, FastAPI, and Streamlit. It provides a private, offline pipeline that integrates with Ollama and Xinference to support open-source models like Llama3 and Qwen2.

OPEN SOURCE1h ago

prose stylesheet forces clean AI writing

prose is a lightweight, single-file Markdown prompt configuration that guides AI coding agents to communicate like a direct, confident senior engineer. Appended directly to local agent instruction files, it establishes clear rules to eliminate common AI patterns like cheesy setups, over-bulleted reasoning, and theatrical language.