DataBoundary puts delimiter defense at 100%

// 90d agoBENCHMARK RESULT

DataBoundary puts delimiter defense at 100%

DataBoundary is a prompt-injection benchmark and defense lab that wraps untrusted text in random delimiters and tests whether models keep treating it as data. In its latest run, several weaker models jumped from poor baseline defense to 99-100% once delimiters and a strict boundary prompt were added.

// ANALYSIS

Useful signal, not a universal fix: delimiter framing is a strong, low-cost defense for single-turn document ingestion, but the repo also shows the gains depend on model and prompt wording.

–Gemma 4 E4B moved from 21.6% defense without delimiters to 100% with delimiters, and the strict prompt closed the last gaps on the weaker models.
–The terse "strict" template beat a more explanatory "contextual" version, which suggests boundary clarity matters more than persuasion.
–The hardest attacks were delimiter mimicry and gradual drift, so this is still defense in depth, not a solved problem.
–The benchmark is most relevant for RAG and web-document workflows where the model reads untrusted text directly.
–The dataset and harness are open, which makes the result more useful than a one-off demo because others can reproduce and extend it.

// TAGS

databoundarybenchmarkevaluationsecuritysafetyprompt-engineeringdata-tools

DISCOVERED

90d ago

2026-05-05

PUBLISHED

90d ago

2026-05-05

RELEVANCE

9/ 10

AUTHOR

User_Deprecated

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

TUTORIAL36m ago

Prompting ChatGPT for editable text blocks improves iteration

Riley Brown shared a quick prompting tip for working with OpenAI Codex and ChatGPT: requesting responses in an "editable text block." This technique formats the generated text so users can easily make manual edits, make further AI-assisted tweaks, or copy the content directly to their clipboard.

TUTORIAL1h ago

Dani Avila shares Claude Code session cheat sheet

Developer Dani Avila shared an updated cheat sheet detailing session management commands for Claude Code and clarifying when to use each. The guide highlights recent command renames from the changelog, noting that `/fork` duplicates a session to run independently, while `/subtask` delegates work to a sub-agent that reports results back to the primary session.

OPEN SOURCE1h ago

LogoCreator v2 Drops Open-Source Logo Generator

LogoCreator v2 is an open-source web application designed to generate professional logos and complementary brand images within seconds. Built by developer Hassan El Mghari (Nutlope), the tool gives indie hackers, designers, and creators a free and efficient way to assemble complete visual branding for their projects.