BACK_TO_FEEDAICRIER_2
DataBoundary puts delimiter defense at 100%
OPEN_SOURCE ↗
REDDIT · REDDIT// 4h agoBENCHMARK RESULT

DataBoundary puts delimiter defense at 100%

DataBoundary is a prompt-injection benchmark and defense lab that wraps untrusted text in random delimiters and tests whether models keep treating it as data. In its latest run, several weaker models jumped from poor baseline defense to 99-100% once delimiters and a strict boundary prompt were added.

// ANALYSIS

Useful signal, not a universal fix: delimiter framing is a strong, low-cost defense for single-turn document ingestion, but the repo also shows the gains depend on model and prompt wording.

  • Gemma 4 E4B moved from 21.6% defense without delimiters to 100% with delimiters, and the strict prompt closed the last gaps on the weaker models.
  • The terse "strict" template beat a more explanatory "contextual" version, which suggests boundary clarity matters more than persuasion.
  • The hardest attacks were delimiter mimicry and gradual drift, so this is still defense in depth, not a solved problem.
  • The benchmark is most relevant for RAG and web-document workflows where the model reads untrusted text directly.
  • The dataset and harness are open, which makes the result more useful than a one-off demo because others can reproduce and extend it.
// TAGS
databoundarybenchmarkevaluationsecuritysafetyprompt-engineeringdata-tools

DISCOVERED

4h ago

2026-05-05

PUBLISHED

4h ago

2026-05-05

RELEVANCE

9/ 10

AUTHOR

User_Deprecated