BACK_TO_FEEDAICRIER_2
Frank Bot shares hard-won RAG lessons
OPEN_SOURCE ↗
REDDIT · REDDIT// 14d agoTUTORIAL

Frank Bot shares hard-won RAG lessons

Frank Bot is a RAG assistant for Australian workplace compliance, deployed across construction, aged care, and mining. The post distills the production lessons that mattered most: query expansion, document-title boosting, layered prompts, local embeddings, and per-client isolation.

// ANALYSIS

Regulated RAG is mostly a systems problem wearing an ML costume. The model matters, but retrieval routing, prompt boundaries, and tenant isolation are what make the system reliable enough to ship.

  • Four query rewrites via Claude Haiku improve recall more than fiddling with chunk size alone, especially when users and policy docs use different jargon.
  • Named-document boosting is a smart compliance hack because the right answer often lives in one obvious source of truth, not the nearest semantic neighbor.
  • Immutable Layer 1 prompts are the real guardrail in multi-tenant bots; without them, client instructions become an attack surface.
  • Local `all-MiniLM-L6-v2` embeddings plus ChromaDB are a sane tradeoff when the LLM and retrieval logic do the heavy lifting.
  • One VM per client is operationally boring in the best way: simpler isolation, fewer cross-contamination risks, and less shared-state pain.
// TAGS
frank-botragchatbotvector-dbembeddingprompt-engineeringself-hostedllm

DISCOVERED

14d ago

2026-03-29

PUBLISHED

14d ago

2026-03-29

RELEVANCE

8/ 10

AUTHOR

Neoprince86