OPEN_SOURCE ↗
REDDIT · REDDIT// 27d agoOPENSOURCE RELEASE
NexusRAG brings image, table-aware RAG to Ollama
NexusRAG is a new open-source, self-hostable RAG platform that uses vision LLMs and Docling to caption images and summarize tables before embedding, so document queries actually surface chart data and tabular content. It supports both local Ollama and Gemini backends with a three-stage hybrid retrieval pipeline including vector search, a knowledge graph, and cross-encoder reranking.
// ANALYSIS
Image and table blindness is one of the most persistent pain points in production RAG — NexusRAG's approach of caption-augmented chunking is pragmatic and deployable today without exotic infrastructure.
- –Vision LLM captions are appended directly to page chunks before embedding, meaning no separate image index or retrieval path is needed — a clean architectural choice
- –Tables are exported as structured Markdown, summarized by an LLM, then injected back into the chunk — preserves both structure and semantic searchability
- –Hybrid chunking via Docling's HybridChunker respects document structure, never splitting mid-table or mid-heading
- –Three-stage retrieval (ChromaDB vector over-fetch → LightRAG knowledge graph → BGE cross-encoder reranker) is more sophisticated than most hobbyist RAG projects
- –Full Ollama support makes this fully air-gapped deployable — relevant for enterprise and privacy-sensitive use cases
// TAGS
nexusragragopen-sourcellmvector-dbmultimodalself-hosted
DISCOVERED
27d ago
2026-03-15
PUBLISHED
27d ago
2026-03-15
RELEVANCE
7/ 10
AUTHOR
Alternative_Job8773