Kreuzberg 4.5.0 integrates Docling layout model
Kreuzberg v4.5.0 brings native document structure and table understanding by embedding Docling's RT-DETR v2 model into its Rust-native pipeline. The integration delivers comparable accuracy to Docling but runs nearly three times faster with significantly lower memory overhead and no Python dependency.
By porting Docling's excellent layout models to a Rust-native pipeline, Kreuzberg offers a compelling, high-performance alternative for heavy document processing pipelines.
- –Replacing Python with Rust and using pdfium for character-level text extraction yields a massive 2.8x performance boost and lowers memory requirements
- –Cross-language support (bindings for 12 languages including Python, JS, Go, Java) means you aren't forced into a Python stack just for document AI
- –SLANet-Plus integration specifically targets table structure prediction, which is notoriously difficult in PDF extraction
- –The project proves that leveraging permissive open-source models (Apache) inside highly optimized execution environments is a winning strategy for infrastructure tools
DISCOVERED
67d ago
2026-03-22
PUBLISHED
67d ago
2026-03-22
RELEVANCE
AUTHOR
Eastern-Surround7763