BACK_TO_FEEDAICRIER_2
IBM drops Granite-4.0-3B-Vision for document extraction
OPEN_SOURCE ↗
REDDIT · REDDIT// 14d agoMODEL RELEASE

IBM drops Granite-4.0-3B-Vision for document extraction

IBM has released Granite-4.0-3B-Vision, a specialized vision-language model (VLM) optimized for enterprise-grade document data extraction. Built as a LoRA adapter on top of the Granite-4.0-Micro base, it excels at converting complex charts, tables, and semantic key-value pairs into structured formats like JSON, CSV, and HTML.

// ANALYSIS

IBM's "Western Qwen" moment delivers a surgical tool for the least sexy but most valuable enterprise task: turning messy PDFs into structured data.

  • Unique LoRA adapter design on a hybrid Mamba-2/Transformer base slashes RAM usage by 70%, allowing a single 3B deployment to pivot between multimodal and text-only tasks.
  • Specialized task tags for Chart2CSV and Table2JSON move beyond simple OCR, offering semantic extraction that integrates directly into IBM's Docling pipeline.
  • ISO 42001 certification and Apache 2.0 licensing provide the "clean" provenance required by regulated industries that often skip less-governed open-weights.
  • Optimized for 8GB VRAM consumer hardware while supporting massive context windows, it represents a direct challenge to Alibaba’s small-model dominance.
// TAGS
ibm-granitevlmmultimodalopen-weightsdoclingenterprise-aidocument-extractiongranite-4.0-3b-vision

DISCOVERED

14d ago

2026-03-28

PUBLISHED

14d ago

2026-03-28

RELEVANCE

9/ 10

AUTHOR

jacek2023