OPEN_SOURCE
REDDIT · 7h ago · RESEARCH PAPER
DharmaOCR tops OCR bench, cuts cost
DharmaOCR Full and Lite are 7B and 3B structured-OCR models from Dharma-AI built with SFT plus DPO. The paper says they beat commercial OCR systems and open-source baselines on a new benchmark while reducing degeneration and per-page inference cost.
// ANALYSIS
This is a strong reminder that specialization can beat bigger general-purpose models when the task has a rigid output format and a measurable failure mode.
- DPO here is not just alignment theater; using degenerate generations as rejected samples directly targets the looping and runaway outputs that hurt OCR pipelines.
- The reported scores, 0.925 for the 7B model and 0.911 for the 3B model, are impressive, but they are still benchmark-specific, so the claim is strongest for structured OCR rather than broad document understanding.
- AWQ cutting per-page cost by about 22% with negligible quality loss is the part that matters operationally, because OCR workloads are usually judged on throughput and unit economics as much as accuracy.
- The comparison set is broad, spanning commercial APIs and open-source OCR stacks, which makes the result more interesting than a narrow internal eval.
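The DPO angle above can be sketched concretely. A minimal, hypothetical pipeline for building preference pairs would flag looping outputs with a simple repeated-n-gram heuristic and pair each clean generation (chosen) with a degenerate one (rejected); the function names and thresholds here are illustrative, not from the paper.

```python
def is_degenerate(text: str, ngram: int = 4, max_repeats: int = 3) -> bool:
    """Flag runaway outputs: any n-gram repeated more than max_repeats times.
    A crude stand-in for whatever degeneration detector the authors used."""
    tokens = text.split()
    counts = {}
    for i in range(len(tokens) - ngram + 1):
        key = tuple(tokens[i:i + ngram])
        counts[key] = counts.get(key, 0) + 1
        if counts[key] > max_repeats:
            return True
    return False

def build_dpo_pairs(prompts, generations):
    """For each prompt, pair one clean generation (chosen) with one
    degenerate generation (rejected); skip prompts lacking either kind."""
    pairs = []
    for prompt, gens in zip(prompts, generations):
        clean = [g for g in gens if not is_degenerate(g)]
        degen = [g for g in gens if is_degenerate(g)]
        if clean and degen:
            pairs.append({"prompt": prompt,
                          "chosen": clean[0],
                          "rejected": degen[0]})
    return pairs
```

Pairs in this shape feed directly into standard DPO trainers, which is what makes the "degenerate outputs as rejected samples" trick cheap to implement.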
// TAGS
dharmaocr · llm · fine-tuning · open-source · benchmark · inference
DISCOVERED
7h ago
2026-04-17
PUBLISHED
8h ago
2026-04-17
RELEVANCE
9/10
AUTHOR
Flat_Divide9839