BACK_TO_FEEDAICRIER_2
Local 4B LLM forecaster beats GPT-4 on cheap GPU
OPEN_SOURCE ↗
REDDIT · REDDIT// 4h agoOPENSOURCE RELEASE

Local 4B LLM forecaster beats GPT-4 on cheap GPU

PredictBot is an open-source forecasting tool that runs Qwen 3.5 4B locally on consumer hardware to predict yes/no event outcomes. By applying a custom calibration pipeline to shrink overconfident raw outputs, it achieves a 0.186 Brier score, outperforming GPT-4 on structured prediction market questions.

// ANALYSIS

The real breakthrough here isn't the model itself, but the clever calibration pipeline that fixes the inherent overconfidence of small LLMs. It proves that aggressive post-processing on a 4B model can yield superforecaster-level accuracy without touching the cloud.

  • Raw LLM output scored a poor 0.25 Brier score, but applying a shrinkage formula toward a measured base rate improved it to 0.186
  • Runs entirely locally via llama.cpp taking just 3.4 GB VRAM, making it accessible on a $300 secondhand GPU
  • Includes a built-in RAG pipeline with stealth browser fetching for grounding predictions in current events
  • Shows particular strength in discrete events and market questions, though struggles with continuous macro time series data
// TAGS
predictbotllmagentragopen-sourceinference

DISCOVERED

4h ago

2026-04-18

PUBLISHED

5h ago

2026-04-18

RELEVANCE

8/ 10

AUTHOR

Salok1