BACK_TO_FEEDAICRIER_2
gumbel-mcts ships numba-accelerated MCTS library
OPEN_SOURCE ↗
REDDIT · REDDIT// 16d agoOPENSOURCE RELEASE

gumbel-mcts ships numba-accelerated MCTS library

gumbel-mcts is a lightweight Python/numba Monte Carlo tree search library with PUCT, dense Gumbel, and sparse Gumbel implementations. The author says the code was validated against a gold-standard baseline and tuned for fast self-play experiments, especially in large-action-space games.

// ANALYSIS

This is the kind of quiet infra release that matters to anyone actually running search-heavy agents. The real win is less the algorithm name than the combination of validated parity, low-budget Gumbel search, and a compact codebase people can inspect and fork.

  • `puct.py` claims exact policy parity with a reference MCTS while delivering a multi-x speedup.
  • `gumbel_dense.py` is the more interesting part for MuZero-style setups where simulation budget is tight.
  • `gumbel_sparse.py` is a practical fit for large branching factors, which is why chess is called out explicitly.
  • MIT licensing and a pure Python/numba stack lower the barrier for experimentation, benchmarking, and integration into self-play loops.
// TAGS
gumbel-mctsopen-sourcereasoningresearchbenchmarkdevtool

DISCOVERED

16d ago

2026-03-26

PUBLISHED

17d ago

2026-03-26

RELEVANCE

8/ 10

AUTHOR

randomwalkin