BACK_TO_FEEDAICRIER_2
Chess-GPT impossible-move test probes board model
OPEN_SOURCE ↗
REDDIT · REDDIT// 2h agoNEWS

Chess-GPT impossible-move test probes board model

This Reddit discussion proposes stress-testing Karvonen’s chess transformer with illegal, trajectory-impossible, and ambiguous moves to see whether its latent board-state probes stay coherent or break in distinct ways. The experiment is aimed at separating rule tracking, current-position tracking, attack geometry, piece identity, and strategic expectation into different failure modes.

// ANALYSIS

Good idea, and more interesting than a generic robustness test: it turns “the model has a board representation” into a causal question about how that representation behaves under structured contradiction. The real signal would be qualitative dissociations, not just worse accuracy.

  • Karvonen’s prior work already shows linear probes and interventions can recover and edit latent board state, so impossible inputs are a direct test of whether that state is actually used
  • Rule violations should pressure the model’s update mechanism; trajectory violations test whether it tracks history or only final configuration
  • “Impossible threat” cases are the sharpest probe for relational structure, because the square occupancy can be fine while attack geometry is nonsense
  • Referential ambiguity is a separate axis: if probes commit to one knight, that suggests piece identity is encoded; if they preserve ambiguity, occupancy may dominate over object tracking
  • Strategic absurdity should mainly hit skill or move-prior estimates, which gives a useful control for separating tactical confusion from world-model collapse
// TAGS
chess-gptllmreasoningresearchopen-source

DISCOVERED

2h ago

2026-04-16

PUBLISHED

6h ago

2026-04-16

RELEVANCE

8/ 10

AUTHOR

Infamous-Payment-164