Nando de Freitas calls for CANDI benchmarks
In a post on X, Nando de Freitas discusses the CANDI paper, which explores text diffusion. He praises the analysis as a wonderful new direction that goes beyond perplexity metrics, but urgently calls for post-training benchmark results and direct performance comparisons against established autoregressive LLMs, so that any performance gap can be properly measured.
The move from autoregressive models to text diffusion is promising, but the community is right to demand rigorous benchmarking to prove its viability. The discussion highlights a critical gap in current text diffusion research: the lack of standard post-training benchmarks. Perplexity alone is insufficient to gauge real-world performance against autoregressive giants. The success and adoption of diffusion models for text will likely hinge on these direct comparisons.
DISCOVERED
2026-07-03
PUBLISHED
2026-07-03
AUTHOR
NandoDF