Nando de Freitas calls for CANDI benchmarks

// 2h agoRESEARCH PAPER

Nando de Freitas calls for CANDI benchmarks

A discussion on X regarding the CANDI paper, which explores text diffusion. While the analysis is praised as a wonderful new direction beyond just perplexity metrics, there is an urgent call for post-training benchmark results and direct performance comparisons against established autoregressive LLMs to properly measure any existing performance gaps.

// ANALYSIS

The move from autoregressive models to text diffusion is promising, but the community is right to demand rigorous benchmarking to prove viability. It highlights a critical gap in current text diffusion research: the lack of standard post-training benchmarks. Perplexity alone is insufficient to gauge real-world performance against autoregressive giants. The ultimate success and adoption of diffusion models in text will likely hinge on these direct comparisons.

// TAGS

text-diffusionllmaillmsbenchmarkscandi

DISCOVERED

2h ago

2026-07-03

PUBLISHED

2h ago

2026-07-03

RELEVANCE

6/ 10

AUTHOR

NandoDF

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE3h ago

OpenAI replaces ChatGPT Canvas with Writing Blocks

OpenAI has updated ChatGPT by replacing its side-by-side Canvas editing mode with inline Writing Blocks to streamline document and code drafting. While the new interface is intended to improve latency and simplify the layout, early user feedback indicates that the editor within Writing Blocks is less capable, making simple rewrites that were previously effortless feel frustratingly limited.

UPDATE4h ago

Grok reportedly coming to X group chats

A leak reveals xAI's Grok chatbot is coming to X group chats, enabling users to interact with the AI assistant in group messaging like a Discord bot. While users are excited for quick thread queries, some note limitations such as Grok's lack of long-term memory.

MODEL5h ago

Gemini 3.5 Pro leaks suggest UI, design upgrade

Unconfirmed developer leaks circulating online claim that Google's upcoming Gemini 3.5 Pro model will offer a significant leap in visual design quality, UI layout generation, and SVG code output compared to Gemini 3.1 Pro. The reports emphasize particularly strong performance in one-shot frontend generation, aiming to provide developers with ready-to-use user interface components.