Researchers introduce DiffusionBench for holistic DiT evaluation
Researchers argue that current Diffusion Transformer (DiT) evaluations over-rely on ImageNet, which poorly correlates with real-world text-to-image performance. To address this, they introduce NanoGen for unified training and DiffusionBench, a holistic benchmark for evaluating DiTs across both tasks.
The negative correlation between ImageNet FID improvements and text-to-image success is a significant wake-up call for the generative AI community, showing how narrow benchmarks can mislead research directions. By introducing NanoGen, the authors eliminate the common excuse that text-to-image evaluation is too costly, democratizing comprehensive model testing. DiffusionBench has strong potential to become the new standard in DiT research, steering the field toward architectural innovations that generalize rather than overfit to standard datasets.
DISCOVERED
2h ago
2026-06-29
PUBLISHED
2h ago
2026-06-28
RELEVANCE
AUTHOR
_akhaliq