Krea releases Krea 2 technical report

// 1h agoRESEARCH PAPER

Krea releases Krea 2 technical report

Krea has published the technical report for Krea 2 (K2), its family of open-weights text-to-image foundation models designed for creative exploration. The release includes the model weights for both the undistilled K2 Raw and the fast, distilled K2 Turbo variants.

// ANALYSIS

By sharing the end-to-end recipe for Krea 2, Krea proves that open-weights image models can move beyond generic "AI aesthetics" through modular training and thoughtful data curation. The report shows that RL and careful post-training alignment are just as critical for text-to-image generation as they are for LLMs.

–**Steerability over defaults**: Instead of optimizing solely for a polished default aesthetic, Krea 2 focuses on creative exploration using a prompt expander (trained via RL with a diversity reward) and a style-reference system to minimize content leakage.
–**Architectural efficiency**: The model streamlines the standard Diffusion Transformer (DiT) by replacing heavy per-block modulation MLPs with simple block-level biases (saving 20-30% of parameter count) and adopting GQA with gated sigmoid attention for stable training.
–**Advanced post-training stack**: The training pipeline employs a custom DPO variant (STPO) to prevent policy divergence and high-frequency artifacts, followed by a multi-reward GRPO-style RL stage using rubric-based prompt evaluation and a dedicated structural artifact classifier.
–**Custom database and training infra**: Krea built a custom PG-sharded metadata queue called "krablets" to scale data processing, and utilized FSDP2/Megatron-LM TP alongside the Weka filesystem to support aggressive checkpointing (completed in under 30 seconds).

// TAGS

krea-2kreaimage-genopen-weightstrainingresearchdataset

DISCOVERED

1h ago

2026-06-23

PUBLISHED

1h ago

2026-06-23

RELEVANCE

9/ 10

AUTHOR

krea_ai

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE25m ago

Seedance 2.0 drops native 4K

Higgsfield has added native 4K support to the Seedance 2.0 video model on its platform, eliminating the need for post-production upscaling. The update delivers high-resolution, cinema-grade details and stable motion consistency directly from generation.

NEWS35m ago

Linton proposes tiered model agent execution

AI entrepreneur Morgan Linton argues that current coding agent implementations are over-provisioned at the execution layer. He proposes a three-tier architecture (orchestrate, execute, review) using different models or variable effort levels to optimize performance and cost.

BENCHMARK37m ago

GLM-5.2 matches Opus quality, cuts costs

Evaluation tests comparing Z.ai's open-weights GLM-5.2 against Anthropic's Claude Opus show the open model delivers comparable coding quality at three times lower cost and faster speeds. Developer Hassan El Mghari plans to open-source the complete prompt and cost testing suite.