Krea 2: SOTA open-weights 12B image model(krea.ai)
395 points by mattnewton 1 day ago | 43 comments
tl;dr: Krea released Krea 2, a 12B open-weights diffusion transformer for text-to-image generation that ranks in the top 10 on Artificial Analysis's leaderboard and 2nd among independent labs. The model uses a multi-stage pipeline (pretraining, midtraining, SFT, preference optimization, and RL with multiple reward models) on a curated dataset that explicitly excludes AI-generated images, paired with Qwen3-VL as text encoder, GQA with gated sigmoid attention, and a custom STPO variant of DPO. It ships with a prompt expander and style-reference system aimed at creative exploration rather than a single default aesthetic.
HN Discussion:
  • Authors introducing the model and technical report, inviting questions
  • Benchmark results confirm strong performance, especially for a locally hostable model
  • License restrictions mean it isn't truly open source despite the claim
  • Model targets outdated text-to-image paradigm while competitors moved to agentic image-to-image
  • ~Technical concerns about specific components like the Qwen VAE choice