Excited to share our new work on Unified Multimodal Models (UMMs): Reconstruction Alignment (RecA)! Just 6 × 80GB A100s × 4.5 hours (27 GPU-hours) to boost BAGEL performance across all tasks! It also outperforms FLUX-Kontext in image editing!
RecA trains UMMs to reconstruct images from the embeddings of their own visual understanding encoder, yielding big gains in image generation 🎨 and editing ✏️.
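
For intuition, here is a toy sketch of the reconstruction idea described above: a frozen "understanding" encoder produces an embedding, and a generator is trained to rebuild the original image from that embedding. Everything in it is an assumption made for illustration, not RecA's actual code: the names `encode`, `generate`, and `train` are hypothetical, the linear maps stand in for real networks, and mean squared error stands in for whatever generation loss the real model uses.

```python
import random

def encode(image):
    """Hypothetical frozen understanding encoder: 4 pixels -> 2-dim embedding."""
    return [(image[0] + image[1]) / 2, (image[2] + image[3]) / 2]

def generate(weights, embedding):
    """Hypothetical generator: linear map from the embedding back to 4 pixels."""
    return [sum(w * e for w, e in zip(row, embedding)) for row in weights]

def mse(a, b):
    """Mean squared error, used here as a stand-in reconstruction loss."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def train(images, steps=200, lr=0.5, seed=0):
    """Train only the generator; the encoder is never updated."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(2)] for _ in range(4)]
    history = []
    for _ in range(steps):
        total = 0.0
        for image in images:
            emb = encode(image)              # condition on the model's own embedding
            recon = generate(weights, emb)   # try to rebuild the input image
            total += mse(recon, image)
            for i in range(4):
                # Gradient of the MSE with respect to output pixel i.
                grad_out = 2 * (recon[i] - image[i]) / 4
                for j in range(2):
                    weights[i][j] -= lr * grad_out * emb[j]
        history.append(total / len(images))  # average loss over this pass
    return weights, history

if __name__ == "__main__":
    toy_images = [[1, 1, 0, 0], [0, 0, 1, 1], [1, 1, 1, 1]]
    _, history = train(toy_images)
    print(f"first-pass loss: {history[0]:.4f}, last-pass loss: {history[-1]:.2e}")
```

The point of the toy is only the training signal: the target is the input image itself and the conditioning is the model's own embedding of it, so no captions or extra labels are needed.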