Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Paper • 2504.21356 • Published Apr 30, 2025 • 2
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published Apr 24, 2025 • 40
EliGen: Entity-Level Controlled Image Generation with Regional Attention Paper • 2501.01097 • Published Jan 2, 2025 • 2
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2