Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27 • 215
Sing-On-Your-Beat: Simple Text-Controllable Accompaniment Generations Paper • 2411.01661 • Published Nov 3, 2024
PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging Paper • 2505.11872 • Published May 17