MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control Paper • 2604.06156 • Published Apr 7 • 10
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control Paper • 2604.06156 • Published Apr 7 • 10
TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation Paper • 2503.07050 • Published Mar 10, 2025 • 1
SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model Paper • 2510.12709 • Published Oct 14, 2025 • 14
Scalable Vision Language Model Training via High Quality Data Curation Paper • 2501.05952 • Published Jan 10, 2025 • 5
SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement Paper • 2507.01643 • Published Jul 2, 2025 • 2
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement Paper • 2508.09670 • Published Aug 13, 2025
SAIL-VL Collection Scalable Vision Language Model Training via High Quality Data Curation • 6 items • Updated Sep 18, 2025 • 1