Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL Paper • 2604.28123 • Published 6 days ago • 34
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published 12 days ago • 12
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published 8 days ago • 32
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing Paper • 2604.22782 • Published Apr 3 • 7
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 7 days ago • 34
Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation Paper • 2604.25819 • Published 9 days ago • 17
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 14 days ago • 33
dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model Paper • 2604.22152 • Published 13 days ago • 4
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 22 days ago • 31
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 13 days ago • 223
EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale Paper • 2604.17406 • Published 18 days ago • 6
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 16 days ago • 249
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 17 days ago • 45
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 21 days ago • 69