CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward • 5 items • Updated Aug 31, 2025 • 7
Tar Collection [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations • 11 items • Updated Sep 20, 2025 • 1
ReflectionFlow release Collection https://diffusion-cot.github.io/reflection2perfection/ • 6 items • Updated Apr 23, 2025 • 13
Science-T2I Collection Addressing Scientific Illusions in Image Synthesis • 10 items • Updated Apr 27, 2025 • 4
SFTvsRL Models & Data Collection This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training. • 7 items • Updated Mar 13, 2025 • 9
Eagle Collection Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input. • 15 items • Updated 8 days ago • 38
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format • 11 items • Updated Jul 2, 2024 • 11
MGM Collection Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47