daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 3 days ago • 117
EmbeddingGemma: Powerful and Lightweight Text Representations Paper • 2509.20354 • Published Sep 24, 2025 • 44
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Paper • 2509.18154 • Published Sep 16, 2025 • 53
unsloth/GLM-4.1V-9B-Thinking-GGUF Image-Text-to-Text • 9B • Updated Jul 25, 2025 • 1.65k • 39
nvidia/Llama-Nemotron-Post-Training-Dataset • Updated May 8, 2025 • 3.91M • 2.95k • 641
SynLogic Collection Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond • 5 items • Updated Jun 3, 2025 • 15