MiniCPM-V 4.6 • Collection • A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone • 11 items • Updated 7 days ago • 6
LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs? • Paper • 2605.08985 • Published 12 days ago • 21
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction • Paper • 2604.27393 • Published 21 days ago • 71
InFi-Check: Interpretable and Fine-Grained Fact-Checking of LLMs • Paper • 2601.06666 • Published Jan 10 • 1
From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition • Paper • 2512.14244 • Published Dec 16, 2025 • 2
FaithLens: Detecting and Explaining Faithfulness Hallucination • Paper • 2512.20182 • Published Dec 23, 2025 • 9
VoxCPM • Collection • Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning • 5 items • Updated 7 days ago • 13
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe • Paper • 2509.18154 • Published Sep 16, 2025 • 59
MiniCPM-o & MiniCPM-V • Collection • Multimodal models with leading performance. • 32 items • Updated 7 days ago • 83
Intern-S1: A Scientific Multimodal Foundation Model • Paper • 2508.15763 • Published Aug 21, 2025 • 273
MiniCPM4 • Collection • MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 7 days ago • 86