Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance Paper • 2606.19195 • Published 8 days ago • 132
MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction Paper • 2606.18558 • Published 8 days ago • 50
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 10 days ago • 222
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 106 items • Updated 7 days ago • 727
view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 15 days ago • 74
Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models Paper • 2606.11167 • Published 15 days ago • 5
Interactivity Alignment Collection Full-duplex speech models post-trained with reinforcement learning for improved conversational interactivity. • 4 items • Updated 15 days ago • 6
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition Paper • 2312.17279 • Published Dec 27, 2023 • 4
view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 21 days ago • 64
CoreML Speech Models Collection Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. • 28 items • Updated 2 days ago • 4
MLX Speech Models Collection Speech AI models for Apple Silicon via MLX. ASR, TTS, VAD, diarization, speaker embedding. • 57 items • Updated 3 days ago • 5
Unified Panoramic Geometry Estimation via Multi-View Foundation Models Paper • 2605.26368 • Published May 25 • 4
CubePart: An Open-Vocabulary Part-Controllable 3D Generator Paper • 2605.28763 • Published 29 days ago • 14