Running • Featured • QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 • 74 • Who needs 1T parameters? Olympiad proofs with a 4B model
Article • Prefill and Decode for Concurrent Requests - Optimizing LLM Performance • Apr 16, 2025 • 76
ai4bharat/indic-conformer-600m-multilingual Automatic Speech Recognition • Updated Feb 7 • 52.1k • 81
Post 3021 A few days ago, Thinking Machines Lab released “LoRA Without Regret”, showing that LoRA can match full fine-tuning performance when configured right. Naturally, we decided to reproduce the results with TRL and release a guide! https://huggingface.co/docs/trl/main/en/lora_without_regret 🔥 11
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 388k • 1.6k
Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 7 items • Updated Dec 24, 2025 • 55
Cosmos Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 17 days ago • 301
Running • Scaling test-time compute 📈 • 596 • Run advanced search strategies to boost LLM problem solving