Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published 15 days ago • 107
Article "Darwin-27B-Opus: Surpassing the Foundation Model Without Training" FINAL-Bench • Apr 13 • 13
Article Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion FINAL-Bench • Apr 15 • 13
Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • 24 days ago • 45
Article Building a Fast Multilingual OCR Model with Synthetic Data nvidia • 30 days ago • 33
Article 1.7x Faster on a 218B Model: EAGLE3 Speculative Decoding for GLM-4.7 lujangusface • Apr 15 • 1
Article We Pitted the Cheapest TPU Against an NVIDIA L4. Here's What 6 Experiments Revealed. lujangusface • 30 days ago • 1
Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs nielsr • Apr 7 • 61
Article MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning FINAL-Bench • Mar 9 • 16
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 152