view article Article Streaming datasets: 100x More Efficient +3 andito, lhoestq, burtenshaw, pcuenq, merve • Oct 27, 2025 • 86
view article Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models nvidia • Oct 20, 2025 • 19
view article Article Training and Finetuning Embedding Models with Sentence Transformers tomaarsen • May 28, 2024 • 274
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers tomaarsen, arthurbresnu • Jul 1, 2025 • 138
view article Article KV Cache from scratch in nanoVLM +3 ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 119
view article Article AutoThink: Adaptive Reasoning for Large Language Models codelion • May 27, 2025 • 8
view article Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼💻 sasha • May 28, 2025 • 22
view article Article CodeAgents + Structure: A Better Way to Execute Actions akseljoonas, m-ric • May 28, 2025 • 82
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 83