makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch • AviSoori1x • May 7, 2024 • 122
Introducing North Mini Code: Cohere’s First Model For Developers • CohereLabs • 24 days ago • 79
DeepSeek-V4: a million-token context that agents can actually use • burtenshaw • Apr 24 • 50
Training and Finetuning Embedding Models with Sentence Transformers • tomaarsen • May 28, 2024 • 275
DeepMath: A lightweight math reasoning Agent with smolagents • danf, mber, moshew • Dec 4, 2025 • 40
MCP for Research: How to Connect AI to Research Tools • dylanebert • Aug 18, 2025 • 70
Accelerating Document AI • rajistics, nielsr, florentgbelidji, nbroad • Nov 21, 2022 • 81