view article Article A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 30
view article Article 🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷 RDTvlokip • May 5 • 6
view article Article BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders Nicolas-BZRD • Apr 7 • 28
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 9 • 62