view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) Jan 19, 2025 • 46
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 266