Paper • Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts • 2606.05922 • Published 22 days ago • 67
Collection • Faro Series: Faro chat models are fine-tuned on Fusang, focusing on practicality and long-context modeling. • 6 items • Updated Apr 11, 2024 • 3