view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model georgefen • Jan 1 • 19
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624