SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper • 2512.03244 • Published Dec 2, 2025 • 17
AI Debate Aids Assessment of Controversial Claims Paper • 2506.02175 • Published Jun 2, 2025 • 1
AI Debate Aids Assessment of Controversial Claims Paper • 2506.02175 • Published Jun 2, 2025 • 1
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance Paper • 2511.12997 • Published Nov 17, 2025 • 11
The African Languages Lab: A Collaborative Approach to Advancing Low-Resource African NLP Paper • 2510.05644 • Published Oct 7, 2025 • 25
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization Paper • 2404.00530 • Published Mar 31, 2024
OpenThoughts: Data Recipes for Reasoning Models Paper • 2506.04178 • Published Jun 4, 2025 • 51
AI Debate Aids Assessment of Controversial Claims Paper • 2506.02175 • Published Jun 2, 2025 • 1
ModelCitizens: Representing Community Voices in Online Safety Paper • 2507.05455 • Published Jul 7, 2025 • 5
How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models Paper • 2407.00369 • Published Jun 29, 2024
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation Paper • 2505.18842 • Published May 24, 2025 • 36
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published Apr 15, 2025 • 35
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published Apr 15, 2025 • 35
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published Apr 15, 2025 • 35