RefineBench: Evaluating Refinement Capability of Language Models via Checklists
Paper
•
2511.22173
•
Published
•
12
None defined yet.
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model