This model is part of the CHIMERA collection, which accompanies the paper "CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning".
CHIMERA-4B-RL is CHIMERA-4B-SFT further trained with reinforcement learning on the CHIMERA dataset.
| Model | GPQA-D | AIME 24 | AIME 25 | AIME 26 | HMMT Feb 25 | HMMT Nov 25 | HLE |
|---|---|---|---|---|---|---|---|
| Qwen3-4B-Thinking-2507 | 65.8 | 81.6 | 81.0 | 80.8 | 59.2 | 57.3 | 7.3 |
| CHIMERA-4B-SFT | 68.8 | 86.5 | 79.8 | 80.3 | 63.1 | 66.3 | 9.0 |
| CHIMERA-4B-RL | 70.1 | 86.9 | 80.7 | 82.7 | 65.7 | 67.0 | 9.0 |
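
To make the comparison in the table easier to read, the following minimal sketch computes per-benchmark differences between a model and the Qwen3-4B-Thinking-2507 baseline. The scores are copied from the table; the names `BENCHMARKS`, `SCORES`, and `deltas` are illustrative helpers and are not part of any released CHIMERA code.

```python
# Scores copied verbatim from the results table above.
BENCHMARKS = [
    "GPQA-D", "AIME 24", "AIME 25", "AIME 26",
    "HMMT Feb 25", "HMMT Nov 25", "HLE",
]

SCORES = {
    "Qwen3-4B-Thinking-2507": [65.8, 81.6, 81.0, 80.8, 59.2, 57.3, 7.3],
    "CHIMERA-4B-SFT": [68.8, 86.5, 79.8, 80.3, 63.1, 66.3, 9.0],
    "CHIMERA-4B-RL": [70.1, 86.9, 80.7, 82.7, 65.7, 67.0, 9.0],
}


def deltas(model, baseline="Qwen3-4B-Thinking-2507"):
    """Return {benchmark: model score minus baseline score}, rounded to 0.1."""
    return {
        bench: round(m - b, 1)
        for bench, m, b in zip(BENCHMARKS, SCORES[model], SCORES[baseline])
    }


if __name__ == "__main__":
    # Print the signed difference of each CHIMERA model against the baseline.
    for name in ("CHIMERA-4B-SFT", "CHIMERA-4B-RL"):
        print(name)
        for bench, diff in deltas(name).items():
            print(f"  {bench:<12} {diff:+.1f}")
```

Passing a different `baseline`, for example `deltas("CHIMERA-4B-RL", baseline="CHIMERA-4B-SFT")`, isolates the change between the SFT and RL checkpoints.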