TIGER-Lab/One-Shot-CFT-Math-Qwen-1.5B


ubowang committed on Jun 4, 2025

Commit 85f5fbc · verified · 1 Parent(s): ef1062b

Update README.md

Files changed (1)

README.md +1 -1


@@ -44,7 +44,7 @@ Instead of learning from reference answers (as in supervised fine-tuning) or rew
 One-shot CFT consistently improves mathematical and logical reasoning.
 <strong>Left:</strong> Average accuracy on six mathematical reasoning benchmarks for Qwen and LLaMA models, comparing base, SFT, RLVR, and CFT with only one training example.
 <strong>Right:</strong> In-domain accuracy on three logic reasoning benchmarks (BBEH subtasks) for Qwen2.5-Math-7B.
-Across both domains, CFT with a single problem significantly outperforms standard supervised fine-tuning and matches or exceeds reinforcement learning with much lower compute.
+Across both domains, CFT with a single problem significantly outperforms standard SFT and matches or exceeds reinforcement learning with much lower compute.
 </em></p>