lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step250 Text Generation • 196k • Updated 21 days ago • 25
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step300 Text Generation • 196k • Updated 21 days ago • 357
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150 Text Generation • 196k • Updated 21 days ago • 395
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200 Text Generation • 196k • Updated 21 days ago • 774
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150 Text Generation • 196k • Updated 21 days ago • 266
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step200 Text Generation • 196k • Updated 23 days ago • 481
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step150 Text Generation • 196k • Updated 23 days ago • 505
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200 Text Generation • 196k • Updated 23 days ago • 476
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step150 Text Generation • 196k • Updated 23 days ago • 455
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step100 Text Generation • 196k • Updated 24 days ago • 735
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step100 Text Generation • 196k • Updated 24 days ago • 647
lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100 Text Generation • 196k • Updated 26 days ago • 184
lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50 Text Generation • 196k • Updated 26 days ago • 149
lihaoxin2020/qwen3-4b-refiner-gpt54-instance-rubric-gpt54-grpo-step50 Text Generation • 196k • Updated 27 days ago • 318
lihaoxin2020/qwen3-4B-refiner-rubric-rl-step50 Text Generation • 196k • Updated about 1 month ago • 46
lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-resume-step100 Text Generation • 196k • Updated Apr 14 • 26
lihaoxin2020/qwen3-4B-refiner-3201-rl-balanced-step50 Text Generation • 196k • Updated Apr 12 • 4 • 1