lihaoxin2020/agentic-search-rl-mixed-shortform-dr-tulu-longform-v1 Viewer • Updated 2 days ago • 6.37k • 14
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step250 Text Generation • 196k • Updated 9 days ago • 24
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step300 Text Generation • 196k • Updated 9 days ago • 209
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150 Text Generation • 196k • Updated 10 days ago • 327
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200 Text Generation • 196k • Updated 10 days ago • 543
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150 Text Generation • 196k • Updated 10 days ago • 196
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step200 Text Generation • 196k • Updated 12 days ago • 391
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step150 Text Generation • 196k • Updated 12 days ago • 412
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200 Text Generation • 196k • Updated 12 days ago • 384