lllqaq/R2EGym-7B-Agent-Coder-Instruct-num02_posttrain_r2egym_32768_8gpu Text Generation • 333k • Updated 1 day ago • 21
lllqaq/R2EGym-7B-Agent-Coder-Instruct-num01_posttrain_r2egym_32768_8gpu Text Generation • 333k • Updated 1 day ago • 16
lllqaq/R2EGym-32B-Agent-Coder-Instruct_merged_bucketab_4sources_20260228_101548_32768_8gpu Text Generation • 1.12M • Updated 9 days ago • 14
lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketab_4sources_20260228_101548_32768_3gpu_oomfix Text Generation • 333k • Updated 9 days ago • 38
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj_reward1_loose_4sources_shuf42_ckpt2400 841k • Updated 12 days ago • 11
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-merged_bucketab_4sources_20260228_101548_32768_4gpu_oomfix Text Generation • 841k • Updated 13 days ago • 14
lllqaq/R2EGym-14B-Agent-Coder-Instruct-traj_bucketAB_multi_3sources_bucketAB_sft_shuf42 Text Generation • 841k • Updated 14 days ago • 12
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fimMidPostV2-r2egym-32k-ckpt808 1.12M • Updated 18 days ago • 10
lllqaq/R2EGym-14B-Agent-Coder-Instruct-trajmix-gpt5miniAB-claude45AB-r2egymSFT-shuf42-32k-8gpu-oomfix Text Generation • 841k • Updated 19 days ago • 11
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fim_midtrain_data_0108_212k_posttrain_r2egym_32768_8gpu Text Generation • 1.12M • Updated 20 days ago • 14
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fim_midtrain_data_0108_212k_32768_8gpu Text Generation • 1.12M • Updated 23 days ago • 10
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj-gpt5mini-full-bucketAB_32768_oomfix Text Generation • 841k • Updated 26 days ago • 13
lllqaq/R2EGym-32B-Agent-Coder-Instruct-r2egym_32768_8gpu Text Generation • 1.12M • Updated 26 days ago • 16
lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketAB_32768_8gpu_oomfix Text Generation • 333k • Updated about 1 month ago • 2
lllqaq/R2EGym-14B-Agent-Coder-Instruct-merged_bucketAB_32768_8gpu_oomfix Text Generation • 841k • Updated about 1 month ago • 1
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-traj-gpt5mini-ab-sample400 Text Generation • 333k • Updated Jan 28
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-r2egym-official-first400 Text Generation • 333k • Updated Jan 28 • 1
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42-ropeyarn Text Generation • 333k • Updated Jan 27 • 2
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42 Text Generation • 841k • Updated Jan 27
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42 Text Generation • 333k • Updated Jan 27
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5-traj-run1-filtered1 Text Generation • 333k • Updated Jan 26 • 2