rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01-merged 4B • Updated about 16 hours ago • 30
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01-merged 4B • Updated about 16 hours ago • 30
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01 Text Generation • Updated about 16 hours ago • 13
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01 Text Generation • Updated about 16 hours ago • 13
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3_merged 7B • Updated 1 day ago • 32
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3 Text Generation • Updated 1 day ago • 15
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3_merged 7B • Updated 1 day ago • 32
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3 Text Generation • Updated 1 day ago • 15
rghosh8/arc-grpo-nemotron-mini-4b-instruct-beta-0.01-adapter Text Generation • Updated 1 day ago • 13
rghosh8/arc-grpo-nemotron-mini-4b-instruct-beta-0.01-adapter Text Generation • Updated 1 day ago • 13