agurung/lcb-ft-v2-qwen3-4b-rft-iid-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago • 35
agurung/lcb-ft-v2-qwen3-4b-rft-mixed-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago
agurung/lcb-ft-v2-qwen3-4b-dft-mixed-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago • 46
agurung/lcb-ft-v2-qwen3-4b-sft-mixed-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago • 50
agurung/lcb-ft-v2-qwen3-4b-dft-iid-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago • 48
agurung/lcb-ft-v2-qwen3-4b-sft-iid-24-lora-r128-a32-lr2p5e-4-const-lr2p5e-4-qps8-gpuauto-ep4 Updated 3 days ago • 64
agurung/flawed-fictions-qwen3-4b-lengthpenalty-litereason Reinforcement Learning • 4B • Updated Mar 10 • 7
agurung/flawed-fictions-qwen25-7b-lengthpenalty-litereason Reinforcement Learning • 8B • Updated Feb 22 • 8