CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 4B • Updated 10 days ago • 249
CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 4B • Updated 10 days ago • 249
CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 2B • Updated 12 days ago • 293
CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 2B • Updated 12 days ago • 293
CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384 Viewer • Updated 15 days ago • 64k • 28
CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384 Viewer • Updated 15 days ago • 64k • 28
CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384 Viewer • Updated 20 days ago • 42.8k • 32
CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384 Viewer • Updated 20 days ago • 42.8k • 32
CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384 Viewer • Updated 20 days ago • 3.2k • 32
CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384 Viewer • Updated 20 days ago • 3.2k • 32
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 501