xfxcwynlc/llama-nemotron-post-training-qwen3-1.7B-packed2048-1500000-sampled80code10chat10math Preview • Updated 22 days ago • 17
xfxcwynlc/retrieve_all_no_longbench-qwen3-tokenized-packed-16384 Viewer • Updated Jun 25, 2025 • 251k • 3