AdaptRL / math_2509_DeepSeek-R1-Distill-Qwen-1.5B_grpo
21.3 GB
williamium's picture
Upload folder using huggingface_hub
0894f5a verified