---
license: apache-2.0
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
datasets:
- 0xZee/dataset-CoT-Space-Physics-Astrophysics-76
tags:
- unsloth
- trl
- sft
---
# Fine-tuning deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B on 0xZee/dataset-CoT-Physics-Astrophysics