| | --- |
| | license: apache-2.0 |
| | tags: |
| | - unsloth |
| | - trl |
| | - sft |
| | - deepseek-r1-distill-llama-8b |
| | datasets: |
| | - FreedomIntelligence/medical-o1-reasoning-SFT |
| | base_model: |
| | - unsloth/DeepSeek-R1-Distill-Llama-8B |
| | --- |
| | |
| | Model was trained on the first 500 rows of the dataset with RunPod Pytorch 2.4.0, GPU A40 (48 GB VRAM, 50GB RAM 9vCPU). |
| | Duration: 11m 38s |
| |
|
| | From W&B |
| | OS Linux-6.8.0-49-generic-x86_64-with-glibc2.35 |
| | Python version CPython 3.11.10 |
| | |
| | System Hardware |
| | CPU count 48 |
| | Logical CPU count 96 |
| | GPU count 1 |
| | GPU type NVIDIA A40 |
| | |
| | |