File size: 552 Bytes
2b1379f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
license: apache-2.0
tags:
- unsloth
- trl
- sft
- deepseek-r1-distill-llama-8b
datasets:
- FreedomIntelligence/medical-o1-reasoning-SFT
base_model:
- unsloth/DeepSeek-R1-Distill-Llama-8B
---

Model was trained on the first 500 rows of the dataset with RunPod Pytorch 2.4.0, GPU A40 (48 GB VRAM, 50GB RAM 9vCPU). 
Duration: 11m 38s

From W&B
OS                 Linux-6.8.0-49-generic-x86_64-with-glibc2.35
Python version     CPython 3.11.10

System Hardware
CPU count	        48
Logical CPU count	96
GPU count	        1
GPU type	        NVIDIA A40