license: apache-2.0
tags:
  - unsloth
  - trl
  - sft
  - deepseek-r1-distill-llama-8b
datasets:
  - FreedomIntelligence/medical-o1-reasoning-SFT
base_model:
  - unsloth/DeepSeek-R1-Distill-Llama-8B

The model was trained on the first 500 rows of the FreedomIntelligence/medical-o1-reasoning-SFT dataset on RunPod (PyTorch 2.4.0) with one A40 GPU (48 GB VRAM, 50 GB RAM, 9 vCPUs). Training duration: 11m 38s.
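For readers who want to reproduce the data selection, the following is a minimal sketch of how the first 500 rows could be loaded with the Hugging Face `datasets` library. The helper names `first_rows_split` and `load_training_subset` are hypothetical, and the `"en"` configuration is an assumption: this card does not say which dataset configuration or split was used.

```python
def first_rows_split(n_rows: int, split: str = "train") -> str:
    """Build a `datasets` slice string such as 'train[:500]'."""
    if n_rows <= 0:
        raise ValueError("n_rows must be positive")
    return f"{split}[:{n_rows}]"


def load_training_subset(n_rows: int = 500, config: str = "en"):
    """Load the first `n_rows` rows of the dataset named in this card.

    Assumption: the "en" configuration and the "train" split; the card
    does not state which ones were actually used.
    """
    # Imported lazily so first_rows_split works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(
        "FreedomIntelligence/medical-o1-reasoning-SFT",
        config,
        split=first_rows_split(n_rows),
    )
```

Calling `load_training_subset()` downloads the dataset and returns a `Dataset` with at most 500 rows; passing a different `n_rows` changes the slice.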

From W&B:

- OS: Linux-6.8.0-49-generic-x86_64-with-glibc2.35
- Python version: CPython 3.11.10

System hardware:

- CPU count: 48
- Logical CPU count: 96
- GPU count: 1
- GPU type: NVIDIA A40
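To compare another machine against the environment reported above, a small standard-library sketch like the one below can collect similar fields. The function names `describe_environment` and `format_environment` are hypothetical, and this is not how W&B gathers its data. It only reports the logical CPU count, since the standard library does not expose physical core or GPU information.

```python
import os
import platform


def describe_environment() -> dict:
    """Collect OS, Python, and logical CPU fields from the current machine."""
    return {
        "os": platform.platform(),
        "python": f"{platform.python_implementation()} {platform.python_version()}",
        # os.cpu_count() returns the number of logical CPUs, or None if unknown.
        "logical_cpu_count": os.cpu_count(),
    }


def format_environment(info: dict) -> str:
    """Render the collected fields as one 'key: value' line per field."""
    return "\n".join(f"{key}: {value}" for key, value in info.items())
```

Running `print(format_environment(describe_environment()))` on the training machine would print one line per field for a side-by-side comparison.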