Fornit
/

DeepSeek-R1-Medical-COT-LORA

deepseek-r1-distill-llama-8b

Model card Files Files and versions

DeepSeek-R1-Medical-COT-LORA / README.md

Fornit's picture

Update README.md

2b1379f verified about 1 year ago

|

history blame contribute delete

552 Bytes

	---
	license: apache-2.0
	tags:
	- unsloth
	- trl
	- sft
	- deepseek-r1-distill-llama-8b
	datasets:
	- FreedomIntelligence/medical-o1-reasoning-SFT
	base_model:
	- unsloth/DeepSeek-R1-Distill-Llama-8B
	---

	Model was trained on the first 500 rows of the dataset with RunPod Pytorch 2.4.0, GPU A40 (48 GB VRAM, 50GB RAM 9vCPU).
	Duration: 11m 38s

	From W&B
	OS Linux-6.8.0-49-generic-x86_64-with-glibc2.35
	Python version CPython 3.11.10

	System Hardware
	CPU count 48
	Logical CPU count 96
	GPU count 1
	GPU type NVIDIA A40