DeepSeek-R1-Medical-COT is a 4-bit fine-tuned language model optimized for medical reasoning and clinical scenario interpretation.
It is based on unsloth/DeepSeek-R1-Distill-Llama-8B and fine-tuned on the FreedomIntelligence/medical-o1-reasoning-SFT dataset to provide structured, step-by-step clinical reasoning and evidence-based conclusions.
from unsloth import FastLanguageModel

model_name = "DeepSeek-R1-Medical-COT"

# Load the model and its tokenizer in 4-bit precision
model, tokenizer = FastLanguageModel.from_pretrained(model_name, load_in_4bit=True)
# Switch Unsloth to its inference mode before generating
FastLanguageModel.for_inference(model)

# Example inference
prompt = """
### Clinical Scenario:
A 54-year-old man complains of frequent urinary urgency, nocturia, and a weak urinary stream. His prostate is moderately enlarged. Predict likely cystometric findings.
"""
# Tokenize the prompt and move the tensors to the GPU
inputs = tokenizer([prompt], return_tensors="pt").to("cuda")
outputs = model.generate(
    input_ids=inputs.input_ids,
    attention_mask=inputs.attention_mask,
    max_new_tokens=500,
)
# The decoded text contains the prompt followed by the generated continuation
print(tokenizer.decode(outputs[0]))
The model writes its step-by-step reasoning inside <think>...</think> tags.
Recommendation: Always review model outputs with a qualified healthcare professional.
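Because the reasoning is wrapped in <think>...</think> tags, it can be useful to separate it from the text that follows before showing a result to a reviewer. The snippet below is a minimal sketch of one way to do that with a regular expression. The helper name `split_reasoning` and the sample string are illustrative assumptions, not part of the model or its tokenizer, and the sketch assumes the decoded output contains at most one such block.

```python
import re

# Hypothetical helper (not provided by the model card): matches the first
# <think>...</think> block, including any newlines inside it.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is "" when no <think> block exists."""
    match = _THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything after the closing tag is treated as the final answer.
    answer = text[match.end():].strip()
    return reasoning, answer


# Made-up sample string standing in for decoded model output.
sample = "<think>Obstruction raises outlet resistance.</think>\nLikely finding: reduced flow rate."
reasoning, answer = split_reasoning(sample)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

In practice the input would be the string returned by `tokenizer.decode(outputs[0])`. That string also contains the prompt, which this sketch discards only if it appears before the <think> block.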
Mohamed Adel (2026). DeepSeek-R1-Medical-COT. Retrieved from https://huggingface.co/DeepSeek-R1-Medical-COT
Base model
deepseek-ai/DeepSeek-R1-Distill-Llama-8B