llama-3.1-8b-clinical-thinking-grpo / model.safetensors.index.json

Commit History

GRPO-trained model from checkpoint-2450
75928d5
verified

CodCodingCode commited on