--- {} --- ## Finetune of DeepSeek-R1 Llama-3-8B distil - Dataset: alignment_1 - Dataset size: 200 samples - Epochs: 10