47bf85e 6dd9e69
1
2
3
4
5
6
7
8
9
10
--- {} --- ## Finetune of DeepSeek-R1 Llama-3-8B distil - Dataset: alignment_1 - Dataset size: 200 samples - Epochs: 10