Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
zheminh
/
DeepSeek-R1-Distill-Qwen-7B-RFT-R1-Distill-ckpt400
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
DeepSeek-R1-Distill-Qwen-7B-RFT-R1-Distill-ckpt400
/
images
358 kB
1 contributor
History:
1 commit
zheminh
Add files using upload-large-folder tool
1ae62b4
verified
10 months ago
eval_loss.png
19.5 kB
Add files using upload-large-folder tool
10 months ago
eval_runtime.png
18.7 kB
Add files using upload-large-folder tool
10 months ago
eval_samples_per_second.png
23.2 kB
Add files using upload-large-folder tool
10 months ago
eval_steps_per_second.png
20 kB
Add files using upload-large-folder tool
10 months ago
eval_token_acc.png
19.4 kB
Add files using upload-large-folder tool
10 months ago
train_epoch.png
16.8 kB
Add files using upload-large-folder tool
10 months ago
train_grad_norm.png
22.9 kB
Add files using upload-large-folder tool
10 months ago
train_learning_rate.png
21.8 kB
Add files using upload-large-folder tool
10 months ago
train_loss.png
36.5 kB
Add files using upload-large-folder tool
10 months ago
train_memory(GiB).png
15.9 kB
Add files using upload-large-folder tool
10 months ago
train_token_acc.png
42.8 kB
Add files using upload-large-folder tool
10 months ago
train_total_flos.png
14.3 kB
Add files using upload-large-folder tool
10 months ago
train_train_loss.png
15.8 kB
Add files using upload-large-folder tool
10 months ago
train_train_runtime.png
15.1 kB
Add files using upload-large-folder tool
10 months ago
train_train_samples_per_second.png
14.3 kB
Add files using upload-large-folder tool
10 months ago
train_train_speed(iter_s).png
23.6 kB
Add files using upload-large-folder tool
10 months ago
train_train_steps_per_second.png
17.4 kB
Add files using upload-large-folder tool
10 months ago