Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
lewtun
/
does-deepspeed-still-work-sft
like
0
Text Generation
Transformers
Safetensors
trl-lib/Capybara
qwen2
Generated from Trainer
open-r1
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
does-deepspeed-still-work-sft
/
eval_results.json
lewtun
HF Staff
End of training
381693b
verified
about 1 year ago
raw
Copy download link
history
blame
164 Bytes
{
"eval_loss"
:
1.116114616394043
,
"eval_runtime"
:
0.664
,
"eval_samples"
:
200
,
"eval_samples_per_second"
:
301.2
,
"eval_steps_per_second"
:
6.024
}