Ibisbill/4b_SFT
Tags: Text Generation, Transformers, Safetensors, qwen3, llama-factory, full, Generated from Trainer, conversational, text-generation-inference
License: other
4b_SFT/train_results.json
Ibisbill: Initial upload (commit cecb3a3, verified, 7 months ago)
206 Bytes
{
  "epoch": 2.0,
  "total_flos": 1486449496031232.0,
  "train_loss": 0.24971539962006917,
  "train_runtime": 42813.5195,
  "train_samples_per_second": 7.007,
  "train_steps_per_second": 0.014
}
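The throughput fields in this file can be combined into a few rough totals. The sketch below is not part of the repository. It assumes that `train_runtime` is in seconds and that the two per-second rates are averages over the whole run, which is how the Transformers `Trainer` normally reports them. The helper `summarize` and its output keys are hypothetical names chosen for illustration. Because the stored rates are rounded to three decimals, every derived figure is an approximation, the step-based ones especially.

```python
import json

# Contents of train_results.json, copied from the file shown on this page.
RAW = """{
  "epoch": 2.0,
  "total_flos": 1486449496031232.0,
  "train_loss": 0.24971539962006917,
  "train_runtime": 42813.5195,
  "train_samples_per_second": 7.007,
  "train_steps_per_second": 0.014
}"""


def summarize(results: dict) -> dict:
    """Derive approximate totals from the reported averages.

    Assumption: train_runtime is in seconds and both rates are
    whole-run averages, so rate * runtime recovers a total.
    """
    runtime_s = results["train_runtime"]
    samples = runtime_s * results["train_samples_per_second"]
    steps = runtime_s * results["train_steps_per_second"]
    return {
        "runtime_hours": runtime_s / 3600,
        "approx_samples_seen": samples,
        "approx_samples_per_epoch": samples / results["epoch"],
        "approx_optimizer_steps": steps,
        # Ratio of two rounded rates: a coarse estimate of the
        # effective batch size, not a reported hyperparameter.
        "approx_samples_per_step": samples / steps,
    }


summary = summarize(json.loads(RAW))
for name, value in summary.items():
    print(f"{name}: {value:,.1f}")
```

Under those assumptions the run lasted a little under 12 hours. The samples-per-step ratio is only a loose hint at the effective batch size, since `train_steps_per_second` keeps just two significant digits.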