QizhiPei/DiffGen-8B
Tags: Text Generation, Transformers, Safetensors, qwen3, llama-factory, full, Generated from Trainer, math-reasoning, conversational, text-generation-inference
arXiv: 2509.21070
License: apache-2.0
DiffGen-8B/train_results.json (revision: refs/pr/1)
Last commit: "Upload initial question generator" by QizhiPei (1ab8ba9, 8 months ago)
201 Bytes
{
  "epoch": 1.0,
  "total_flos": 92324696948736.0,
  "train_loss": 0.788116905017593,
  "train_runtime": 3845.5406,
  "train_samples_per_second": 49.839,
  "train_steps_per_second": 0.39
}
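
The throughput fields above can be combined to estimate how large the run was. The following is a minimal sketch, not part of the repository: it assumes `train_runtime` is in seconds and that each throughput value is a count divided by `train_runtime`, which is how the Hugging Face Trainer usually reports speed metrics. The function name `derive_stats` and the output keys are hypothetical, chosen for illustration only.

```python
import json

# Metrics copied verbatim from train_results.json above.
TRAIN_RESULTS = """
{
  "epoch": 1.0,
  "total_flos": 92324696948736.0,
  "train_loss": 0.788116905017593,
  "train_runtime": 3845.5406,
  "train_samples_per_second": 49.839,
  "train_steps_per_second": 0.39
}
"""


def derive_stats(results: dict) -> dict:
    """Back out approximate run-level quantities from the throughput metrics.

    Assumption (not stated in the file): each throughput value equals
    count / train_runtime, with train_runtime measured in seconds.
    """
    runtime = results["train_runtime"]
    samples = results["train_samples_per_second"] * runtime
    steps = results["train_steps_per_second"] * runtime
    return {
        "approx_samples": samples,                 # samples seen over the run
        "approx_steps": steps,                     # optimizer steps taken
        "approx_samples_per_step": samples / steps,  # effective batch size
        "runtime_minutes": runtime / 60.0,
    }


stats = derive_stats(json.loads(TRAIN_RESULTS))
for name, value in stats.items():
    print(f"{name}: {value:.1f}")
```

Under these assumptions the run works out to roughly 191,658 samples, about 1,500 optimizer steps, close to 128 samples per step, and about 64 minutes of wall-clock time. The step-derived figures are coarse because `train_steps_per_second` is reported to only two significant digits.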