Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
diffusion-reasoning
/
wll_kodcode_lr3e-6_iter5000
like
0
Follow
Diffusion LLMs Reasoning
3
Transformers
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
wll_kodcode_lr3e-6_iter5000
/
generation_config.json
Sangwoong
Upload wd1 checkpoint-5000 from local training
834957f
verified
about 1 month ago
raw
Copy download link
history
blame
contribute
delete
Safe
121 Bytes
{
"_from_model_config"
:
true
,
"bos_token_id"
:
126080
,
"eos_token_id"
:
126081
,
"transformers_version"
:
"4.49.0"
}