Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
difanjiao
/
ThinkTwice-Olmo3-7B-Instruct
like
0
Safetensors
English
olmo3
reasoning
math
rlvr
self-refinement
grpo
arxiv:
2604.01591
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
ThinkTwice-Olmo3-7B-Instruct
/
tokenizer.json
Commit History
Upload folder using huggingface_hub
6708be4
verified
difanjiao
commited on
6 days ago