Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
difanjiao
/
ThinkTwice-Olmo3-7B-Instruct
like
0
Safetensors
English
olmo3
reasoning
math
rlvr
self-refinement
grpo
arxiv:
2604.01591
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
ThinkTwice-Olmo3-7B-Instruct
Commit History
Add model card with paper link to arXiv:2604.01591
d13fb94
verified
difanjiao
commited on
Apr 9
Upload folder using huggingface_hub
6708be4
verified
difanjiao
commited on
Apr 9
initial commit
213c829
verified
difanjiao
commited on
Apr 9