Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
RTO-RL
/
Llama3-8B-RTO
like
1
Follow
Reinforced Token Optimization
4
Safetensors
weqweasdas/ultra_train
llama
Model card
Files
Files and versions
xet
Community
main
Llama3-8B-RTO
Commit History
Update README.md
7d45fa0
verified
zkshan2002
commited on
Feb 11, 2025
Update README.md
76e1665
verified
zkshan2002
commited on
Feb 11, 2025
Create README.md
71c49be
verified
zkshan2002
commited on
Dec 29, 2024
initial commit
2e81574
verified
zkshan2002
commited on
Dec 29, 2024
initial commit
fba0fe9
verified
zkshan2002
commited on
Dec 29, 2024