Reinforced Token Optimization
AI & ML interests
None defined yet.
Team members
1
RTO-RL's activity
zkshan2002 published 2 models 12 months ago
RTO-RL/Llama3-8B-RTO_RPP • 8B • Updated Apr 10, 2025 • 3 • 1
RTO-RL/Llama3-8B-RPP • 8B • Updated Apr 10, 2025 • 3 • 1
zkshan2002 published a model about 1 year ago
RTO-RL/Llama3-8B-TDPO • 8B • Updated Feb 11, 2025 • 3 • 1
zkshan2002 updated a model about 1 year ago
RTO-RL/Llama3-8B-TDPO • 8B • Updated Feb 11, 2025 • 3 • 1
zkshan2002 published a model about 1 year ago
RTO-RL/Llama3-8B-SimPO • 8B • Updated Feb 11, 2025 • 17
zkshan2002 updated a model about 1 year ago
RTO-RL/Llama3-8B-SimPO • 8B • Updated Feb 11, 2025 • 17
zkshan2002 published a model about 1 year ago
RTO-RL/Llama3-8B-RDPO • 8B • Updated Feb 11, 2025 • 2 • 1
zkshan2002 updated a model about 1 year ago
RTO-RL/Llama3-8B-RDPO • 8B • Updated Feb 11, 2025 • 2 • 1
zkshan2002 published a model about 1 year ago
RTO-RL/Llama3-8B-PPO • 8B • Updated Feb 11, 2025 • 14 • 1
zkshan2002 updated 5 models about 1 year ago
RTO-RL/Llama3-8B-PPO • 8B • Updated Feb 11, 2025 • 14 • 1
RTO-RL/Llama3-8B-RTO • 8B • Updated Feb 11, 2025 • 17 • 1
RTO-RL/Llama3.2-1B-RewardModel • 1B • Updated Feb 11, 2025 • 6
RTO-RL/Llama3-8B-RewardModel • 8B • Updated Feb 11, 2025
RTO-RL/Llama3-8B-DPO • 8B • Updated Feb 11, 2025 • 15
zkshan2002 published a model about 1 year ago
RTO-RL/Llama3-8B-RTO • 8B • Updated Feb 11, 2025 • 17 • 1