NYCU-RL-Bandits-Lab
rl-bandits-lab
AI & ML interests
Reinforcement learning
Organizations
None yet
models (4)
rl-bandits-lab/ultrafeedback_rm • 8B • Updated Jul 30, 2025
rl-bandits-lab/helpsteer_rm • 8B • Updated Jun 10, 2025 • 1
rl-bandits-lab/hhrlhf_rm • 8B • Updated May 21, 2025 • 5
rl-bandits-lab/translation_rm • 8B • Updated May 21, 2025 • 9
datasets (2)
rl-bandits-lab/SEGALE-WMT24 • Viewer • Updated Nov 5, 2025 • 137k • 9
rl-bandits-lab/SEGALE-WMT24-Human-Eval • Viewer • Updated Nov 5, 2025 • 27k • 13