Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Hiring 💼
NYCU-RL-Bandits-Lab
rl-bandits-lab
Follow
rl-bandits-lab
AI & ML interests
Reinforcement learning
Recent Activity
updated
a model
26 days ago
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA
published
a model
27 days ago
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA
updated
a model
27 days ago
rl-bandits-lab/AskR-Qwen2.5-7B-Instruct-LoRA
View all activity
Organizations
None yet
models
6
Sort:Â Recently updated
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA
Updated
26 days ago
rl-bandits-lab/AskR-Qwen2.5-7B-Instruct-LoRA
Text Generation
•
Updated
27 days ago
•
27
rl-bandits-lab/ultrafeedback_rm
8B
•
Updated
Jul 30, 2025
•
1
rl-bandits-lab/helpsteer_rm
8B
•
Updated
Jun 10, 2025
•
4
rl-bandits-lab/hhrlhf_rm
8B
•
Updated
May 21, 2025
•
2
rl-bandits-lab/translation_rm
8B
•
Updated
May 21, 2025
•
1
datasets
2
Sort:Â Recently updated
rl-bandits-lab/SEGALE-WMT24
Viewer
•
Updated
Nov 5, 2025
•
137k
•
167
rl-bandits-lab/SEGALE-WMT24-Human-Eval
Viewer
•
Updated
Nov 5, 2025
•
27k
•
911