NYCU-RL-Bandits-Lab
rl-bandits-lab
AI & ML interests: Reinforcement learning
Organizations: None yet
rl-bandits-lab's activity
updated a model 26 days ago
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA • Updated 26 days ago
published a model 27 days ago
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA • Updated 26 days ago
updated a model 27 days ago
rl-bandits-lab/AskR-Qwen2.5-7B-Instruct-LoRA • Text Generation • Updated 27 days ago • 27
published a model 27 days ago
rl-bandits-lab/AskR-Qwen2.5-7B-Instruct-LoRA • Text Generation • Updated 27 days ago • 27
updated 2 datasets 6 months ago
rl-bandits-lab/SEGALE-WMT24 • Viewer • Updated Nov 5, 2025 • 137k • 167
rl-bandits-lab/SEGALE-WMT24-Human-Eval • Viewer • Updated Nov 5, 2025 • 27k • 911
published 2 datasets 6 months ago
rl-bandits-lab/SEGALE-WMT24-Human-Eval • Viewer • Updated Nov 5, 2025 • 27k • 911
rl-bandits-lab/SEGALE-WMT24 • Viewer • Updated Nov 5, 2025 • 137k • 167
updated a model 10 months ago
rl-bandits-lab/ultrafeedback_rm • 8B • Updated Jul 30, 2025 • 1
published a model 10 months ago
rl-bandits-lab/ultrafeedback_rm • 8B • Updated Jul 30, 2025 • 1
updated a model 11 months ago
rl-bandits-lab/helpsteer_rm • 8B • Updated Jun 10, 2025 • 4
published a model 11 months ago
rl-bandits-lab/helpsteer_rm • 8B • Updated Jun 10, 2025 • 4
updated a model 12 months ago
rl-bandits-lab/hhrlhf_rm • 8B • Updated May 21, 2025 • 2
published a model 12 months ago
rl-bandits-lab/hhrlhf_rm • 8B • Updated May 21, 2025 • 2
updated a model 12 months ago
rl-bandits-lab/translation_rm • 8B • Updated May 21, 2025 • 1
published a model 12 months ago
rl-bandits-lab/translation_rm • 8B • Updated May 21, 2025 • 1