Alignment - a georgebu Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

georgebu 's Collections

Alignment

updated May 16, 2025

DPO model, PPO model, reward model

georgebu/reward_model

Text Classification • 0.1B • Updated Mar 28, 2025 • 2
georgebu/dpo_model

Text Generation • 0.1B • Updated Mar 28, 2025 • 3
georgebu/ppo_model

Text Generation • 0.1B • Updated Mar 28, 2025 • 1

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs