Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Vibudhbh
/
gpt2-rlhf-implementation
like
0
Text Generation
Transformers
Safetensors
Anthropic/hh-rlhf
gpt2
rlhf
reinforcement-learning-from-human-feedback
anthropic-hh-rlhf
chatgpt-style-training
ppo
supervised-fine-tuning
human-preferences
ai-alignment
text-generation-inference
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
gpt2-rlhf-implementation
Commit History
Add training metadata
d6754ac
verified
Vibudhbh
commited on
Oct 2
Add comprehensive model card
baca339
verified
Vibudhbh
commited on
Oct 2
Upload RLHF-trained GPT-2 model
7d66f27
verified
Vibudhbh
commited on
Oct 2
initial commit
b14994e
verified
Vibudhbh
commited on
Oct 2