A set of models from my experiments with Reinforcement Learning from Human Feedback
Samir R.
sr5434
AI & ML interests
NLP
Recent Activity
updated
a collection
3 days ago
RLHF Models updated
a collection
3 days ago
RLHF Models updated
a collection
3 days ago
RLHF Models Organizations
None yet