Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Sergey Pankevich
spankevich
Follow
PankevichSergey
AI & ML interests
None yet
Organizations
None yet
spankevich
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
collection
11 months ago
llm-hw-2
Collection
collection of ppo, dpo and reward model
•
3 items
•
Updated
Mar 9, 2025
•
1