Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pbedrin
's Collections
llm-course-hw3
llm-course-hw2
llm-course-hw2
updated
Mar 30
VK LLM Course. Задание #2. Дообучение LLM методами DPO и PPO
Upvote
-
pbedrin/llm-course-hw2-reward-model
Text Classification
•
0.1B
•
Updated
Mar 30
•
9
pbedrin/llm-course-hw2-dpo
Text Generation
•
0.1B
•
Updated
Mar 30
•
8
pbedrin/llm-course-hw2-ppo
Text Generation
•
0.1B
•
Updated
Mar 30
•
7
Upvote
-
Share collection
View history
Collection guide
Browse collections