A collection of models that were finetuned in hw3 of LLM course in HSE.
Iurii Pustovalov
xiryss
·
AI & ML interests
None yet
Organizations
None yet
models 9
xiryss/llm-course-hw3-dora
Text Generation • 0.3B • Updated • 1
xiryss/llm-course-hw3-lora
Text Generation • 0.3B • Updated
xiryss/llm-course-hw3-tinyllama-qlora
Updated
xiryss/llm-course-hw3-tinyllamma-qlora
Updated
xiryss/llm-course-hw2-ppo
Text Generation • 0.1B • Updated • 1
xiryss/llm-course-hw2-dpo
Text Generation • 0.1B • Updated • 6
xiryss/llm-course-hw2-reward-model
Text Classification • 0.1B • Updated
xiryss/reward_model_output
Text Classification • 0.1B • Updated
xiryss/llm-course-hw1
Text Generation • Updated • 1
datasets 0
None public yet