llm-course-hw3-PEFT
Fine-tuning LLMs with PEFT methods
MurDanya/llm-course-hw3-lora 0.3B • Updated Apr 12, 2025 • 2
MurDanya/llm-course-hw3-dora 0.3B • Updated Apr 12, 2025
MurDanya/llm-course-hw3-tinyllama-qlora Updated Apr 12, 2025
llm-course-hw2-alignment
Fine-tuning models with DPO and PPO
MurDanya/llm-course-hw2-dpo 0.1B • Updated Mar 30, 2025 • 2
MurDanya/llm-course-hw2-reward-model Text Classification • 0.1B • Updated Mar 30, 2025 • 1
MurDanya/llm-course-hw2-ppo 0.1B • Updated Mar 30, 2025 • 1