llm-hw-3 Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA. spankevich/llm-course-hw3-lora Text Generation • 0.3B • Updated Mar 29, 2025 • 1 spankevich/llm-course-hw3-dora Text Generation • 0.3B • Updated Mar 29, 2025 • 1 spankevich/llm-course-hw3-tinyllamma-qlora Updated Mar 29, 2025
llm-hw-2 collection of ppo, dpo and reward model spankevich/llm-hw-2-dpo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 spankevich/llm-hw-2-ppo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 spankevich/trainer_output Text Classification • 0.1B • Updated Mar 9, 2025 • 1
llm-hw-3 Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA. spankevich/llm-course-hw3-lora Text Generation • 0.3B • Updated Mar 29, 2025 • 1 spankevich/llm-course-hw3-dora Text Generation • 0.3B • Updated Mar 29, 2025 • 1 spankevich/llm-course-hw3-tinyllamma-qlora Updated Mar 29, 2025
llm-hw-2 collection of ppo, dpo and reward model spankevich/llm-hw-2-dpo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 spankevich/llm-hw-2-ppo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 spankevich/trainer_output Text Classification • 0.1B • Updated Mar 9, 2025 • 1