PEFT xinyuema/llm-course-hw3-dora Text Generation • 0.3B • Updated Apr 12, 2025 • 3 xinyuema/llm-course-hw3-lora Text Generation • 0.3B • Updated Apr 12, 2025 • 1 xinyuema/llm-course-hw3-tinyllamma-qlora-tokenizer Updated Apr 7, 2025 xinyuema/llm-course-hw3-tinyllama-qlora-model Updated Apr 7, 2025
collactions_of_dpo_and_ppo collactions_of_dpo_and_ppo xinyuema/llm-course-hw2-reward-model-module Text Classification • 0.1B • Updated Mar 30, 2025 xinyuema/llm-course-hw2-dpo 0.1B • Updated Mar 28, 2025 xinyuema/trainer_output Text Classification • 0.1B • Updated Mar 30, 2025 xinyuema/llm-course-hw2-ppo Text Generation • 0.1B • Updated Mar 28, 2025 • 3
PEFT xinyuema/llm-course-hw3-dora Text Generation • 0.3B • Updated Apr 12, 2025 • 3 xinyuema/llm-course-hw3-lora Text Generation • 0.3B • Updated Apr 12, 2025 • 1 xinyuema/llm-course-hw3-tinyllamma-qlora-tokenizer Updated Apr 7, 2025 xinyuema/llm-course-hw3-tinyllama-qlora-model Updated Apr 7, 2025
collactions_of_dpo_and_ppo collactions_of_dpo_and_ppo xinyuema/llm-course-hw2-reward-model-module Text Classification • 0.1B • Updated Mar 30, 2025 xinyuema/llm-course-hw2-dpo 0.1B • Updated Mar 28, 2025 xinyuema/trainer_output Text Classification • 0.1B • Updated Mar 30, 2025 xinyuema/llm-course-hw2-ppo Text Generation • 0.1B • Updated Mar 28, 2025 • 3