llm-course-hw3 01eg0/llm-course-hw3-lora Text Generation • 0.3B • Updated Nov 23, 2025 01eg0/llm-course-hw3-tinyllama-qlora Updated Nov 23, 2025 01eg0/llm-course-hw3-dora Text Generation • 0.3B • Updated Nov 23, 2025 01eg0/llm-course-hw3-tinyllamma-qlora Updated Nov 23, 2025
llm-course-hw2 01eg0/llm-course-hw2-ppo Text Generation • 0.1B • Updated Nov 9, 2025 01eg0/llm-course-hw2-dpo Text Generation • 0.1B • Updated Nov 9, 2025 01eg0/llm-course-hw2-reward-model Text Classification • 0.1B • Updated Nov 9, 2025 • 2
llm-course-hw3 01eg0/llm-course-hw3-lora Text Generation • 0.3B • Updated Nov 23, 2025 01eg0/llm-course-hw3-tinyllama-qlora Updated Nov 23, 2025 01eg0/llm-course-hw3-dora Text Generation • 0.3B • Updated Nov 23, 2025 01eg0/llm-course-hw3-tinyllamma-qlora Updated Nov 23, 2025
llm-course-hw2 01eg0/llm-course-hw2-ppo Text Generation • 0.1B • Updated Nov 9, 2025 01eg0/llm-course-hw2-dpo Text Generation • 0.1B • Updated Nov 9, 2025 01eg0/llm-course-hw2-reward-model Text Classification • 0.1B • Updated Nov 9, 2025 • 2