Fix hidden_size: 4096 -> 3584 to match Qwen2.5-Coder-7B-Instruct (691fc84), Faaz committed 30 days ago
Fix: register LLM as nn.Module submodule so optimizer finds LoRA params (cdc806e), Faaz committed about 1 month ago
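
The submodule fix in cdc806e can be illustrated with a minimal sketch. It assumes PyTorch is installed; the class names (`PlainWrapper`, `BrokenModel`, `FixedModel`), the tiny layer sizes, and the stand-in `nn.Linear` "LLM" are all hypothetical and are not taken from the repository. The point is only that `parameters()` walks registered `nn.Module` attributes, so weights held inside a plain Python object never reach the optimizer.

```python
import torch
from torch import nn


class PlainWrapper:
    """Hypothetical holder that is NOT an nn.Module.

    Its inner layer exists, but a parent module cannot see it.
    """

    def __init__(self, hidden_size: int):
        self.proj = nn.Linear(hidden_size, hidden_size)


class BrokenModel(nn.Module):
    def __init__(self, hidden_size: int):
        super().__init__()
        self.head = nn.Linear(hidden_size, 1)
        # Plain object attribute: stored, but not registered as a submodule.
        self.llm = PlainWrapper(hidden_size)


class FixedModel(nn.Module):
    def __init__(self, hidden_size: int):
        super().__init__()
        self.head = nn.Linear(hidden_size, 1)
        # nn.Module attribute: registered automatically by nn.Module.__setattr__.
        self.llm = nn.Linear(hidden_size, hidden_size)


def count_param_tensors(model: nn.Module) -> int:
    """Number of parameter tensors an optimizer would receive."""
    return len(list(model.parameters()))


if __name__ == "__main__":
    hidden_size = 8  # small size for illustration only
    broken, fixed = BrokenModel(hidden_size), FixedModel(hidden_size)
    print("broken:", count_param_tensors(broken))
    print("fixed:", count_param_tensors(fixed))
    # The optimizer only updates what parameters() yields.
    optimizer = torch.optim.AdamW(fixed.parameters(), lr=1e-4)
```

In the broken variant only the head's weight and bias are visible, so any adapter weights living inside the wrapped object would stay frozen without raising an error.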