Fix Phase 2: fusion layer processes text-only via learnable residual gate for gradient flow 4e9835e Faaz commited on Apr 15
Fix: register LLM as nn.Module submodule so optimizer finds LoRA params cdc806e Faaz commited on Apr 15
Add GPU diagnostic script, fix architecture loading with low_cpu_mem_usage and sync 5fb9ec3 Faaz commited on Apr 15