Fix hidden_size: 4096 -> 3584 to match Qwen2.5-Coder-7B-Instruct 691fc84 Faaz committed on 29 days ago
Add WebSight vision data pipeline: download script, image-aware data loader, phase data routing 672896a Faaz committed on 30 days ago
Add GPU diagnostic script, fix architecture loading with low_cpu_mem_usage and sync 5fb9ec3 Faaz committed on 30 days ago
Day 2 COMPLETE: 1.48M examples processed, 6GB dataset, WebSight done 59c6c97 Faaz committed on about 1 month ago
Day 1 Complete: Tokenizer setup — Qwen2.5-Coder-7B base + 22 MINDI special tokens (vocab 151,685), wrapper class, full format test 11e0d89 Faaz committed on about 1 month ago