Commit History

Fix Phase 2: fusion layer processes text-only via learnable residual gate for gradient flow
4e9835e

Faaz committed on

Fix: register LLM as nn.Module submodule so optimizer finds LoRA params
cdc806e

Faaz committed on

Fix extra_special_tokens: list to dict for transformers 4.55
02eef51

Faaz committed on

Add requirements-training.txt for MI300X Docker
24b5fb1

Faaz committed on

Remove HSA_OVERRIDE_GFX_VERSION - ROCm 7.0 native MI300X support
4a33f96

Faaz committed on

Track large tokenizer files with Git LFS for HuggingFace
161c946

Faaz committed on

Add GPU diagnostic script, fix architecture loading with low_cpu_mem_usage and sync
5fb9ec3

Faaz committed on

Fix setup_mi300x.sh for Docker container on MI300X droplet
35fd5fc

Faaz committed on

Fix setup_mi300x.sh step 2 + add project context summary
f04f58b

Faaz committed on

Fix train.py: mem -> memory on line 225
1c36b28

Faaz committed on

Day 3 COMPLETE: Full model architecture
2ff5c54

Faaz committed on

Day 2 COMPLETE: 1.48M examples processed, 6GB dataset, WebSight done
59c6c97

Faaz committed on

Day 1 Complete: Tokenizer setup — Qwen2.5-Coder-7B base + 22 MINDI special tokens (vocab 151,685), wrapper class, full format test
11e0d89

Faaz committed on

feat: initial project scaffold for MINDI 1.5 Vision-Coder
553fbf7

Faaz committed on