Model card: multimodal embeddings now run on GPU (LAST pooling end-state); re-ingest note fb3493a verified SEBK4C commited on 22 days ago
GPU multimodal embeddings: Gemma 4 12B native end-state path (LAST pooling + patched llamafile). image/audio embed on GPU; --search-file works on GPU. Verified on 2x RTX 4090. b246116 verified SEBK4C commited on 22 days ago
Model card: correct multimodal-embedding framing (Gemma 4 12B native interleave; dedicated graph is the stopgap) 2bf823b verified SEBK4C commited on 22 days ago
Model card: clarify GPU multimodal behavior (media indexes via text bridge; raw vectors auto-skipped on GPU) ef80674 verified SEBK4C commited on 22 days ago
Fix GPU multimodal ingest: orchestrator skips raw-media-embedding on GPU (was 0 chunks for any media corpus); media now indexes via text bridge. Verified on 2x RTX 4090. 04d4490 verified SEBK4C commited on 22 days ago
Model card: GPU usage section + hardware-verified note (2x RTX 4090, CUDA 12.8) 9918fd6 verified SEBK4C commited on 22 days ago
GPU-capable build: orchestrator honors --gpu (-ngl offload), verified end-to-end on 2x RTX 4090 (CUDA 12.8); includes macOS embedded-payload extraction fix 6aae262 verified SEBK4C commited on 22 days ago