Commit History

Revert to F32 KV: int8 KV bridge produced DEQUANTIZE 1-dim + STABLEHLO_COMPOSITE missing data → Metal delegate rejects
224e164
verified

sarmientoF commited on

int8 KV cache export (cache_update_composite + int8_kv_bridge)
3971892
verified

sarmientoF commited on

GPU-compatible export: no cache_update_composite, cache_length=4096, lora_rank=8
83de808
verified

sarmientoF commited on

Re-export cache_length=4096 (was 32768) for 6GB devices
0b3ad5a
verified

sarmientoF commited on

Replace with Modal-exported 32k bundle (int8 KV, LoRA r8)
c0c3eda
verified

sarmientoF commited on

Replace weight_only with dynamic_wi4_afp32 (fixes gibberish)
9957c4e
verified

sarmientoF commited on

Replace weight_only with dynamic_wi4_afp32 (fixes gibberish)
38517fa
verified

sarmientoF commited on

Upload gemma4-android-2k.litertlm with huggingface_hub
f74a45b
verified

sarmientoF commited on

Upload gemma4-base-2k.litertlm with huggingface_hub
5f95fca
verified

sarmientoF commited on

Upload gemma4-base.litertlm with huggingface_hub
a486421
verified

sarmientoF commited on

Upload gemma4-lora-8k.litertlm with huggingface_hub
4d24cf8
verified

sarmientoF commited on

Upload gemma4-base.litertlm with huggingface_hub
249942f
verified

sarmientoF commited on

Upload gemma4-base.litertlm with huggingface_hub
71dc037
verified

sarmientoF commited on

Upload gemma4-base.litertlm with huggingface_hub
56afb0f
verified

sarmientoF commited on

Upload adapters/pirate-scale-4.5.tflite with huggingface_hub
26a77e8
verified

sarmientoF commited on

Upload adapters/alpaca-2ep.tflite with huggingface_hub
c72ec29
verified

sarmientoF commited on

Upload adapters/alpaca-3ep.tflite with huggingface_hub
7cb5925
verified

sarmientoF commited on

Upload adapters/alpaca-3ep-scale-0.625.tflite with huggingface_hub
58734a5
verified

sarmientoF commited on

Upload adapters/qat-test.tflite with huggingface_hub
b24c7f8
verified

sarmientoF commited on

Upload adapters/alpaca-1ep.tflite with huggingface_hub
6f6bf12
verified

sarmientoF commited on

Upload adapters/adapter.tflite with huggingface_hub
55a100d
verified

sarmientoF commited on

Upload adapters/alpaca-1k.tflite with huggingface_hub
0636a5b
verified

sarmientoF commited on

Upload adapters/pirate-v9-scale-0.25.tflite with huggingface_hub
85764d0
verified

sarmientoF commited on

Upload gemma4-base.litertlm with huggingface_hub
4e4d23c
verified

sarmientoF commited on

initial commit
a9eb7ec
verified

sarmientoF commited on