Add root config.json (Hub download-stats query file + framework pointer) 1b6adc0 verified mlboydaisuke commited on 15 days ago
Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_aotc_h18p e9fe1e8 verified mlboydaisuke commited on 15 days ago
Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym 032adf2 verified mlboydaisuke commited on 15 days ago
Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_tbl 3e4aa57 verified mlboydaisuke commited on 15 days ago
E2B QAT gather tables (checkpoint-derived - pair with the QAT bundles) 1ff697a verified mlboydaisuke commited on 16 days ago
E2B official-QAT int4lin tbl bundle (Mac 78.9, iPhone 30.7 tok/s) 01a42dd verified mlboydaisuke commited on 16 days ago
card: iPhone-ready AOT h18p tbl bundle + CoreAIChat Gemma ⚡ mode (chat-surface 32.7/44.2) dc78015 verified mlboydaisuke commited on 17 days ago
gemma4 e2b int4lin tbl AOT h18p bundle (CoreAIChat Gemma ⚡ download target) 431f19b verified mlboydaisuke commited on 17 days ago
card: GPU-pipelined fast path section (tbl bundle, run contract, AOT + buffer rules) 0a5f979 verified mlboydaisuke commited on 17 days ago
gpu-pipelined: gemma4 int4lin tbl bundle (PLE table as static graph input; M4 Max 77.0, iPhone 30.3 via AOT) 19a329e verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) 9d9d358 verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) fbc613f verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) c5799e1 verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) 2151408 verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) 2464c01 verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) 17d640a verified mlboydaisuke commited on 17 days ago
ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile) e84203d verified mlboydaisuke commited on 17 days ago
gemma4 iPhone re-measure in the Release chat app: GPU 22, ANE 6 tok/s c3a102a verified mlboydaisuke commited on 17 days ago
Run it: device push-verification gotcha + AOT h18p architecture note ac86e87 verified mlboydaisuke commited on 17 days ago
macOS best: int8 fused-kernel core (56.6-59 tok/s) 84433ff verified mlboydaisuke commited on 17 days ago
iOS ANE: chunk plan (data-driven engine config) b1ae5f6 verified mlboydaisuke commited on 17 days ago
mmap gather front-end tables (shared: iOS GPU + iOS ANE) 3a30dba verified mlboydaisuke commited on 17 days ago
iOS GPU best: int4-kmeans fused-kernel core (17.7 tok/s) 400a691 verified mlboydaisuke commited on 17 days ago
Card: category layout (best verified config per platform x compute-unit) f54373e verified mlboydaisuke commited on 17 days ago