Commit History

Add root config.json (Hub download-stats query file + framework pointer)
1b6adc0
verified

mlboydaisuke commited on

Gemma4-VL: vision bundles + card section
629b28a
verified

mlboydaisuke commited on

Gemma4-VL: add gemma4_e2b_qat_vl_vision
3de1416
verified

mlboydaisuke commited on

Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_aotc_h18p
e9fe1e8
verified

mlboydaisuke commited on

Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym
032adf2
verified

mlboydaisuke commited on

Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_tbl
3e4aa57
verified

mlboydaisuke commited on

card: official-QAT int4 row + section
805955b
verified

mlboydaisuke commited on

E2B QAT gather tables (checkpoint-derived - pair with the QAT bundles)
1ff697a
verified

mlboydaisuke commited on

E2B QAT tbl bundle, precompiled h18p .aimodelc
b50efac
verified

mlboydaisuke commited on

E2B official-QAT int4lin tbl bundle (Mac 78.9, iPhone 30.7 tok/s)
01a42dd
verified

mlboydaisuke commited on

card: iPhone-ready AOT h18p tbl bundle + CoreAIChat Gemma ⚡ mode (chat-surface 32.7/44.2)
dc78015
verified

mlboydaisuke commited on

gemma4 e2b int4lin tbl AOT h18p bundle (CoreAIChat Gemma ⚡ download target)
431f19b
verified

mlboydaisuke commited on

card: GPU-pipelined fast path section (tbl bundle, run contract, AOT + buffer rules)
0a5f979
verified

mlboydaisuke commited on

gpu-pipelined: gemma4 int4lin tbl bundle (PLE table as static graph input; M4 Max 77.0, iPhone 30.3 via AOT)
19a329e
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
9d9d358
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
fbc613f
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
c5799e1
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
2151408
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
2464c01
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
17d640a
verified

mlboydaisuke commited on

ios-ane: revert to the device-verified bucket-64 chunk set (the bucket-512 set jetsams the on-device ANE first compile)
e84203d
verified

mlboydaisuke commited on

gemma4 iPhone re-measure in the Release chat app: GPU 22, ANE 6 tok/s
c3a102a
verified

mlboydaisuke commited on

Run it: device push-verification gotcha + AOT h18p architecture note
ac86e87
verified

mlboydaisuke commited on

macOS best: head+argmax kernel (int8)
4ba4544
verified

mlboydaisuke commited on

macOS best: int8 fused-kernel core (56.6-59 tok/s)
84433ff
verified

mlboydaisuke commited on

macOS best: gather front-end (.aimodel, int8)
517f747
verified

mlboydaisuke commited on

iOS ANE: chunk plan (data-driven engine config)
b1ae5f6
verified

mlboydaisuke commited on

iOS ANE best: head with in-graph argmax (int8)
8707ac4
verified

mlboydaisuke commited on

iOS ANE best: chunk 6/6 (int8, fp16-hardened)
829374e
verified

mlboydaisuke commited on

iOS ANE best: chunk 5/6 (int8, fp16-hardened)
12921ba
verified

mlboydaisuke commited on

iOS ANE best: chunk 4/6 (int8, fp16-hardened)
19ece50
verified

mlboydaisuke commited on

iOS ANE best: chunk 3/6 (int8, fp16-hardened)
ca49168
verified

mlboydaisuke commited on

iOS ANE best: chunk 2/6 (int8, fp16-hardened)
dca92cd
verified

mlboydaisuke commited on

iOS ANE best: chunk 1/6 (int8, fp16-hardened)
30f283c
verified

mlboydaisuke commited on

mmap gather front-end tables (shared: iOS GPU + iOS ANE)
3a30dba
verified

mlboydaisuke commited on

iOS GPU best: int4-kmeans head+argmax kernel
ca8c3c5
verified

mlboydaisuke commited on

iOS GPU best: int4-kmeans fused-kernel core (17.7 tok/s)
400a691
verified

mlboydaisuke commited on

Card: category layout (best verified config per platform x compute-unit)
f54373e
verified

mlboydaisuke commited on

Model card
8df53eb
verified

mlboydaisuke commited on

initial commit
513c05f
verified

mlboydaisuke commited on