Commit History

Q8 result: thrashes on 16 GB (CPU_REPACK doubles memory)
8270847
verified

waltgrace commited on

Add Gemma 4 benchmark results
ae909ec
verified

waltgrace commited on

Update MLX sniper link to 5.4 tok/s
8320889
verified

waltgrace commited on

Update MLX sniper link: 5.2 tok/s on 35B
6fa45be
verified

waltgrace commited on

Add full README with verified benchmarks and research findings
c94ca8d
verified

waltgrace commited on

Add common/arg.cpp
9f84be1
verified

waltgrace commited on

Add common/common.h
6ce9fdd
verified

waltgrace commited on

Add common/common.cpp
6123ae4
verified

waltgrace commited on

Add src/CMakeLists.txt
a8c1934
verified

waltgrace commited on

Add src/llama-expert-cache.h
c8ad244
verified

waltgrace commited on

Add src/llama-expert-cache.cpp
1fa479d
verified

waltgrace commited on

Add src/llama-expert-cache-ctx.h
715943d
verified

waltgrace commited on

Add src/llama-expert-cache-ctx.cpp
f3ad8b8
verified

waltgrace commited on

initial commit
43397e4
verified

waltgrace commited on