Commit History

make it clear you need to make a chat template jinja file
dcb412d

ubergarm committed on

upload smol-IQ4_KSS and post all perplexity data
c1aefbd

ubergarm committed on

Upload folder using huggingface_hub
cfc7a7b
verified

ubergarm committed on

Add AesSedai IQ3_XXS which has good perplexity!
c670fd2

ubergarm committed on

add tensor parallel example for 2x48 GB GPUs
0d7d6c9

ubergarm committed on

add ggml-org Q4_K_M to perplexity graph
6fb2415

ubergarm committed on

add link to working chat template
411bd6f

ubergarm committed on

Upload folder using huggingface_hub
f74f7d1
verified

ubergarm committed on

uploading smol-IQ2_KS
87e3d45

ubergarm committed on

update perplexity graph with official version
b813226

ubergarm committed on

update perplexity graph with experimental data
c32699f

ubergarm committed on

add some perplexity data
084e588

ubergarm committed on

Upload folder using huggingface_hub
ba689ec
verified

ubergarm committed on

Upload folder using huggingface_hub
aee4e84
verified

ubergarm committed on

Upload imatrix-Step-3.5-Flash-BF16.dat with huggingface_hub
ab7f5ee
verified

ubergarm committed on

working now on ik_llama.cpp
74da017

ubergarm committed on

Upload folder using huggingface_hub
8e0aec3
verified

ubergarm committed on

add note and upload mainline compatible IQ4_XS
ccd73d9

ubergarm committed on

add IQ4_XS only for testing for now
9e842dd

ubergarm committed on

redoing convert
fa210bb

ubergarm committed on

initial commit
28a18a8

ubergarm committed on