Commit History

replace fp32 model with one with attention layers
c916fce

Liman committed on

onnx model with attention
998e725

Liman committed on

rename q8 model
a5f4313

Liman committed on

add quantized models
18a990c

Liman committed on

onnx model
8d104f2

Liman committed on

initial commit
b4850a4
verified

limanup committed on