card: model.onnx + INT8 ONNX are bundled (fix not-bundled note) d0a4180 verified faxenoff commited on 3 days ago
Add ONNX: FP32 model.onnx (+data) + INT8 model_int8qdt.onnx (engine-build source) 4db7729 verified faxenoff commited on 3 days ago
Add compiled engines: TRT win sm_120 + OV (cpu/igpu/npu) + TVM win vulkan + tokenizer 0807aba verified faxenoff commited on 3 days ago