Delete _work/fp16/tokenizer.json with huggingface_hub 0e4cecc verified zeriworkspace committed on Mar 16
Delete _work/fp16/special_tokens_map.json with huggingface_hub 86a445b verified zeriworkspace committed on Mar 16
Delete _work/fp16/generation_config.json with huggingface_hub cb49c49 verified zeriworkspace committed on Mar 16
Delete _work/fp16/chat_template.jinja with huggingface_hub c6ad7f1 verified zeriworkspace committed on Mar 16
Upload onnx/model_quantized.onnx with huggingface_hub a396cb3 verified zeriworkspace committed on Mar 16
Delete _work/q8/model_quantized.onnx with huggingface_hub 6266400 verified zeriworkspace committed on Mar 16
Delete _work/fp32/tokenizer_config.json with huggingface_hub 9c6a0e6 verified zeriworkspace committed on Mar 16
Delete _work/fp32/tokenizer.json with huggingface_hub 160822f verified zeriworkspace committed on Mar 16
Delete _work/fp32/special_tokens_map.json with huggingface_hub 618cc7d verified zeriworkspace committed on Mar 16
Delete _work/fp32/generation_config.json with huggingface_hub 758c858 verified zeriworkspace committed on Mar 16
Delete _work/fp32/chat_template.jinja with huggingface_hub 889d945 verified zeriworkspace committed on Mar 16
Delete _work/fp16/tokenizer_config.json with huggingface_hub 0532f5c verified zeriworkspace committed on Mar 16
Delete _work/fp16/tokenizer.json with huggingface_hub 36e6f10 verified zeriworkspace committed on Mar 16
Delete _work/fp16/special_tokens_map.json with huggingface_hub 4cecdd9 verified zeriworkspace committed on Mar 16
Delete _work/fp16/generation_config.json with huggingface_hub 1c1de5c verified zeriworkspace committed on Mar 16
Delete _work/fp16/chat_template.jinja with huggingface_hub 9dcca04 verified zeriworkspace committed on Mar 16
fix: re-export INT8 ONNX with KV cache (text-generation-with-past) 615f662 verified zeriworkspace committed on Mar 16
chore: remove root model.onnx (moved to onnx/model_int8.onnx) 7623082 verified zeriworkspace committed on Mar 16
feat: add INT8 quantized ONNX model at onnx/model_int8.onnx c947cc3 verified zeriworkspace committed on Mar 16
feat: v2 retrained ONNX INT8 model (native FunctionGemma format) b98762c verified zeriworkspace committed on Mar 16
Delete onnx/model_quantized.onnx with huggingface_hub b5269ce verified zeriworkspace committed on Mar 15
Upload onnx/model_quantized.onnx with huggingface_hub ade38c9 verified zeriworkspace committed on Mar 15
Delete onnx/model_quantized.onnx with huggingface_hub 9fc13b1 verified zeriworkspace committed on Mar 15
Upload onnx/model_quantized.onnx with huggingface_hub 7e008d7 verified zeriworkspace committed on Mar 15