Omdano/INT8-H16P

Commit History

Re-quantize with float16 for T4 GPU compatibility

c68d969
verified

Omdano committed on Oct 5, 2025

Remove model.safetensors - use pytorch_model.bin with INT8 weights

29acac0
verified

Omdano committed on Oct 5, 2025

Add back TorchAO INT8 quantization_config for proper loading

1209faf
verified

Omdano committed on Oct 5, 2025

Remove quantization_config to avoid BitsAndBytes imports

c7f451a
verified

Omdano committed on Oct 5, 2025

Upload folder using huggingface_hub

e020402
verified

Omdano committed on Oct 5, 2025

Upload README.md with huggingface_hub

8eada2c
verified

Omdano committed on Oct 5, 2025

Upload folder using huggingface_hub

8bf2e5c
verified

Omdano committed on Oct 5, 2025

initial commit

4f8176a
verified

Omdano committed on Oct 5, 2025