TobDeBer/SmartQuant

30.1 GB

3 contributors

History: 16 commits

Latest commit: Delete llama-server-6343-cuda (aad2b42, verified, 7 months ago)

| File | Size | Storage | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 2.2 kB | | Rename llama-quantize to llama-quantize-sq | 7 months ago |
| README.md | 405 Bytes | | Update README.md | about 1 year ago |
| SmartQuant-Falcon-H1-0.5B-Instruct.gguf | 275 MB | xet | Upload SmartQuant-Falcon-H1-0.5B-Instruct.gguf with huggingface_hub | 10 months ago |
| SmartQuant-Llama-3.3-70B-Instruct.gguf | 21 GB | xet | Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf | 12 months ago |
| SmartQuant-granite-3.3-8b-instruct.gguf | 5.84 GB | xet | Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf | 12 months ago |
| Tiny-Moe.Q6_K_T3.gguf | 84.7 MB | xet | Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub | 8 months ago |
| calibration_datav3.txt | 280 kB | | add quantization tool | 12 months ago |
| granite-4.0-tiny-preview-iq4_xs_T3UD.gguf | 2.9 GB | xet | Upload granite-4.0-tiny-preview-iq4_xs_T3UD.gguf with huggingface_hub | 7 months ago |
| llama-quantize-sq | 2.78 MB | xet | Rename llama-quantize to llama-quantize-sq | 7 months ago |