pradeepannepu/slm02 (Quantized)

Description

This model is a 4-bit quantized version of pradeepannepu/slm02, produced with bitsandbytes (bnb).

Quantization Details

  • Quantization Type: int4
  • bnb_4bit_quant_type: nf4
  • bnb_4bit_use_double_quant: True
  • bnb_4bit_compute_dtype: bfloat16
  • bnb_4bit_quant_storage: uint8
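
The settings above map onto the `BitsAndBytesConfig` class in the transformers library. The following is a minimal sketch, not the author's actual script, of how the original model could be quantized with the same settings at load time. It assumes that pradeepannepu/slm02 is a causal language model (hence `AutoModelForCausalLM`), and that `torch`, `transformers`, `accelerate`, and `bitsandbytes` are installed on a machine with a supported GPU. The names `QUANT_SETTINGS` and `load_quantized` are hypothetical helpers introduced for illustration.

```python
# Quantization settings copied from the list above.
# Dtypes are kept as strings here so the settings can be inspected
# without importing torch.
QUANT_SETTINGS = {
    "load_in_4bit": True,                  # Quantization Type: int4
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
    "bnb_4bit_compute_dtype": "bfloat16",
    "bnb_4bit_quant_storage": "uint8",
}


def load_quantized(repo_id="pradeepannepu/slm02"):
    """Load `repo_id` and quantize it to 4 bits with the settings above.

    Assumption: the checkpoint is a causal language model. Swap in a
    different Auto* class if it is not.
    """
    # Heavy imports live inside the function so that merely defining it
    # does not require a GPU environment.
    import torch
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig

    kwargs = dict(QUANT_SETTINGS)
    # Convert the dtype names into torch dtypes.
    kwargs["bnb_4bit_compute_dtype"] = getattr(torch, kwargs["bnb_4bit_compute_dtype"])
    kwargs["bnb_4bit_quant_storage"] = getattr(torch, kwargs["bnb_4bit_quant_storage"])

    config = BitsAndBytesConfig(**kwargs)
    return AutoModelForCausalLM.from_pretrained(
        repo_id,
        quantization_config=config,
        device_map="auto",  # requires the accelerate package
    )
```

Calling `load_quantized()` downloads the original weights and quantizes them on the fly. An already quantized checkpoint normally stores these settings in its own configuration, so it can usually be loaded with `from_pretrained` alone, without passing `quantization_config`.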

Model Details

  • Format: Safetensors
  • Model size: 18.8M params
  • Tensor types: F32, U8

Model tree for pradeepannepu/slm02-bnb-4bit

  • Quantized (3): this model