# llama-3.2-3b-bitsandbytes-4bit-nf4

This repository contains a quantized model artifact produced as part of a graduation project.
## Model Details
- Technique: BitsAndBytes
- Quantization: NF4 (4-bit)
- Base model: meta-llama/Llama-3.2-3B-Instruct
- Export date: 2026-03-24
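
The following is a minimal sketch of how a 4-bit NF4 export like this one can be produced with bitsandbytes through transformers. The compute dtype, double quantization, and output path are assumptions and are not recorded in this card; only the base model and NF4 quant type come from the details above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

base_id = "meta-llama/Llama-3.2-3B-Instruct"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",               # NF4 data type, as stated above
    bnb_4bit_compute_dtype=torch.bfloat16,   # assumed compute dtype
    bnb_4bit_use_double_quant=True,          # assumed; small extra memory saving
)

model = AutoModelForCausalLM.from_pretrained(
    base_id,
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Persist the quantized checkpoint (requires transformers/bitsandbytes versions
# that support serializing 4-bit weights); the folder name is illustrative.
model.save_pretrained("quantized/4bit-nf4")
tokenizer.save_pretrained("quantized/4bit-nf4")
```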
## Benchmark Summary
| Metric | Original | Quantized |
|---|---|---|
| Model size (GB) | 5.98 | 2.05 |
| Avg inference (sec) | 29.59 | 3.83 |
| Tokens/sec | 3.38 | 26.13 |
| Perplexity | 41.4043 | 37.4797 |
## Comparison Highlights
- Speedup: ~7.7x (29.59 s → 3.83 s average inference, derived from the table above)
- Memory reduction: N/A (no memory figures were recorded)
- Disk/model size reduction: ~66% (5.98 GB → 2.05 GB, derived from the table above)
## Benchmark Notes
- The numbers above are copied from the local benchmark_results JSON in this project.
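
The project's actual benchmark script is not included in this card, so the sketch below is only an illustration of how a perplexity figure like the one in the table can be measured; the evaluation text, context length, and chunking strategy are assumptions.

```python
import math
import torch

def perplexity(model, tokenizer, text, max_len=2048):
    """Chunked perplexity of `model` on `text`, using non-overlapping windows."""
    ids = tokenizer(text, return_tensors="pt").input_ids.to(model.device)
    total_nll, total_tokens = 0.0, 0
    for start in range(0, ids.size(1), max_len):
        chunk = ids[:, start:start + max_len]
        if chunk.size(1) < 2:   # need at least one predicted token
            break
        with torch.no_grad():
            out = model(chunk, labels=chunk)
        # out.loss is the mean NLL over the (len - 1) shifted target tokens
        total_nll += out.loss.item() * (chunk.size(1) - 1)
        total_tokens += chunk.size(1) - 1
    return math.exp(total_nll / total_tokens)
```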
## Local Source
- Quantized folder: Advanced-Techniques/MixedPrecision/quantized/4bit-nf4
- Benchmark JSON: Advanced-Techniques/MixedPrecision/benchmark_results/bitsandbytes_benchmark.json
## Usage
Load this model with a library and runtime that support the quantization technique used here: bitsandbytes 4-bit (NF4) checkpoints are loaded through transformers with bitsandbytes installed and typically require a CUDA-capable GPU.
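
A minimal loading sketch, assuming the repository id matches this card's title; verify the id and your transformers/bitsandbytes versions before use.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "emreyigitozturk/llama-3.2-3b-bitsandbytes-4bit-nf4"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
# The 4-bit quantization settings are read from the saved config, so no
# BitsAndBytesConfig needs to be passed here.
model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize what NF4 quantization does."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

with torch.no_grad():
    output = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```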
## Limitations
- This model card is auto-generated from project files.
- You should validate quality, safety, and license compatibility before public release.