Upload quantized model and evaluation results

Files changed (7)

README.md ADDED

+# 🧠 Model: Alexnet
+This is a quantized version of `Alexnet` using `W8A8 static` quantization.
+## 🧪 Evaluation Summary
+| Metric         | FP32  | Quantized          |
+|----------------|-------|--------------------|
+| Top-1 Accuracy | 0.555 | 0.5357142686843872 |
+| Top-5 Accuracy | 0.777 | 0.788690447807312  |
+- Dataset: `ImageNet`
+- Evaluation Date: `2025-07-18`
+## 🔍 Notes
+Quantized W8A8 AlexNet model from torchvision
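
For readers checking the Evaluation Summary, the following is a minimal sketch of how top-k accuracy is conventionally computed. It is not the evaluation script used in this commit (that script is not part of the diff); `topk_accuracy`, `scores`, and `labels` are hypothetical names, and the toy values are illustrative only.

```python
def topk_accuracy(scores, labels, k=1):
    """Fraction of samples whose true label is among the k highest scores.

    scores: list of per-class score lists, one per sample (hypothetical input).
    labels: list of integer class indices, one per sample.
    """
    hits = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores for this sample.
        topk = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in topk
    return hits / len(labels)


# Toy data: three samples, three classes.
scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
labels = [1, 1, 0]
top1 = topk_accuracy(scores, labels, k=1)
top2 = topk_accuracy(scores, labels, k=2)
```

Top-5 accuracy is the same computation with `k=5`, which is why it can never be lower than top-1 on the same predictions.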

__pycache__/model.cpython-310.pyc ADDED

Binary file (397 Bytes).

eval_results.json ADDED

@@ -0,0 +1 @@
+{"top1": 0.5357142686843872, "top5": 0.788690447807312}

liteml_config.yaml ADDED

+QAT:
+  device: "cuda"
+  fully_quantized: True
+  data_quantization:
+    status: On
+    bits: 8
+    custom_bits: {}
+    symmetric: On
+    quantization_mode: static
+    observer: "MovingAverage"
+    per_channel: False
+  weights_quantization:
+    status: On
+    bits: 8
+    custom_bits: {}
+    symmetric: On
+    per_channel: False
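
The config above selects 8-bit, symmetric, per-tensor (`per_channel: False`) quantization with a `MovingAverage` observer for activations. The sketch below illustrates what those settings generally mean; it is not LiteML's implementation, which this diff does not show. All function and class names are hypothetical, and the clamp range of [-127, 127] and the momentum of 0.9 are assumptions.

```python
def symmetric_scale(abs_max, bits=8):
    """Per-tensor scale for symmetric quantization (zero-point fixed at 0)."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits (assumed range [-127, 127])
    return abs_max / qmax if abs_max > 0 else 1.0


def quantize(values, scale, bits=8):
    """Round to the integer grid and clamp to the symmetric range."""
    qmax = 2 ** (bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(q_values, scale):
    """Map integers back to approximate real values."""
    return [q * scale for q in q_values]


class MovingAverageAbsMax:
    """Tracks an exponential moving average of the per-batch absolute maximum.

    In static quantization the observer runs on calibration data only;
    the resulting scale is then frozen for inference.
    """

    def __init__(self, momentum=0.9):  # momentum value is an assumption
        self.momentum = momentum
        self.abs_max = None

    def observe(self, values):
        batch_max = max(abs(v) for v in values)
        if self.abs_max is None:
            self.abs_max = batch_max
        else:
            self.abs_max = (self.momentum * self.abs_max
                            + (1 - self.momentum) * batch_max)
        return self.abs_max


# Calibrate on two toy batches, then quantize with the frozen scale.
observer = MovingAverageAbsMax()
observer.observe([1.0, -0.5])
calibrated_max = observer.observe([2.0, -3.0])
scale = symmetric_scale(calibrated_max)
q = quantize([0.6, -1.2, 10.0], scale)
```

Because the scale is shared by the whole tensor, a single outlier widens the grid for every value, which is one common reason per-tensor W8A8 costs some accuracy relative to per-channel schemes.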

metadata.yaml ADDED

+model_name: Alexnet
+liteml_config: 'liteml_config.yaml'
+quantization_type: W8A8 static
+dataset: ImageNet
+notes: Quantized W8A8 AlexNet model from torchvision
+# TODO: move float accuracy somewhere else
+top1_float: 0.555
+top5_float: 0.777
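
With the float baselines in `metadata.yaml` and the quantized scores in `eval_results.json`, the accuracy change from quantization can be checked directly. The sketch below inlines the values from this commit rather than reading the files; `accuracy_delta` is a hypothetical helper, not part of the repository.

```python
import json

# FP32 baselines, copied from metadata.yaml (top1_float, top5_float).
float_acc = {"top1": 0.555, "top5": 0.777}

# Quantized results, copied verbatim from eval_results.json.
quant_acc = json.loads(
    '{"top1": 0.5357142686843872, "top5": 0.788690447807312}'
)


def accuracy_delta(fp32, quantized):
    """Quantized minus FP32 accuracy per metric; negative means a drop."""
    return {name: quantized[name] - fp32[name] for name in fp32}


delta = accuracy_delta(float_acc, quant_acc)
for name, value in delta.items():
    print(f"{name}: {value:+.4f}")
```

On these numbers top-1 falls by roughly 1.9 points while top-5 is about 1.2 points higher than the float baseline. The diff does not say whether both models were evaluated on the same samples, so the top-5 gain may simply reflect a different or smaller evaluation subset.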

model.py ADDED

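+import torch
+
+# Use the GPU when available; otherwise fall back to the CPU.
+device = "cuda" if torch.cuda.is_available() else "cpu"
+
+
+def get_model():
+    """Return torchvision's ImageNet-pretrained FP32 AlexNet, moved to `device`."""
+    from torchvision.models import alexnet
+    return alexnet(weights='IMAGENET1K_V1').to(device)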

quantized_model.pth ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:9acca062fc682969c778761026f0a92bebf373c3629f86db9e501ce3925ea8d0
+size 244448061