Upload README.md with huggingface_hub
---
tags:
- sentinel-manifold
- quantization
- edge-deployment
- int4
- pytorch
license: mit
---

# 🦴 Sentinel Tiny Text: Sentinel-INT4

**Production-ready quantized model for edge deployment.**

This is the Sentinel-INT4 quantized variant of [Sentinel Tiny Text](https://huggingface.co/5dimension/sentinel-tiny-text).

---

## 📊 Quantization Spec (SDM-v1 §4)

| Property | Value |
|----------|-------|
| **Format** | Sentinel-INT4 |
| **Zero-Point (Z)** | C₁ = -0.007994021806 (attracting fixed point) |
| **Scale (S)** | max\|w\| · (1/e) |
| **Compression** | 8× |
| **Size** | ~6.4 MB |
| **Parent model** | [5dimension/sentinel-tiny-text](https://huggingface.co/5dimension/sentinel-tiny-text) |
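
The 8× figure is consistent with storing each weight in 4 bits instead of a 32-bit float, although this card does not state the baseline precision. The sketch below works through that arithmetic and shows one common way to pack two 4-bit codes into a byte. It assumes an FP32 parent and ignores any per-tensor metadata; `compression_ratio`, `pack_int4`, and `unpack_int4` are hypothetical helper names for illustration, not the layout actually used in `model_int4.pt`.

```python
def compression_ratio(src_bits: int = 32, dst_bits: int = 4) -> float:
    """Ratio of bits per weight before and after quantization (assumed FP32 -> INT4)."""
    return src_bits / dst_bits


def pack_int4(codes: list[int]) -> bytes:
    """Pack signed 4-bit codes (-8..7) two per byte, low nibble first (hypothetical layout)."""
    if any(not -8 <= c <= 7 for c in codes):
        raise ValueError("codes must fit in signed 4 bits (-8..7)")
    nibbles = [c & 0xF for c in codes]  # two's-complement nibble, 0..15
    if len(nibbles) % 2:
        nibbles.append(0)  # pad odd-length input with a zero nibble
    return bytes(lo | (hi << 4) for lo, hi in zip(nibbles[0::2], nibbles[1::2]))


def unpack_int4(packed: bytes, count: int) -> list[int]:
    """Inverse of pack_int4; `count` drops the padding nibble, if any."""
    nibbles = []
    for byte in packed:
        nibbles.extend((byte & 0xF, byte >> 4))
    # Map 0..15 back to the signed range -8..7
    return [n - 16 if n >= 8 else n for n in nibbles[:count]]


codes = [-3, 0, 2, 3, -1]
packed = pack_int4(codes)
assert unpack_int4(packed, len(codes)) == codes
print(compression_ratio(), len(packed))
```

Five codes occupy three bytes here, since the odd trailing code is padded to a full byte.
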

---

## 🧬 Mathematical Foundation

Sentinel quantization uses the attracting fixed point C₁ as the zero-point, minimizing rounding noise by clustering weights around the dynamical attractor.

```
Quantization:   q = round((w - C₁) / S)
Dequantization: w ≈ q · S + C₁
where S = max|w| · (1/e)
```
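
The formulas above can be sketched in a few lines of dependency-free Python. This is a minimal illustration, not the project's actual implementation: the function names `sentinel_scale`, `quantize`, and `dequantize` and the sample weights are assumptions, and the sketch computes one scale over a flat list rather than per tensor or per channel.

```python
import math

C1 = -0.007994021806  # zero-point value taken from the spec table


def sentinel_scale(weights):
    """S = max|w| * (1/e). Hypothetical helper name."""
    scale = max(abs(w) for w in weights) / math.e
    if scale == 0:
        raise ValueError("all-zero weights give a zero scale")
    return scale


def quantize(weights, scale, zero_point=C1):
    """q = round((w - C1) / S) for each weight."""
    return [round((w - zero_point) / scale) for w in weights]


def dequantize(codes, scale, zero_point=C1):
    """w ≈ q * S + C1 for each integer code."""
    return [q * scale + zero_point for q in codes]


weights = [0.5, -0.25, 0.1, -0.5, 0.0]  # made-up sample values
S = sentinel_scale(weights)
codes = quantize(weights, S)
restored = dequantize(codes, S)

# Rounding to the nearest code bounds the per-weight error by S / 2.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, max_error <= S / 2)
```

For this sample the codes fall between -3 and 3, so they fit comfortably in a signed 4-bit range.
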

---

## 🚀 Deployment Targets

| Platform | Compatible |
|----------|-----------|
| Mobile (iOS/Android) | ✅ |
| IoT devices | ✅ |
| Microcontrollers (ARM Cortex-M) | ✅ |
| Browser (ONNX/WebGL) | ✅ |
| Edge TPU | ⚠️ (needs conversion) |

---

## 📦 Download

```python
from huggingface_hub import hf_hub_download

# Download the quantized weights; returns the local path to the cached file
weights_path = hf_hub_download(
    repo_id="5dimension/sentinel-tiny-text-int4",
    filename="model_int4.pt",
)
```

---

## 🔗 Links

- **Parent model**: [5dimension/sentinel-tiny-text](https://huggingface.co/5dimension/sentinel-tiny-text)
- **Main repo**: [sentinel-manifold-discoveries](https://huggingface.co/5dimension/sentinel-manifold-discoveries)
- **Sentinel Hub**: [Interactive demo](https://huggingface.co/spaces/5dimension/sentinel-hub)

---

**License:** MIT | **Edge-ready. Theorem-backed.** 🦴