| --- |
| tags: |
| - sentinel-manifold |
| - quantization |
| - edge-deployment |
| - int4 |
| - pytorch |
| license: mit |
| --- |
| |
| # 𦴠Sentinel Tiny Text β Sentinel-INT4 |
|
|
| **Production-ready quantized model for edge deployment.** |
|
|
| This is the Sentinel-INT4 quantized variant of [Sentinel Tiny Text](https://huggingface.co/5dimension/sentinel-tiny-text). |
|
|
| --- |
|
|
| ## π Quantization Spec (SDM-v1 Β§4) |
|
|
| | Property | Value | |
| |----------|-------| |
| | **Format** | Sentinel-INT4 | |
| | **Zero-Point (Z)** | Cβ = -0.007994021806 (attracting fixed point) | |
| | **Scale (S)** | max\|w\| Β· (1/e) | |
| | **Compression** | 8Γ | |
| | **Size** | ~6.4 MB | |
| | **Parent model** | [5dimension/sentinel-tiny-text](https://huggingface.co/5dimension/sentinel-tiny-text) | |
|
|
| --- |
|
|
| ## 𧬠Mathematical Foundation |
|
|
| The Sentinel quantization uses the attracting fixed point Cβ as the zero-point, minimizing rounding noise by clustering weights around the dynamical attractor. |
|
|
| ``` |
| Quantization: q = round((w - Cβ) / S) |
| Dequantization: w = q Β· S + Cβ |
| Where S = max|w| Β· (1/e) |
| ``` |
|
|
| --- |
|
|
| ## π Deployment Targets |
|
|
| | Platform | Compatible | |
| |----------|-----------| |
| | Mobile (iOS/Android) | β
| |
| | IoT devices | β
| |
| | Microcontrollers (ARM Cortex-M) | β
| |
| | Browser (ONNX/WebGL) | β
| |
| | Edge TPU | β οΈ (needs conversion) | |
|
|
| --- |
|
|
| ## π¦ Download |
|
|
| ```python |
| from huggingface_hub import hf_hub_download |
| |
| # Download quantized weights |
| weights = hf_hub_download(repo_id="5dimension/sentinel-tiny-text-int4", filename="model_int4.pt") |
| ``` |
|
|
| --- |
|
|
| ## π Links |
|
|
| - **Parent model**: [5dimension/sentinel-tiny-text](https://huggingface.co/5dimension/sentinel-tiny-text) |
| - **Main repo**: [sentinel-manifold-discoveries](https://huggingface.co/5dimension/sentinel-manifold-discoveries) |
| - **Sentinel Hub**: [Interactive demo](https://huggingface.co/spaces/5dimension/sentinel-hub) |
|
|
| --- |
|
|
| **License:** MIT | **Edge-ready. Theorem-backed.** 𦴠|
|
|