Slim AAIT-86M main repo and move GGUF artifacts to dedicated repo
Browse files
README.md
CHANGED
|
@@ -60,10 +60,6 @@ Action logit order:
|
|
| 60 |
- anchor head checkpoint
|
| 61 |
- `AAIT-86M.safetensors`
|
| 62 |
- combined self-contained release artifact
|
| 63 |
-
- `AAIT-86M_q8_0.gguf`
|
| 64 |
-
- combined GGUF export, higher-fidelity quantization
|
| 65 |
-
- `AAIT-86M_q5_1.gguf`
|
| 66 |
-
- combined GGUF export, smaller quantization
|
| 67 |
- `config.json`
|
| 68 |
- `load_aait86m.py`
|
| 69 |
- `example_inference.py`
|
|
@@ -110,15 +106,11 @@ The retrieval checkpoint stores the trained projection heads and runtime config
|
|
| 110 |
|
| 111 |
## GGUF Note
|
| 112 |
|
| 113 |
-
|
| 114 |
|
| 115 |
-
|
| 116 |
|
| 117 |
-
-
|
| 118 |
-
- transport
|
| 119 |
-
- custom runtime integration work
|
| 120 |
-
|
| 121 |
-
They are not generic llama.cpp text-model artifacts.
|
| 122 |
|
| 123 |
## Operational Caveats
|
| 124 |
|
|
|
|
| 60 |
- anchor head checkpoint
|
| 61 |
- `AAIT-86M.safetensors`
|
| 62 |
- combined self-contained release artifact
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
- `config.json`
|
| 64 |
- `load_aait86m.py`
|
| 65 |
- `example_inference.py`
|
|
|
|
| 106 |
|
| 107 |
## GGUF Note
|
| 108 |
|
| 109 |
+
GGUF exports for this model live in the separate repository:
|
| 110 |
|
| 111 |
+
- `augmem/AAIT-86M-GGUF`
|
| 112 |
|
| 113 |
+
Those artifacts are quantized exports of the combined `AAIT-86M` package using the custom `triembed` architecture metadata. They are not generic llama.cpp text-model artifacts.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 114 |
|
| 115 |
## Operational Caveats
|
| 116 |
|