gcoderw committed
Commit f415dc1 · verified · 1 Parent(s): 03727c3

Slim AAIT-86M main repo and move GGUF artifacts to dedicated repo

Files changed (1)
  1. README.md +3 -11
README.md CHANGED
@@ -60,10 +60,6 @@ Action logit order:
  - anchor head checkpoint
  - `AAIT-86M.safetensors`
  - combined self-contained release artifact
- - `AAIT-86M_q8_0.gguf`
- - combined GGUF export, higher-fidelity quantization
- - `AAIT-86M_q5_1.gguf`
- - combined GGUF export, smaller quantization
  - `config.json`
  - `load_aait86m.py`
  - `example_inference.py`
@@ -110,15 +106,11 @@ The retrieval checkpoint stores the trained projection heads and runtime config
 
  ## GGUF Note
 
- The GGUF files in this repo are quantized exports of the combined `AAIT-86M` package using the custom `triembed` architecture metadata.
+ GGUF exports for this model live in the separate repository:
 
- They are useful for:
+ - `augmem/AAIT-86M-GGUF`
 
- - compact storage
- - transport
- - custom runtime integration work
-
- They are not generic llama.cpp text-model artifacts.
+ Those artifacts are quantized exports of the combined `AAIT-86M` package using the custom `triembed` architecture metadata. They are not generic llama.cpp text-model artifacts.
 
  ## Operational Caveats
 