Upload model_tools.md
model_tools.md CHANGED (+1 -1)
@@ -40,7 +40,7 @@ Tools to enhance LLM quantizations and merging
 - Then assign the num_experts_per_tok in config.json (or the config.yaml)
 
 # [tokensurgeon.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokensurgeon.py)
-- Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids '
+- Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids 'Potemkin village' fake patches like `gen_id_patcher` and `vocab_id_patcher`.
 
 # [tokeninspector.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokeninspector.py)
 - Audit your tokensurgeon results.