Commit 4b427d3 (verified) by Naphula
1 Parent(s): 1e3f593

Upload model_tools.md

Files changed (1)
  1. model_tools.md +1 -1
model_tools.md CHANGED
@@ -40,7 +40,7 @@ Tools to enhance LLM quantizations and merging
  - Then assign the num_experts_per_tok in config.json (or the config.yaml)
 
  # [tokensurgeon.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokensurgeon.py)
- - Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids 'cardboard town' fake patches like `gen_id_patcher` and `vocab_id_patcher`.
+ - Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids 'Potemkin village' fake patches like `gen_id_patcher` and `vocab_id_patcher`.
 
  # [tokeninspector.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokeninspector.py)
  - Audit your tokensurgeon results.
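
For the `num_experts_per_tok` step that appears as context in this hunk, a minimal Python sketch of editing `config.json` might look like the following. The helper name `set_experts_per_tok` and the example path are hypothetical and not part of model_tools; the sketch assumes a plain JSON config with a top-level `num_experts_per_tok` key.

```python
import json
from pathlib import Path


def set_experts_per_tok(config_path, num_experts):
    """Write num_experts_per_tok into a JSON config and return the updated dict.

    Hypothetical helper for illustration only; it is not part of model_tools.
    """
    if int(num_experts) < 1:
        raise ValueError("num_experts_per_tok must be at least 1")

    path = Path(config_path)
    # Load the existing config so every other key is preserved.
    config = json.loads(path.read_text(encoding="utf-8"))
    config["num_experts_per_tok"] = int(num_experts)
    # Write back with indentation so the file stays human-readable.
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


# Example call (the path is a placeholder, adjust to your model folder):
# set_experts_per_tok("path/to/model/config.json", 4)
```

If the merge uses `config.yaml` instead, the same key would need to be set there by hand or with a YAML library; that variant is not shown here.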