Spaces:

Naphula
/

model_tools

Running

Naphula commited on Mar 6

Commit

4b427d3

verified ·

1 Parent(s): 1e3f593

Upload model_tools.md

Files changed (1) hide show

model_tools.md CHANGED Viewed

@@ -40,7 +40,7 @@ Tools to enhance LLM quantizations and merging
 - Then assign the num_experts_per_tok in config.json (or the config.yaml)
 # [tokensurgeon.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokensurgeon.py)
-- Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids 'cardboard town' fake patches like `gen_id_patcher` and `vocab_id_patcher`.
 # [tokeninspector.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokeninspector.py)
 - Audit your tokensurgeon results.

 - Then assign the num_experts_per_tok in config.json (or the config.yaml)
 # [tokensurgeon.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokensurgeon.py)
+- Uses adaptive VRAM from Grim Jim's `measure.py` like `graph_v18` to prevent OOM. Use recommended [batch file](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fix_tokenizers.bat) here or modify sh. This supposedly avoids 'Potemkin village' fake patches like `gen_id_patcher` and `vocab_id_patcher`.
 # [tokeninspector.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/tokeninspector.py)
 - Audit your tokensurgeon results.