Spaces:

Naphula
/

model_tools

Running

Naphula commited on Dec 14, 2025

Commit

268ead7

verified ·

1 Parent(s): 97948cc

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -10,6 +10,9 @@ pinned: false
 # Model Tools by Naphula
 Tools to enhance LLM quantizations and merging
 # [fp32_to_fp16.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fp32_to_fp16.py)
 - Converts FP32 to FP16 safetensors

 # Model Tools by Naphula
 Tools to enhance LLM quantizations and merging
+# [graph_v4.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/graph_v4.py)
+- Merge models in minutes instead of hours on low VRAM. For a 3060/3060 Ti user: This script enables functionality that is otherwise impossible (merging 70B models or large 7B merges with `--cuda`) without OOM. [More details here](https://huggingface.co/spaces/Naphula/model_tools/blob/main/mergekit_low-VRAM-graph_patch.md)
 # [fp32_to_fp16.py](https://huggingface.co/spaces/Naphula/model_tools/blob/main/fp32_to_fp16.py)
 - Converts FP32 to FP16 safetensors