robertzty committed on
Commit
f044d78
·
verified ·
1 Parent(s): 3b915fa

Upload README.md with huggingface_hub

  1. README.md +14 -4
README.md CHANGED
@@ -17,14 +17,24 @@ GGUF conversion of [nvidia/Cosmos-Reason2-8B](https://huggingface.co/nvidia/Cosm
  - `Cosmos-Reason2-8B-BF16-split-00001-of-00005.gguf` to `...-00005-of-00005.gguf`: Main model (BF16, split into 5 parts)
  - `mmproj-Cosmos-Reason2-8B-BF16.gguf`: Vision encoder (multimodal projector)

+ ## Download
+
+ ```bash
+ huggingface-cli download robertzty/Cosmos-Reason2-8B-GGUF --local-dir ./Cosmos-Reason2-8B-GGUF
+ ```
+
  ## Usage with llama.cpp

+ No need to merge split files - llama.cpp loads them automatically:
+
  ```bash
- # First, merge split files (optional, llama.cpp can load split files directly)
- llama-gguf-split --merge Cosmos-Reason2-8B-BF16-split-00001-of-00005.gguf Cosmos-Reason2-8B-BF16.gguf
+ llama-cli -m ./Cosmos-Reason2-8B-GGUF/Cosmos-Reason2-8B-BF16-split-00001-of-00005.gguf \
+ --mmproj ./Cosmos-Reason2-8B-GGUF/mmproj-Cosmos-Reason2-8B-BF16.gguf -cnv
+ ```

- # Run with vision support
- llama-cli -m Cosmos-Reason2-8B-BF16-split-00001-of-00005.gguf --mmproj mmproj-Cosmos-Reason2-8B-BF16.gguf -cnv
+ Optional: merge into single file:
+ ```bash
+ llama-gguf-split --merge Cosmos-Reason2-8B-BF16-split-00001-of-00005.gguf Cosmos-Reason2-8B-BF16.gguf
  ```

  ## Notes
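
The usage step in this commit points llama.cpp at the first shard and relies on the other shards sitting next to it. A pre-flight check along these lines may help catch an incomplete download before launching `llama-cli`. This is only a sketch, not part of the commit: the helper names `expected_files` and `missing_files` and the `DIR` variable are hypothetical, and the middle shard names (`00002` to `00004`) are assumed to follow the same pattern as the two names shown in the README.

```bash
#!/usr/bin/env bash
# Sketch: check that all five BF16 shards and the projector file are present.
# DIR is assumed to be the --local-dir used in the download step.
DIR="${DIR:-./Cosmos-Reason2-8B-GGUF}"

# Print the expected file names, one per line.
# Assumption: shards 2-4 follow the same naming pattern as shards 1 and 5.
expected_files() {
  local i
  for i in 1 2 3 4 5; do
    printf 'Cosmos-Reason2-8B-BF16-split-%05d-of-00005.gguf\n' "$i"
  done
  printf 'mmproj-Cosmos-Reason2-8B-BF16.gguf\n'
}

# Print every expected file that is missing from the given directory.
missing_files() {
  local dir="$1" f
  expected_files | while IFS= read -r f; do
    [ -f "$dir/$f" ] || printf '%s\n' "$f"
  done
}

missing="$(missing_files "$DIR")"
if [ -n "$missing" ]; then
  printf 'Missing files in %s:\n%s\n' "$DIR" "$missing" >&2
else
  echo "All files present in $DIR"
fi
```

If the check reports missing files, re-running the `huggingface-cli download` command from the Download section should fetch whatever is absent.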