<!-- README_GGUF.md-provided-files end -->

<!-- README_GGUF.md-how-to-download start -->
## How to download GGUF files

**Note for manual downloaders:** You almost never want to clone the entire repo! Multiple different quantisation formats are provided, and most users only want to pick and download a single file.

The following clients/libraries will automatically download models for you, providing a list of available models to choose from:

- LM Studio
- LoLLMS Web UI
- Faraday.dev

### In `text-generation-webui`

Under Download Model, you can enter the model repo: TheBloke/CodeFuse-CodeLlama-34B-GGUF and below it, a specific filename to download, such as: codefuse-codellama-34b.q4_K_M.gguf.

Then click Download.
### On the command line, including multiple files at once

I recommend using the `huggingface-hub` Python library (the quotes stop the shell treating `>=` as a redirect):

```shell
pip3 install 'huggingface-hub>=0.17.1'
```

Then you can download any individual model file to the current directory, at high speed, with a command like this:

```shell
huggingface-cli download TheBloke/CodeFuse-CodeLlama-34B-GGUF codefuse-codellama-34b.q4_K_M.gguf --local-dir . --local-dir-use-symlinks False
```
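If you would rather script the download from Python, the same single-file fetch can be done with `hf_hub_download` from the `huggingface-hub` library installed above. A minimal sketch, using the same repo and filename as the CLI example (`local_dir` behaves like `--local-dir`):

```python
# Minimal sketch: download one GGUF file from Python instead of the CLI.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="TheBloke/CodeFuse-CodeLlama-34B-GGUF",
    filename="codefuse-codellama-34b.q4_K_M.gguf",
    local_dir=".",                 # save into the current directory
    local_dir_use_symlinks=False,  # copy the real file rather than a cache symlink
)
print(model_path)  # path to the downloaded .gguf file
```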
<details>
<summary>More advanced huggingface-cli download usage</summary>

You can also download multiple files at once with a pattern:

```shell
huggingface-cli download TheBloke/CodeFuse-CodeLlama-34B-GGUF --local-dir . --local-dir-use-symlinks False --include='*Q4_K*gguf'
```
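The Python equivalent of a pattern download is `snapshot_download` with `allow_patterns`; a minimal sketch using the same `*Q4_K*gguf` glob as above:

```python
# Minimal sketch: pattern-based download, mirroring --include='*Q4_K*gguf'.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="TheBloke/CodeFuse-CodeLlama-34B-GGUF",
    allow_patterns=["*Q4_K*gguf"],  # only fetch files matching this glob
    local_dir=".",
    local_dir_use_symlinks=False,
)
```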
For more documentation on downloading with `huggingface-cli`, please see: [HF -> Hub Python Library -> Download files -> Download from the CLI](https://huggingface.co/docs/huggingface_hub/guides/download#download-from-the-cli).

To accelerate downloads on fast connections (1Gbit/s or higher), install `hf_transfer`:

```shell
pip3 install hf_transfer
```

And set the environment variable `HF_HUB_ENABLE_HF_TRANSFER` to `1`:

```shell
HF_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download TheBloke/CodeFuse-CodeLlama-34B-GGUF codefuse-codellama-34b.q4_K_M.gguf --local-dir . --local-dir-use-symlinks False
```

Windows CLI users: use `set HF_HUB_ENABLE_HF_TRANSFER=1` before running the download command.
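The same switch works from Python, provided the variable is set before `huggingface_hub` is imported (it is read once at import time). A minimal sketch, assuming `hf_transfer` is installed as above:

```python
# Minimal sketch: enable hf_transfer for Python downloads.
import os
os.environ["HF_HUB_ENABLE_HF_TRANSFER"] = "1"  # must be set before the import below

from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="TheBloke/CodeFuse-CodeLlama-34B-GGUF",
    filename="codefuse-codellama-34b.q4_K_M.gguf",
    local_dir=".",
)
```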
</details>
<!-- README_GGUF.md-how-to-download end -->

<!-- README_GGUF.md-how-to-run start -->
## Example `llama.cpp` command