Upload folder using huggingface_hub

Browse files

Files changed (14) hide show

.gitattributes +12 -0
README.md +95 -0
vistral-7b-chat-Q2_K.gguf +3 -0
vistral-7b-chat-Q3_K_L.gguf +3 -0
vistral-7b-chat-Q3_K_M.gguf +3 -0
vistral-7b-chat-Q3_K_S.gguf +3 -0
vistral-7b-chat-Q4_0.gguf +3 -0
vistral-7b-chat-Q4_K_M.gguf +3 -0
vistral-7b-chat-Q4_K_S.gguf +3 -0
vistral-7b-chat-Q5_0.gguf +3 -0
vistral-7b-chat-Q5_K_M.gguf +3 -0
vistral-7b-chat-Q5_K_S.gguf +3 -0
vistral-7b-chat-Q6_K.gguf +3 -0
vistral-7b-chat-Q8_0.gguf +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+vistral-7b-chat-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,95 @@

+---
+language:
+- vi
+library_name: transformers
+tags:
+- LLMs
+- NLP
+- Vietnamese
+- Large Language Models
+- TensorBlock
+- GGUF
+license: afl-3.0
+extra_gated_prompt: You agree not to use the model for experiments that could harm
+  human subjects.
+extra_gated_fields:
+  Name: text
+  Email: text
+  Affiliation: text
+  Country: text
+  I agree to the LICENSE of this model: checkbox
+base_model: minhtt/vistral-7b-chat
+---
+<div style="width: auto; margin-left: auto; margin-right: auto">
+<img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<div style="display: flex; justify-content: space-between; width: 100%;">
+    <div style="display: flex; flex-direction: column; align-items: flex-start;">
+        <p style="margin-top: 0.5em; margin-bottom: 0em;">
+            Feedback and support: TensorBlock's  <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
+        </p>
+    </div>
+</div>
+## minhtt/vistral-7b-chat - GGUF
+This repo contains GGUF format model files for [minhtt/vistral-7b-chat](https://huggingface.co/minhtt/vistral-7b-chat).
+The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4242](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
+<div style="text-align: left; margin: 20px 0;">
+    <a href="https://tensorblock.co/waitlist/client" style="display: inline-block; padding: 10px 20px; background-color: #007bff; color: white; text-decoration: none; border-radius: 5px; font-weight: bold;">
+        Run them on the TensorBlock client using your local machine ↗
+    </a>
+</div>
+## Prompt template
+```
+<s>[INST] <<SYS>>
+{system_prompt}
+<</SYS>>
+{prompt} [/INST]
+```
+## Model file specification
+| Filename | Quant type | File Size | Description |
+| -------- | ---------- | --------- | ----------- |
+| [vistral-7b-chat-Q2_K.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q2_K.gguf) | Q2_K | 2.749 GB | smallest, significant quality loss - not recommended for most purposes |
+| [vistral-7b-chat-Q3_K_S.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q3_K_S.gguf) | Q3_K_S | 3.197 GB | very small, high quality loss |
+| [vistral-7b-chat-Q3_K_M.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q3_K_M.gguf) | Q3_K_M | 3.552 GB | very small, high quality loss |
+| [vistral-7b-chat-Q3_K_L.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q3_K_L.gguf) | Q3_K_L | 3.855 GB | small, substantial quality loss |
+| [vistral-7b-chat-Q4_0.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q4_0.gguf) | Q4_0 | 4.145 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
+| [vistral-7b-chat-Q4_K_S.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q4_K_S.gguf) | Q4_K_S | 4.177 GB | small, greater quality loss |
+| [vistral-7b-chat-Q4_K_M.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q4_K_M.gguf) | Q4_K_M | 4.405 GB | medium, balanced quality - recommended |
+| [vistral-7b-chat-Q5_0.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q5_0.gguf) | Q5_0 | 5.037 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
+| [vistral-7b-chat-Q5_K_S.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q5_K_S.gguf) | Q5_K_S | 5.037 GB | large, low quality loss - recommended |
+| [vistral-7b-chat-Q5_K_M.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q5_K_M.gguf) | Q5_K_M | 5.171 GB | large, very low quality loss - recommended |
+| [vistral-7b-chat-Q6_K.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q6_K.gguf) | Q6_K | 5.985 GB | very large, extremely low quality loss |
+| [vistral-7b-chat-Q8_0.gguf](https://huggingface.co/tensorblock/vistral-7b-chat-GGUF/blob/main/vistral-7b-chat-Q8_0.gguf) | Q8_0 | 7.751 GB | very large, extremely low quality loss - not recommended |
+## Downloading instruction
+### Command line
+Firstly, install Huggingface Client
+```shell
+pip install -U "huggingface_hub[cli]"
+```
+Then, downoad the individual model file the a local directory
+```shell
+huggingface-cli download tensorblock/vistral-7b-chat-GGUF --include "vistral-7b-chat-Q2_K.gguf" --local-dir MY_LOCAL_DIR
+```
+If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
+```shell
+huggingface-cli download tensorblock/vistral-7b-chat-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
+```

vistral-7b-chat-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a56b502cc1f89018c81c2bc6380efad8b24d3dca88d0ce2bd347410d6862ca28
+size 2749351168

vistral-7b-chat-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c3508439970e6e75bcb8c055ec297c6f6824670a18ddd2ed4f84088c1f010f6
+size 3854783136

vistral-7b-chat-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cfda8b90b15a36ab19ce9cbfbc2a54786947d063086706ae2defc7a5421481b7
+size 3551744672

vistral-7b-chat-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:906f3b9526346d17dde8e8ab30bbe98b25c2fc820046bf8d5f0f3954bb4f8d21
+size 3197325984

vistral-7b-chat-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42db081ec5ce834cfe26f1f4eb6b3823acca9e2371bafd6bd9c35205b01caa79
+size 4145139904

vistral-7b-chat-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5f3bf118813e034ba64b04cc5a141cdc7ba58ecc7eb4b4da55e4494ea88011a3
+size 4404662464

vistral-7b-chat-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7200acc782806934080b97d6599a68c1a8a902dda8e2e2c9262a2608666e442e
+size 4176597184

vistral-7b-chat-Q5_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0ca6b192964fd8cfada73a0c43d1f2adfc9e2b97424d131cf9ace8810011805c
+size 5037200064

vistral-7b-chat-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9bc8fe18d6064ecda8796af7a2891d5ddba5e642bbed1dab7f5182f426a228e2
+size 5170893504

vistral-7b-chat-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3a9b8757d922588cccc715ec41db15df2f4407287d294c28504d50b60e9552e
+size 5037200064

vistral-7b-chat-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:819a2189c41c06e738d5122b7ba5d1140e1175bd00601b3596c063c8ce88f612
+size 5985013984

vistral-7b-chat-Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9dff17d61c4ec4b66a39e98dd98b698e42d834d835adbce8624744bd280961cf
+size 7751442592