alexfida committed
Commit cf18787 · 0 parents

Initial commit
.gitattributes ADDED
@@ -0,0 +1,41 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+ T-pro-it-2.1-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
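The patterns above use gitignore-style globbing to route matching files through Git LFS. As a rough sketch, Python's `fnmatch` approximates this matching for bare filenames (it does not model git's full wildmatch semantics, e.g. the `saved_model/**/*` path pattern); the function name is illustrative:

```python
from fnmatch import fnmatch

# A few of the LFS patterns declared in the .gitattributes above.
# fnmatch only approximates git's wildmatch; path patterns like
# "saved_model/**/*" are deliberately left out of this sketch.
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*tfevents*",
    "T-pro-it-2.1-Q8_0.gguf",
]

def tracked_by_lfs(filename: str) -> bool:
    """Return True if a bare filename matches any declared LFS pattern."""
    return any(fnmatch(filename, pattern) for pattern in LFS_PATTERNS)

print(tracked_by_lfs("model.safetensors"))        # True
print(tracked_by_lfs("events.out.tfevents.123"))  # True
print(tracked_by_lfs("README.md"))                # False
```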
README.md ADDED
@@ -0,0 +1,78 @@
+ ---
+ language:
+ - en
+ base_model: t-tech/T-pro-it-2.1
+ tags:
+ - llama-cpp
+ - gguf
+ license: apache-2.0
+ ---
+
+ # T-pro-it-2.1-GGUF
+
+ **🚨 Users are advised to exercise caution and are responsible for any additional training and oversight required to ensure the model's responses meet acceptable ethical and safety standards. The responsibility for incorporating this model into industrial or commercial solutions lies entirely with those who choose to deploy it.**
+
+ This repository contains **T-pro-it-2.1** converted to the **GGUF** format with
+ [llama.cpp](https://github.com/ggerganov/llama.cpp).
+ See the original BF16 model here: [t-tech/T-pro-it-2.1](https://huggingface.co/t-tech/T-pro-it-2.1).
+
+ ## Description
+ T-pro-it-2.1 is an efficient Russian-language model built on the Qwen 3 model family, with improved instruction-following and tool-calling capabilities compared to [T-pro-it-2.0](https://huggingface.co/t-tech/T-pro-it-2.0).
+ It outperforms Qwen3-32B in tool-calling scenarios, which is essential for agentic applications, and is built for both general tasks and complex workflows.
+
+ **NOTE: This model supports only non-thinking mode and does not generate `<think></think>` blocks in its output. Specifying `enable_thinking=False` is no longer required.**
+
+ ## 📊 Benchmarks
+
+ |                     | Ru Arena Hard | ruIFeval* | ruBFCL |
+ |---------------------|---------------|-----------|--------|
+ | T-pro-it-2.1        | 93.8          | 80.7      | 66.0   |
+ | T-pro-it-2.1-Q8_0   | 94.2          | 80.8      | 65.8   |
+ | T-pro-it-2.1-Q6_K   | 93.4          | 80.0      | 65.9   |
+ | T-pro-it-2.1-Q5_K_M | 92.7          | 81.4      | 65.7   |
+ | T-pro-it-2.1-Q5_K_S | 92.3          | 80.4      | 65.2   |
+ | T-pro-it-2.1-Q5_0   | 93.8          | 79.9      | 64.8   |
+ | T-pro-it-2.1-Q4_K_M | 92.6          | 80.7      | 64.8   |
+
+ \* The ruIFeval score is the mean of four values: prompt-level and instruction-level accuracy, each under strict and loose evaluation.
+
+ > **Recommendation:** choose the **highest-quality quantisation that fits your hardware** (VRAM / RAM).
+
+ | Filename                   | Quant method | Bits | Size (GB) |
+ |----------------------------|--------------|------|-----------|
+ | `T-pro-it-2.1-Q8_0.gguf`   | Q8_0         | 8    | 34.8      |
+ | `T-pro-it-2.1-Q6_K.gguf`   | Q6_K         | 6    | 26.9      |
+ | `T-pro-it-2.1-Q5_K_M.gguf` | Q5_K_M       | 5    | 23.2      |
+ | `T-pro-it-2.1-Q5_K_S.gguf` | Q5_K_S       | 5    | 22.6      |
+ | `T-pro-it-2.1-Q5_0.gguf`   | Q5_0         | 5    | 22.6      |
+ | `T-pro-it-2.1-Q4_K_M.gguf` | Q4_K_M       | 4    | 19.8      |
+
+ *Size figures assume **no GPU off-loading**. Off-loading lowers RAM usage and uses VRAM instead.*
+
+ ## Quickstart
+
+ ### llama.cpp
+
+ Check out our [llama.cpp documentation](https://qwen.readthedocs.io/en/latest/run_locally/llama.cpp.html) for a detailed usage guide.
+
+ We advise you to clone [`llama.cpp`](https://github.com/ggerganov/llama.cpp) and build it following the official guide; we track its latest version.
+ The following demonstration assumes you are running commands from the `llama.cpp` repository directory.
+
+ ```shell
+ ./llama-cli -hf t-tech/T-pro-it-2.1-GGUF:Q8_0 --jinja --color -ngl 99 -fa -sm row --temp 0.6 --presence-penalty 1.0 -c 40960 -n 32768 --no-context-shift
+ ```
+
+ ### ollama
+
+ Check out our [ollama documentation](https://qwen.readthedocs.io/en/latest/run_locally/ollama.html) for a detailed usage guide.
+
+ You can run T-pro-it-2.1 with one command:
+
+ ```shell
+ ollama run t-tech/T-pro-it-2.1:q8_0
+ ```
+
+ See also the [t-tech ollama homepage](https://ollama.com/t-tech/T-pro-it-2.1).
T-pro-it-2.1-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b19e88cc1154b4b0c99f09bc7b57b8e5f3a6af4135581a3bfd31a2518ed58714
+ size 19761766048
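The three lines above are the complete contents of a Git LFS v1 pointer file: the spec version, the SHA-256 object ID, and the size of the real file in bytes. A minimal sketch of parsing one (the pointer text is embedded for illustration):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS v1 pointer into a dict of its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

# The Q4_K_M pointer shown above.
pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b19e88cc1154b4b0c99f09bc7b57b8e5f3a6af4135581a3bfd31a2518ed58714
size 19761766048
"""

p = parse_lfs_pointer(pointer)
print(p["oid"])
print(round(int(p["size"]) / 1e9, 1))  # 19.8 -- matches the README size table
```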
T-pro-it-2.1-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:06ba111467a7c0bd1a1b71b6e749566b87e658a6fcf6d5edf5ec0181c6add62f
+ size 22634951968
T-pro-it-2.1-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5c93325a18deb28e9842c877fdb7f453e63829bae57b0603157fdc82afbfd659
+ size 23214290208
T-pro-it-2.1-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:edf7a7c65e55d2963093074d108dea86109f05b659bf0ae01e809fb28fb20335
+ size 22634951968
T-pro-it-2.1-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4e6d7215efe7c0c9f62a8c4f35e11987c41301459d1888225f96b0e0c626ffb1
+ size 26882597152
T-pro-it-2.1-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:af2b3a761473a95ca31ab9b8647b9dc4288604a4a71f795ffd9402d2938c7621
+ size 34816397344