Update README.md
Browse files
README.md
CHANGED
|
@@ -17,6 +17,8 @@ library_name: transformers
|
|
| 17 |
T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and **adds support for tool-calling capabilities** — a key advancement over [T-lite-it-1.0](https://huggingface.co/t-tech/T-lite-it-1.0), which lacks tool-use support.
|
| 18 |
Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.
|
| 19 |
|
|
|
|
|
|
|
| 20 |
**NOTE: This model supports only non-thinking mode and does not generate `<think></think>` in its output. Meanwhile, specifying `enable_thinking=False` is no longer required.**
|
| 21 |
|
| 22 |
### 📚 Dataset
|
|
@@ -62,6 +64,8 @@ This approach allows fine-grained control over each skill domain and results in
|
|
| 62 |
|
| 63 |
\*\* T-lite-it-1.0 does not support tool calling, therefore tool-calling benchmark metrics are not available
|
| 64 |
|
|
|
|
|
|
|
| 65 |
## Recommended Generation Parameters
|
| 66 |
|
| 67 |
```
|
|
|
|
| 17 |
T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and **adds support for tool-calling capabilities** — a key advancement over [T-lite-it-1.0](https://huggingface.co/t-tech/T-lite-it-1.0), which lacks tool-use support.
|
| 18 |
Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.
|
| 19 |
|
| 20 |
+
More train details in our Habr: https://habr.com/ru/companies/tbank/articles/979650/
|
| 21 |
+
|
| 22 |
**NOTE: This model supports only non-thinking mode and does not generate `<think></think>` in its output. Meanwhile, specifying `enable_thinking=False` is no longer required.**
|
| 23 |
|
| 24 |
### 📚 Dataset
|
|
|
|
| 64 |
|
| 65 |
\*\* T-lite-it-1.0 does not support tool calling, therefore tool-calling benchmark metrics are not available
|
| 66 |
|
| 67 |
+
More benchmarks can be found in our [Habr post](https://habr.com/ru/companies/tbank/articles/979650/).
|
| 68 |
+
|
| 69 |
## Recommended Generation Parameters
|
| 70 |
|
| 71 |
```
|