t-tech/T-lite-it-2.1


taranetsdan committed 12 days ago

Commit d125c97 · verified · 1 Parent(s): 5beea0d

Update README.md

Files changed (1)

README.md +4 -0


@@ -17,6 +17,8 @@ library_name: transformers
 T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and **adds support for tool-calling capabilities** — a key advancement over [T-lite-it-1.0](https://huggingface.co/t-tech/T-lite-it-1.0), which lacks tool-use support.
 Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.
 **NOTE: This model supports only non-thinking mode and does not generate `<think></think>` in its output. Meanwhile, specifying `enable_thinking=False` is no longer required.**
 ### 📚 Dataset
@@ -62,6 +64,8 @@ This approach allows fine-grained control over each skill domain and results in
 \*\* T-lite-it-1.0 does not support tool calling, therefore tool-calling benchmark metrics are not available
 ## Recommended Generation Parameters
 ```

 T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and **adds support for tool-calling capabilities** — a key advancement over [T-lite-it-1.0](https://huggingface.co/t-tech/T-lite-it-1.0), which lacks tool-use support.
 Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.
+More train details in our Habr: https://habr.com/ru/companies/tbank/articles/979650/
 **NOTE: This model supports only non-thinking mode and does not generate `<think></think>` in its output. Meanwhile, specifying `enable_thinking=False` is no longer required.**
 ### 📚 Dataset
 \*\* T-lite-it-1.0 does not support tool calling, therefore tool-calling benchmark metrics are not available
+More benchmarks can be found in our [Habr post](https://habr.com/ru/companies/tbank/articles/979650/).
 ## Recommended Generation Parameters
 ```