Trendyol LLM T1
Collection
Turkish & English reasoning models • 2 items • Updated • 2
How to use Trendyol/Trendyol-LLM-8B-T1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="Trendyol/Trendyol-LLM-8B-T1")
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe(messages) # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Trendyol/Trendyol-LLM-8B-T1")
model = AutoModelForCausalLM.from_pretrained("Trendyol/Trendyol-LLM-8B-T1")
messages = [
{"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
messages,
add_generation_prompt=True,
tokenize=True,
return_dict=True,
return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))How to use Trendyol/Trendyol-LLM-8B-T1 with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Trendyol/Trendyol-LLM-8B-T1"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Trendyol/Trendyol-LLM-8B-T1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker model run hf.co/Trendyol/Trendyol-LLM-8B-T1
How to use Trendyol/Trendyol-LLM-8B-T1 with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "Trendyol/Trendyol-LLM-8B-T1" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Trendyol/Trendyol-LLM-8B-T1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "Trendyol/Trendyol-LLM-8B-T1" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Trendyol/Trendyol-LLM-8B-T1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'How to use Trendyol/Trendyol-LLM-8B-T1 with Docker Model Runner:
docker model run hf.co/Trendyol/Trendyol-LLM-8B-T1
Trendyol LLM-8B-T1 is an 8-billion-parameter chat model built on top of Qwen 3-8B using large-scale Turkish e-commerce datasets curated by Trendyol. The primary goal of the model is to provide advanced reasoning capabilities in Turkish.
Alongside this specialization in Turkish, the base model's strong English capabilities have also been preserved, making it an effective tool in both languages.
/think (explicit reasoning) or /no_think (concise answers). import transformers
import torch
model_id = "Trendyol/Trendyol-LLM-8B-T1"
pipeline = transformers.pipeline(
"text-generation",
model=model_id,
model_kwargs={
"torch_dtype": torch.bfloat16,
"attn_implementation": "flash_attention_2",
"device_map": "auto"
}
)
messages = [
{"role": "system", "content": "Sen yardımsever bir asistansın."},
{"role": "user", "content": "Mona Lisa tablosu hakkında kısa bir özet yazar mısın?"}
]
outputs = pipeline(messages,
return_full_text=False,
max_new_tokens=2048
)
assistant_response = outputs[0]["generated_text"].strip()
print(assistant_response)
/think & /no_think
/think – model emits a <think> … </think> block with its internal reasoning before the final answer. This is default behaviour./no_think – append this directive in the last user turn, if you want to turn off thinking mode.Apache-2.0 – identical to the base Qwen 3-8B.
@misc{trendyolLLM8BT1,
title = {Trendyol LLM 8B T1},
author = {Trendyol LLM & Core NLP Team},
year = {2025},
url = {https://huggingface.co/Trendyol/Trendyol-LLM-8B-T1}
}
Base model
Qwen/Qwen3-8B-Base