Upload README.md with huggingface_hub
README.md (changed)
tags:
- low-resource
- nlp
datasets:
- ogulcanaydogan/Turkish-LLM-v10-Training
pipeline_tag: text-generation
model-index:
- name: Turkish-LLM-14B-Instruct
  results: []
---
An open-source 14.7-billion-parameter language model fine-tuned for native Turkish instruction following, built on Qwen2.5-14B-Instruct with supervised fine-tuning (SFT) on a curated corpus of Turkish-language examples spanning science, history, geography, and general knowledge.

<p align="center">
<a href="https://huggingface.co/spaces/ogulcanaydogan/Turkish-LLM-14B-Chat"><img src="https://img.shields.io/badge/Demo-Live_Chat-blue?style=for-the-badge&logo=huggingface" alt="Demo"></a>
<a href="https://github.com/ogulcanaydogan/Turkish-LLM"><img src="https://img.shields.io/badge/GitHub-Repository-black?style=for-the-badge&logo=github" alt="GitHub"></a>
<a href="https://huggingface.co/datasets/ogulcanaydogan/Turkish-LLM-v10-Training"><img src="https://img.shields.io/badge/Dataset-144K_samples-green?style=for-the-badge&logo=huggingface" alt="Dataset"></a>
<a href="https://huggingface.co/ogulcanaydogan/Turkish-LLM-7B-Instruct"><img src="https://img.shields.io/badge/Also_Available-7B_Model-yellow?style=for-the-badge&logo=huggingface" alt="7B"></a>
</p>

---
This model is part of the **Turkish-LLM** family:

| Model | Parameters | Base | Method | Use Case |
|-------|------------|------|--------|----------|
| **Turkish-LLM-14B-Instruct** (this model) | 14.7B | Qwen2.5-14B-Instruct | SFT | Higher quality, complex reasoning |
| [Turkish-LLM-7B-Instruct](https://huggingface.co/ogulcanaydogan/Turkish-LLM-7B-Instruct) | 7B | Turkcell-LLM-7b-v1 | LoRA | Lightweight, faster inference |

## Training

### Dataset

Training data was sourced from the [Turkish-LLM-v10-Training](https://huggingface.co/datasets/ogulcanaydogan/Turkish-LLM-v10-Training) dataset, a curated collection of **144,000 Turkish instruction-response pairs**, with a focused SFT subset of approximately 2,600 high-quality examples selected for alignment.

| Domain | Examples | Purpose |
|--------|----------|---------|
Raw Turkish Data ──▶ Preprocessing ──▶ SFT Training ──▶ Evaluation

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "ogulcanaydogan/Turkish-LLM-14B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    # ...
)
# ...
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```
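The final `print` in the usage snippet decodes only the newly generated text by slicing the prompt tokens off the output sequence. The indexing idiom can be sanity-checked with toy tensors, with no model download; the token values below are made up for illustration:

```python
import torch

# Toy stand-ins for generate() inputs/outputs: a batch of one sequence,
# five prompt tokens followed by three newly generated tokens.
input_ids = torch.tensor([[101, 102, 103, 104, 105]])         # shape (1, 5)
outputs = torch.tensor([[101, 102, 103, 104, 105, 7, 8, 9]])  # prompt + generation

# outputs[0] is the full token sequence; dropping the first
# input_ids.shape[1] positions leaves only the generated tokens,
# which is what the snippet passes to tokenizer.decode.
generated = outputs[0][input_ids.shape[1]:]
print(generated.tolist())  # [7, 8, 9]
```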
```bash
pip install vllm
vllm serve ogulcanaydogan/Turkish-LLM-14B-Instruct \
  --dtype float16 \
  --max-model-len 4096
```
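`vllm serve` exposes an OpenAI-compatible HTTP API, by default on port 8000. A minimal standard-library client sketch follows; the prompt and `max_tokens` value are illustrative assumptions, and the network call itself is commented out so the snippet runs without a live server:

```python
import json
from urllib.request import Request, urlopen

# Chat-completions payload for the OpenAI-compatible endpoint that
# `vllm serve` exposes at /v1/chat/completions by default.
payload = {
    "model": "ogulcanaydogan/Turkish-LLM-14B-Instruct",
    "messages": [{"role": "user", "content": "Türkiye'nin başkenti neresidir?"}],
    "max_tokens": 128,
}
req = Request(
    "http://localhost:8000/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

# Requires a running `vllm serve` instance:
# with urlopen(req) as resp:
#     reply = json.loads(resp.read())
#     print(reply["choices"][0]["message"]["content"])
print(req.full_url)
```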
### Ollama (Local)

```bash
ollama run hf.co/ogulcanaydogan/Turkish-LLM-14B-Instruct
```

### Chat Template
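The authoritative chat format always comes from `tokenizer.apply_chat_template`. Since Qwen2.5-family models use a ChatML-style template, its structure can be illustrated with a hand-rolled sketch; this is an assumption for illustration only, not a substitute for the tokenizer's own template:

```python
# Hand-rolled ChatML-style prompt, for illustration only. Qwen2.5-family
# templates use <|im_start|> / <|im_end|> markers; in real use, call
# tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True).
def to_chatml(messages):
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages]
    parts.append("<|im_start|>assistant\n")  # generation prompt for the model's reply
    return "".join(parts)

messages = [
    {"role": "system", "content": "Sen yardımsever bir Türkçe asistansın."},
    {"role": "user", "content": "Karadeniz Bölgesi'nin en büyük şehri hangisidir?"},
]
print(to_chatml(messages))
```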
This model is released under Apache 2.0 to support open research and development.

| Resource | Link |
|----------|------|
| 7B Model | [Turkish-LLM-7B-Instruct](https://huggingface.co/ogulcanaydogan/Turkish-LLM-7B-Instruct) |
| Training Dataset (144K) | [Turkish-LLM-v10-Training](https://huggingface.co/datasets/ogulcanaydogan/Turkish-LLM-v10-Training) |
| Live Demo (14B) | [Turkish-LLM-14B-Chat](https://huggingface.co/spaces/ogulcanaydogan/Turkish-LLM-14B-Chat) |
| Live Demo (7B) | [Turkish-LLM-7B-Chat](https://huggingface.co/spaces/ogulcanaydogan/Turkish-LLM-7B-Chat) |
| Training Pipeline | [LowResource-LLM-Forge](https://github.com/ogulcanaydogan/LowResource-LLM-Forge) |
| Project Repository | [Turkish-LLM on GitHub](https://github.com/ogulcanaydogan/Turkish-LLM) |
```bibtex
  author = {Aydogan, Ogulcan},
  year = {2026},
  publisher = {Hugging Face},
  url = {https://huggingface.co/ogulcanaydogan/Turkish-LLM-14B-Instruct}
}
```