Orkhon / bengu

57M English/Turkish bilingual base model in the Turkic branch.

This repository is one member of the Orkhon model family, an auditable from-scratch LLM stack covering tokenizer, pretraining, post-training, evaluation, serving, and Hugging Face export.

Family Context

Member Role HF repo
bumin 4M arithmetic/chat smoke model proving the full Orkhon pipeline. korkmazumut/orkhon-bumin
tonyukuk 22M TinyStories base model trained from scratch with Orkhon. korkmazumut/orkhon-tonyukuk
kultigin 22M story-instruction SFT member of the Orkhon family. korkmazumut/orkhon-kultigin
istemi 51M FineWeb-Edu base model trained on real web text. korkmazumut/orkhon-istemi
bengu current 57M English/Turkish bilingual base model in the Turkic branch. korkmazumut/orkhon-bengu
bengu-gokturk 57M Old Turkic rune-to-Latin transliterator SFT. This is transliteration, not translation. korkmazumut/orkhon-bengu-gokturk

kashgari is intentionally not part of this first upload batch because the local model-zoo entry stores metadata and a re-import hint, not archived weights.

Intended Use

  • Inspecting and reproducing the Orkhon training/export path.
  • Running small local demos on CPU/MPS/CUDA.
  • Comparing Orkhon family members by training stage and data mix.
  • For bengu-gokturk: rune-to-Latin Old Turkic transliteration demos.

Not Intended For

  • Reliable factual QA.
  • Safety-critical decisions.
  • Claims of state-of-the-art Turkish or general LLM performance.
  • Treating Old Turkic transliteration as modern Turkish translation.

Evaluation

Metric Value
loss 3.852317476272583
ppl 47.10209483086865
tokens 163840

Current benchmark reports in the source repo are smoke baselines unless marked otherwise. Do not treat limit-20 benchmark runs as headline capability claims.

Example Prompt

Turkiye'nin baskenti Ankara

Usage

Install Orkhon from the source repo, then load this exported folder:

pip install git+https://github.com/UmutKorkmaz/orkhon
from huggingface_hub import snapshot_download
from orkhon.export.to_hf import load_exported_model
from orkhon.tokenizer import load_tokenizer

path = snapshot_download("korkmazumut/orkhon-bengu")
model, cfg = load_exported_model(path, device="cpu")
tok = load_tokenizer(path)

For chat/instruct members, use orkhon.serve.chat_cli.reply. For base members, use raw completion with orkhon.model.generate.

Files

  • model.safetensors - exported inference weights
  • config.json - Orkhon architecture config
  • tokenizer.json, tokenizer_config.json, special_tokens_map.json
  • manifest.json - model-zoo metadata from the source repo
  • samples.txt - saved local samples when available
  • orkhon_model_card.md - original local model card

Limitations

This is a small research/education model family. The models can repeat, confabulate, and fail outside their narrow data scale. Public demos should keep the claim narrow: Orkhon is an auditable from-scratch stack with a Turkic branch, not a frontier assistant.

Downloads last month
18
Safetensors
Model size
57.4M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support