Instructions to use AksaraLLM/Kiel-Mini-59M-DPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use AksaraLLM/Kiel-Mini-59M-DPO with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="AksaraLLM/Kiel-Mini-59M-DPO")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("AksaraLLM/Kiel-Mini-59M-DPO", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use AksaraLLM/Kiel-Mini-59M-DPO with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "AksaraLLM/Kiel-Mini-59M-DPO" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AksaraLLM/Kiel-Mini-59M-DPO", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/AksaraLLM/Kiel-Mini-59M-DPO
- SGLang
How to use AksaraLLM/Kiel-Mini-59M-DPO with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "AksaraLLM/Kiel-Mini-59M-DPO" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AksaraLLM/Kiel-Mini-59M-DPO", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "AksaraLLM/Kiel-Mini-59M-DPO" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AksaraLLM/Kiel-Mini-59M-DPO", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use AksaraLLM/Kiel-Mini-59M-DPO with Docker Model Runner:
docker model run hf.co/AksaraLLM/Kiel-Mini-59M-DPO
Kiel-Mini-59M-DPO
โ ๏ธ Status: early experiment. This 85M-parameter decoder-only transformer was trained from scratch as part of the early AksaraLLM line. It uses the GPT-2 BPE tokenizer (50257 vocab) which is not optimal for Indonesian, and the training corpus was limited. By standard perplexity it is not a usable Indonesian language model today.
Architecture
| Property | Value |
|---|---|
| Parameters | 85.0M |
| Layers | 8 |
| Heads | 8 |
| Hidden size | 512 |
| FFN size | 2048 |
| Vocabulary | 50257 (GPT-2 BPE) |
| Context length | 128 |
| RMSNorm + RoPE + SwiGLU | yes |
Measured baseline (Devin audit, CPU eval)
- Perplexity (50 ID sentences, GPT-2 tokenizer): 56525 (very high โ model not converged)
- English-stopword ratio in ID-prompted output: 0.6%
- Indonesian-stopword ratio in ID-prompted output: 0.0%
For comparison, the working Indonesian models in this org reach perplexity โ 8โ15 on the same 50-sentence eval set.
Sample for "Indonesia adalah negara":
Indonesia adalah negara coal covetedutterstock Citizensindependencealky mac motive <!-- Megan port Ruff togetDefinitionagamemarkets scars Contribut sort finances SharmaJoe [' quarterbacks698 admiredar
Why the previous "Skor 10/11 Grade S" is misleading
That figure is from a custom 11-question in-house scorecard, not from a standard LM evaluation. Perplexity on plain Indonesian text reveals that this checkpoint cannot model the distribution.
Limitations
- Wrong tokenizer for the language: GPT-2 BPE is optimised for English.
- Severely under-trained at this size + corpus.
- No chat template in tokenizer config; treat as a base LM only.
What to use instead
AksaraLLM/Kiel-Pro-0.5B-v3โ 494M Qwen2-based, PPL โ 15.AksaraLLM/AksaraLLM-Qwen-1.5B-v5-publicโ 1.78B Qwen2-based, PPL โ 8.4.
License
Apache 2.0