SOVYN-300M-Cortex
SOVYN-300M-Cortex is a small Korean conversational model trained from scratch.
It is not a Transformers-compatible checkpoint yet; it uses the custom PyTorch
architecture in src/sovyn.
Current Status
- Parameters: about 300M
- Format: custom PyTorch checkpoint
- Tokenizer: SentencePiece BPE
- Context length: 512 tokens
- Weight dtype in checkpoint: bfloat16
- Main checkpoint:
sovyn_300m_last.pt
This is an early experimental checkpoint. It can handle short Korean dialogue patterns, but it is not a broad knowledge model.
Quick Start
python -m venv .venv
.\.venv\Scripts\python.exe -m pip install torch sentencepiece pyyaml tqdm
.\.venv\Scripts\python.exe scripts\chat.py --checkpoint sovyn_300m_last.pt --tokenizer sovyn.model
Example:
user: ๋ ์ค๋ ํผ๊ณคํด
sovyn: ๋ง์ด ์ง์ณค๊ฒ ๋ค. ์ง๊ธ์ ์ ๊น ์ฌ์ด๋ ๊ด์ฐฎ์.
Ollama-Compatible Local API
This repository includes an Ollama-compatible bridge, but the model is not a native GGUF Ollama model yet.
powershell -ExecutionPolicy Bypass -File scripts\start_ollama_bridge.ps1
Then call:
POST http://127.0.0.1:11435/api/chat
Files
sovyn_300m_last.pt: model checkpointsovyn.model,sovyn.vocab: SentencePiece tokenizerconfig.yaml: model and training configsrc/sovyn: custom PyTorch architecture and formatting/data helpersscripts/chat.py: local chat runnerscripts/ollama_bridge.py: Ollama-compatible local API bridge
Notes
SOVYN-300M-Cortex was trained for short, natural Korean replies. The next major step is converting the architecture to a standard export format or writing a GGUF converter so it can be registered as a native Ollama model.
- Downloads last month
- 10