--- language: - ko library_name: pytorch tags: - sovyn - korean - conversational - causal-lm - from-scratch pipeline_tag: text-generation --- # SOVYN-300M-Cortex SOVYN-300M-Cortex is a small Korean conversational model trained from scratch. It is not a Transformers-compatible checkpoint yet; it uses the custom PyTorch architecture in `src/sovyn`. ## Current Status - Parameters: about 300M - Format: custom PyTorch checkpoint - Tokenizer: SentencePiece BPE - Context length: 512 tokens - Weight dtype in checkpoint: bfloat16 - Main checkpoint: `sovyn_300m_last.pt` This is an early experimental checkpoint. It can handle short Korean dialogue patterns, but it is not a broad knowledge model. ## Quick Start ```powershell python -m venv .venv .\.venv\Scripts\python.exe -m pip install torch sentencepiece pyyaml tqdm .\.venv\Scripts\python.exe scripts\chat.py --checkpoint sovyn_300m_last.pt --tokenizer sovyn.model ``` Example: ```text user: 나 오늘 피곤해 sovyn: 많이 지쳤겠다. 지금은 잠깐 쉬어도 괜찮아. ``` ## Ollama-Compatible Local API This repository includes an Ollama-compatible bridge, but the model is not a native GGUF Ollama model yet. ```powershell powershell -ExecutionPolicy Bypass -File scripts\start_ollama_bridge.ps1 ``` Then call: ```text POST http://127.0.0.1:11435/api/chat ``` ## Files - `sovyn_300m_last.pt`: model checkpoint - `sovyn.model`, `sovyn.vocab`: SentencePiece tokenizer - `config.yaml`: model and training config - `src/sovyn`: custom PyTorch architecture and formatting/data helpers - `scripts/chat.py`: local chat runner - `scripts/ollama_bridge.py`: Ollama-compatible local API bridge ## Notes SOVYN-300M-Cortex was trained for short, natural Korean replies. The next major step is converting the architecture to a standard export format or writing a GGUF converter so it can be registered as a native Ollama model.