Instructions to use zID4si/Zlatorog-12B-Instruct-Beta with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zID4si/Zlatorog-12B-Instruct-Beta with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="zID4si/Zlatorog-12B-Instruct-Beta") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("zID4si/Zlatorog-12B-Instruct-Beta") model = AutoModelForCausalLM.from_pretrained("zID4si/Zlatorog-12B-Instruct-Beta") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use zID4si/Zlatorog-12B-Instruct-Beta with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "zID4si/Zlatorog-12B-Instruct-Beta" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zID4si/Zlatorog-12B-Instruct-Beta", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/zID4si/Zlatorog-12B-Instruct-Beta
- SGLang
How to use zID4si/Zlatorog-12B-Instruct-Beta with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "zID4si/Zlatorog-12B-Instruct-Beta" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zID4si/Zlatorog-12B-Instruct-Beta", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "zID4si/Zlatorog-12B-Instruct-Beta" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zID4si/Zlatorog-12B-Instruct-Beta", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use zID4si/Zlatorog-12B-Instruct-Beta with Docker Model Runner:
docker model run hf.co/zID4si/Zlatorog-12B-Instruct-Beta
Zlatorog-12B-Instruct-Beta
This model is a fine-tuned version of zidsi/MistralNemoCPT6 on the custom mix of SFT datasets.
Model description
This model is based on Mistral Nemo 12B and has undergone multiple refinement stages, including Continual Pretraining (CPT) and Supervised Fine-Tuning (SFT) with an instruction-tuned objective. It is designed for high-quality instruction following, reasoning, and tool-augmented generation tasks. The model has been optimized for both language understanding and function-calling scenarios.
Intended uses & limitations
Research, explore and have fun with Slovenian LLM :)
Training and evaluation data
Bad standard Slovenian benchmarks results but sometimes impresssive "real world" prompt responses :)
Reduced hallucinations rate on "Who is ...?" prompts.
Up to 16k ctx should work OK, for longer contexts training data would be required to improve @ CPT Long stage.
🔧 Tool Usage
This model supports tool-augmented generation, allowing it to invoke external tools via structured function calls based on user prompts. The tool usage follows a special tokenized format, with tool availability declared at the beginning of the prompt and a corresponding tool call generated in the model's output.
📌 Example
Input (Prompt):
<s>[AVAILABLE_TOOLS][{"type": "function", "function": {"name": "poisci_novice", "description": "Orodje poišče današnje novice glede na iskalni niz - ključne besede v besedilu.", "parameters": {"type": "object", "properties": {"iskalni_niz": {"type": "string", "description": "Iskalni niz za iskanje novic."}}, "required": ["iskalni_niz"]}}}][/AVAILABLE_TOOLS][INST]Naredi seznam današnjih novic o velikih jezikovnih modelih.[/INST]
Model Output:
[TOOL_CALLS][{"name": "poisci_novice", "arguments": {"iskalni_niz": "veliki jezikovni modeli"}}]
🛠️ How It Works
- [AVAILABLE_TOOLS]: Declares tools the model can call, formatted as JSON.
- [INST]...[/INST]: Wraps the user instruction or prompt.
- [TOOL_CALLS]: The model responds with structured tool calls matching the declared API.
This format is suitable for integration into multi-tool agents or custom orchestration pipelines where the model determines and delegates external functions.
📤 Tool Call Schema
Tool definitions follow the OpenAI function calling specification, using a JSON Schema to define parameters and their constraints.
License
This model is released under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
- Attribution must be given to the original author.
- NonCommercial use only. Commercial use is not permitted under this license.
- NoDerivatives means you may not share adaptations of the model.
For commercial use, custom development, or to request extended rights, please send an email to info <at> zid.si
More details on the license: https://creativecommons.org/licenses/by-nc-nd/4.0/
💡 Support Future Research
The developer is currently seeking funding to continue research and development on this model and related projects. If you’re interested in supporting or collaborating, please get in touch.
💡 Podprite nadaljnji razvoj
Razvijalec trenutno išče finančno podporo za nadaljevanje raziskav in razvoja tega modela ter sorodnih projektov. Za podporo ali sodelovanje vas vabimo, da vzpostavite stik.
📄 Citation
If you use Zlatorog-12B-Instruct-Beta in your research or applications, please cite it as follows:
BibTeX
@misc{zlatorog12b2025,
title = {Zlatorog-12B-Instruct-Beta: A Slovenian Instruction-Tuned Large Language Model},
author = {Tomaž Savodnik, Zavod za informacijsko družbo (zID)},
year = {2025},
howpublished = {\url{https://huggingface.co/zID4si/Zlatorog-12B-Instruct-Beta}},
note = {Version Beta}
}
APA
Savodnik, T., Zavod za informacijsko družbo (zID). (2025). Zlatorog-12B-Instruct-Beta: A Slovenian Instruction-Tuned Large Language Model [Computer software]. Hugging Face. https://huggingface.co/zID4si/Zlatorog-12B-Instruct-Beta
- Downloads last month
- 5
Model tree for zID4si/Zlatorog-12B-Instruct-Beta
Base model
zidsi/MistralNemoCPT6