How to use from the llama-cpp-python library
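# !pip install llama-cpp-python

from llama_cpp import Llama

# Download the model weights from the Hugging Face Hub and load them.
llm = Llama.from_pretrained(
    repo_id="mags0ft/SmolLM2-360m-German-Instruct",
    # Left empty in the original snippet; llama.cpp needs the name of a GGUF file here.
    filename="",
)

# Send a single-turn chat request and keep the OpenAI-style response dict.
response = llm.create_chat_completion(
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?",
        }
    ]
)

# The generated text is in the first choice's message.
print(response["choices"][0]["message"]["content"])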
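Since the model is tuned for German, a German prompt is the more natural test. The following is a minimal sketch of two small helpers around the chat call: one builds the message list, the other pulls the assistant text out of the response. The names `build_messages` and `extract_reply` are hypothetical and not part of llama-cpp-python; the sketch only assumes the OpenAI-style response dict that `create_chat_completion` returns. The example response is hand-written for illustration and is not real model output.

```python
from typing import Any, Optional


def build_messages(prompt: str, system: Optional[str] = None) -> list[dict[str, str]]:
    """Build an OpenAI-style message list for a single-turn chat request."""
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return messages


def extract_reply(response: dict[str, Any]) -> str:
    """Return the assistant text from a chat-completion response dict."""
    return response["choices"][0]["message"]["content"].strip()


# Hand-written stand-in for a real response (illustration only).
example_response = {
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": " Die Hauptstadt von Frankreich ist Paris. ",
            }
        }
    ]
}

messages = build_messages("Was ist die Hauptstadt von Frankreich?")
reply = extract_reply(example_response)
```

With a loaded model, `llm.create_chat_completion(messages=messages)` would take the place of `example_response`.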

SmolLM2-360m-German-Instruct


This model is a continued pre-train and instruct fine-tune of SmolLM2 360m, done with Unsloth to make the model capable of speaking German. It was trained on 25% of the German Wikipedia and on the full German (translated) version of the Alpaca-GPT4 dataset.
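To illustrate what instruct fine-tuning data of this kind can look like, here is a minimal sketch that turns one Alpaca-style record into chat turns. It assumes the field names of the original Alpaca format (`instruction`, `input`, `output`); the schema of the German dataset and the prompt template actually used for this model are not stated here, so `alpaca_to_messages` is a hypothetical helper and not the author's pipeline. The sample record is made up for illustration.

```python
def alpaca_to_messages(record: dict[str, str]) -> list[dict[str, str]]:
    """Convert one Alpaca-style record into a user turn and an assistant turn."""
    user_text = record["instruction"].strip()
    extra = record.get("input", "").strip()
    if extra:
        # Append the optional input below the instruction.
        user_text = f"{user_text}\n\n{extra}"
    return [
        {"role": "user", "content": user_text},
        {"role": "assistant", "content": record["output"].strip()},
    ]


# Made-up record for illustration; not taken from the dataset.
sample = {
    "instruction": "Übersetze den folgenden Satz ins Englische.",
    "input": "Ich lerne Deutsch.",
    "output": "I am learning German.",
}

chat = alpaca_to_messages(sample)
```

Records without an `input` field, or with an empty one, yield a user turn that contains only the instruction.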

Even though a lot of training has been done, this is still a tiny model and is severely limited by its small size. Expect many hallucinations and do not use it in a demanding production workflow.


Cite as

@misc{smollm2germaninstruct,
  author       = {Magnus Leonard Schlinsog},
  title        = {Enhancing Foreign Language Proficiency in SmolLM2-360M via Continued Pretraining and Instruction Fine-Tuning},
  year         = {2025},
  url          = {https://huggingface.co/mags0ft/SmolLM2-360m-German-Instruct},
}

Format: Safetensors. Model size: 0.4B params. Tensor type: BF16.