Csermely (MLX)

MLX version of Csermely — a 190M parameter Hungarian language model optimized for Apple Silicon. Part of the Emese model family.

This is the native MLX bfloat16 checkpoint. For the HuggingFace transformers version, see emese-tech/csermely.

Model Details


Version	0.2
Parameters	190.2M
Architecture	LLaMA-style (decoder-only transformer)
Context length	4,096 tokens (YaRN RoPE, 4× factor)
Training context	1,024 tokens
Precision	bfloat16
Vocabulary	32,000 (SentencePiece Unigram, Hungarian)
Training data	~2B tokens of Hungarian text
Framework	MLX (Apple Silicon)
License	MIT

import mlx.core as mx
from model import Emese, ModelConfig

config = ModelConfig()
model = Emese(config)
model.load_weights("model.safetensors")

Safetensors

Model size

0.2B params

Tensor type

BF16

MLX

Hardware compatibility

Quantized