Sleeping
Agents
MariChatmen Demo
💬
Selected 4B MariChatmen LoRA demo.
None defined yet.
MariChatmen is an experimental LLM post-training project for building an always-Andalûh, Sevillian-leaning chat assistant.
The project explores how to adapt open language models to answer in Andalûh / Andalusian Spanish using a staged pipeline:
Andalûh adaptive pretraining → SFT → ORPO → GRPO → evaluation → release
andaluh-py for rule-based Andalûh transformation.| Resource | Purpose |
|---|---|
| MariChatmen Space | Interactive Gradio demo |
MariChatmen-*-LoRA |
LoRA / QLoRA model adapters |
MariChatmen-Andaluh-Data |
Dataset samples, benchmarks, and metadata |
MariChatmen |
Training code, evaluation scripts, and reports |
This is an active research-engineering project. Early runs validated the pipeline, but final model quality is still being improved through better data, persona tuning, ORPO preference pairs, and GRPO reward design.
Technical writeups and progress notes: antoniolobo.com/blog