---
title: MariChatmen
emoji: 🟢
colorFrom: green
colorTo: yellow
sdk: static
pinned: false
---

# MariChatmen

**MariChatmen** is an experimental LLM post-training project for building an always-Andalûh, Sevillian-leaning chat assistant.

The project explores how to adapt open language models to answer in **Andalûh / Andalusian Spanish** using a staged pipeline:

```text
Andalûh adaptive pretraining → SFT → ORPO → GRPO → evaluation → release
```

## Project goals

* Train models that answer in Andalûh, even when prompted in standard Spanish.
* Build a fictional Sevillian persona: **MariChatmen / MariCarmen**.
* Use `andaluh-py` for rule-based Andalûh transformation.
* Evaluate accent, persona, usefulness, and semantic stability with custom metrics.
* Release reproducible model, data, demo, and training artifacts.

## Repositories

| Resource                                                              | Purpose                                        |
| --------------------------------------------------------------------- | ---------------------------------------------- |
| [MariChatmen Space](https://huggingface.co/spaces/alobos/MariChatmen) | Interactive Gradio demo                        |
| `MariChatmen-*-LoRA`                                                  | LoRA / QLoRA model adapters                    |
| `MariChatmen-Andaluh-Data`                                            | Dataset samples, benchmarks, and metadata      |
| `MariChatmen`                                                         | Training code, evaluation scripts, and reports |

## Status

This is an active research-engineering project. Early runs validated the pipeline, but final model quality is still being improved through better data, persona tuning, ORPO preference pairs, and GRPO reward design.

## Blog

Technical writeups and progress notes: [antoniolobo.com/blog](https://antoniolobo.com/blog)
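
## Example: rule-based Andalûh transformation

The project goals mention `andaluh-py` for rule-based transformation. The snippet below is a minimal sketch of how that step might look when applied to a single training pair, not the project's actual data pipeline. It assumes the `andaluh` package is installed and uses its `epa()` function; the helper names `to_andaluh` and `convert_example` and the `prompt` / `response` field names are hypothetical and chosen only for illustration.

```python
from typing import Dict


def to_andaluh(text: str) -> str:
    """Transliterate standard Spanish text with andaluh-py's EPA rules.

    Assumption: the package is installed (`pip install andaluh`). If it is
    missing, the text is returned unchanged so this sketch still runs.
    """
    try:
        import andaluh  # exposes andaluh.epa(text)
    except ImportError:
        return text
    return andaluh.epa(text)


def convert_example(example: Dict[str, str]) -> Dict[str, str]:
    """Rewrite only the assistant side of a hypothetical prompt/response pair.

    The prompt stays in standard Spanish, matching the goal of answering in
    Andalûh even when the user writes standard Spanish.
    """
    return {**example, "response": to_andaluh(example["response"])}


if __name__ == "__main__":
    sample = {
        "prompt": "¿Dónde está la Giralda?",
        "response": "La Giralda está en el centro de Sevilla.",
    }
    print(convert_example(sample)["response"])
```

Leaving the prompt untouched and transforming only the response is one possible design choice here; a real pipeline would likely fail loudly when the library is missing instead of falling back silently.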