Omegus / README.md
TOKETTER's picture
Document first Omegus adapter run
d259a1b verified
---
base_model: HuggingFaceTB/SmolLM2-135M-Instruct
library_name: peft
pipeline_tag: text-generation
language:
- es
tags:
- lora
- sft
- spanish
- agent
- technical-assistant
license: apache-2.0
---
# Omegus
Omegus is a Spanish technical chatbot model package for the Charlie / Omega agent architecture.
It is designed to respond as a precise technical assistant with progressive status reporting, software architecture judgment, and clear explanations of the Omega framework.
This repository currently contains a first local demo LoRA adapter. It is intentionally small and should be treated as a prototype checkpoint, not a production-quality assistant yet.
Source material:
- `../Charlie-Skill.md`
- `../spec_maestra_framework_unificado_v0.3.md`
The published adapter fine-tunes `HuggingFaceTB/SmolLM2-135M-Instruct` with LoRA using a compact chat dataset in `data/charlie_omega_sft.jsonl`.
## Training Run
First published adapter:
- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`
- Method: LoRA SFT
- Local hardware: Apple Silicon MPS
- Dataset size: 20 chat examples
- Epochs: 1
- Train loss: 3.502
- Eval loss: 3.588
Hugging Face Jobs training was attempted, but the account did not have enough prepaid credit balance at the time. The current adapter was trained locally instead.
## Intended Use
- Spanish technical chatbot
- Software architecture and code-review assistant behavior
- Omega framework explanation and synthesis
- Prototype agent-persona research
## Local Dry Run
From this folder:
```bash
uv run train_sft.py
```
This trains locally if your machine has the needed compute.
## Push To Hugging Face
The default Hub target is `TOKETTER/Omegus`. With a logged-in Hugging Face session:
```bash
export HUB_MODEL_ID="TOKETTER/Omegus"
uv run train_sft.py
```
The script pushes LoRA adapter/checkpoints to the Hub when `HUB_MODEL_ID` is set.
## Quick Load
```python
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer
repo = "TOKETTER/Omegus"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoPeftModelForCausalLM.from_pretrained(repo)
```
## Recommended Cloud Job Shape
Default Hub target:
```bash
TOKETTER/Omegus
```
For a cheap demo on Hugging Face Jobs:
- Flavor: `t4-small` or similar low-cost GPU
- Timeout: `1h`
- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`
For a better small assistant:
- Flavor: `a10g-large`
- Timeout: `2h`
- Increase dataset size before training
## Next Dataset Upgrade
The included dataset is intentionally small so the training pipeline is easy to inspect. The next quality step is to expand it into 200-500 instruction examples extracted from the two source docs, with separate examples for:
- Charlie activation and progressive logging
- Code review and bug triage behavior
- Ω framework explanations
- Ω6 functional consciousness caveats
- Mathematical definitions and architecture summaries