TOKETTER
/

Omegus

Text Generation

technical-assistant

Model card Files Files and versions

Omegus / README.md

TOKETTER's picture

Document first Omegus adapter run

d259a1b verified 26 days ago

|

history blame contribute delete

2.93 kB

	---
	base_model: HuggingFaceTB/SmolLM2-135M-Instruct
	library_name: peft
	pipeline_tag: text-generation
	language:
	- es
	tags:
	- lora
	- sft
	- spanish
	- agent
	- technical-assistant
	license: apache-2.0
	---

	# Omegus

	Omegus is a Spanish technical chatbot model package for the Charlie / Omega agent architecture.

	It is designed to respond as a precise technical assistant with progressive status reporting, software architecture judgment, and clear explanations of the Omega framework.

	This repository currently contains a first local demo LoRA adapter. It is intentionally small and should be treated as a prototype checkpoint, not a production-quality assistant yet.

	Source material:

	- `../Charlie-Skill.md`
	- `../spec_maestra_framework_unificado_v0.3.md`

	The published adapter fine-tunes `HuggingFaceTB/SmolLM2-135M-Instruct` with LoRA using a compact chat dataset in `data/charlie_omega_sft.jsonl`.

	## Training Run

	First published adapter:

	- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`
	- Method: LoRA SFT
	- Local hardware: Apple Silicon MPS
	- Dataset size: 20 chat examples
	- Epochs: 1
	- Train loss: 3.502
	- Eval loss: 3.588

	Hugging Face Jobs training was attempted, but the account did not have enough prepaid credit balance at the time. The current adapter was trained locally instead.

	## Intended Use

	- Spanish technical chatbot
	- Software architecture and code-review assistant behavior
	- Omega framework explanation and synthesis
	- Prototype agent-persona research

	## Local Dry Run

	From this folder:

	```bash
	uv run train_sft.py
	```

	This trains locally if your machine has the needed compute.

	## Push To Hugging Face

	The default Hub target is `TOKETTER/Omegus`. With a logged-in Hugging Face session:

	```bash
	export HUB_MODEL_ID="TOKETTER/Omegus"
	uv run train_sft.py
	```

	The script pushes LoRA adapter/checkpoints to the Hub when `HUB_MODEL_ID` is set.

	## Quick Load

	```python
	from peft import AutoPeftModelForCausalLM
	from transformers import AutoTokenizer

	repo = "TOKETTER/Omegus"
	tokenizer = AutoTokenizer.from_pretrained(repo)
	model = AutoPeftModelForCausalLM.from_pretrained(repo)
	```

	## Recommended Cloud Job Shape

	Default Hub target:

	```bash
	TOKETTER/Omegus
	```

	For a cheap demo on Hugging Face Jobs:

	- Flavor: `t4-small` or similar low-cost GPU
	- Timeout: `1h`
	- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`

	For a better small assistant:

	- Flavor: `a10g-large`
	- Timeout: `2h`
	- Increase dataset size before training

	## Next Dataset Upgrade

	The included dataset is intentionally small so the training pipeline is easy to inspect. The next quality step is to expand it into 200-500 instruction examples extracted from the two source docs, with separate examples for:

	- Charlie activation and progressive logging
	- Code review and bug triage behavior
	- Ω framework explanations
	- Ω6 functional consciousness caveats
	- Mathematical definitions and architecture summaries