File size: 2,934 Bytes
be34e96
202db76
be34e96
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d259a1b
 
be34e96
 
 
 
 
202db76
be34e96
d259a1b
 
 
 
 
 
 
 
 
 
 
 
 
 
be34e96
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d259a1b
 
 
 
 
 
 
 
 
 
 
be34e96
 
 
 
 
 
 
 
 
 
 
 
202db76
be34e96
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
---
base_model: HuggingFaceTB/SmolLM2-135M-Instruct
library_name: peft
pipeline_tag: text-generation
language:
  - es
tags:
  - lora
  - sft
  - spanish
  - agent
  - technical-assistant
license: apache-2.0
---

# Omegus

Omegus is a Spanish technical chatbot model package for the Charlie / Omega agent architecture.

It is designed to respond as a precise technical assistant with progressive status reporting, software architecture judgment, and clear explanations of the Omega framework.

This repository currently contains a first local demo LoRA adapter. It is intentionally small and should be treated as a prototype checkpoint, not a production-quality assistant yet.

Source material:

- `../Charlie-Skill.md`
- `../spec_maestra_framework_unificado_v0.3.md`

The published adapter fine-tunes `HuggingFaceTB/SmolLM2-135M-Instruct` with LoRA using a compact chat dataset in `data/charlie_omega_sft.jsonl`.

## Training Run

First published adapter:

- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`
- Method: LoRA SFT
- Local hardware: Apple Silicon MPS
- Dataset size: 20 chat examples
- Epochs: 1
- Train loss: 3.502
- Eval loss: 3.588

Hugging Face Jobs training was attempted, but the account did not have enough prepaid credit balance at the time. The current adapter was trained locally instead.

## Intended Use

- Spanish technical chatbot
- Software architecture and code-review assistant behavior
- Omega framework explanation and synthesis
- Prototype agent-persona research

## Local Dry Run

From this folder:

```bash
uv run train_sft.py
```

This trains locally if your machine has the needed compute.

## Push To Hugging Face

The default Hub target is `TOKETTER/Omegus`. With a logged-in Hugging Face session:

```bash
export HUB_MODEL_ID="TOKETTER/Omegus"
uv run train_sft.py
```

The script pushes LoRA adapter/checkpoints to the Hub when `HUB_MODEL_ID` is set.

## Quick Load

```python
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

repo = "TOKETTER/Omegus"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoPeftModelForCausalLM.from_pretrained(repo)
```

## Recommended Cloud Job Shape

Default Hub target:

```bash
TOKETTER/Omegus
```

For a cheap demo on Hugging Face Jobs:

- Flavor: `t4-small` or similar low-cost GPU
- Timeout: `1h`
- Base model: `HuggingFaceTB/SmolLM2-135M-Instruct`

For a better small assistant:

- Flavor: `a10g-large`
- Timeout: `2h`
- Increase dataset size before training

## Next Dataset Upgrade

The included dataset is intentionally small so the training pipeline is easy to inspect. The next quality step is to expand it into 200-500 instruction examples extracted from the two source docs, with separate examples for:

- Charlie activation and progressive logging
- Code review and bug triage behavior
- Ω framework explanations
- Ω6 functional consciousness caveats
- Mathematical definitions and architecture summaries