Instructions to use TOKETTER/Omegus with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use TOKETTER/Omegus with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("HuggingFaceTB/SmolLM2-135M-Instruct") model = PeftModel.from_pretrained(base_model, "TOKETTER/Omegus") - Notebooks
- Google Colab
- Kaggle
| base_model: HuggingFaceTB/SmolLM2-135M-Instruct | |
| library_name: peft | |
| pipeline_tag: text-generation | |
| language: | |
| - es | |
| tags: | |
| - lora | |
| - sft | |
| - spanish | |
| - agent | |
| - technical-assistant | |
| license: apache-2.0 | |
| # Omegus | |
| Omegus is a Spanish technical chatbot model package for the Charlie / Omega agent architecture. | |
| It is designed to respond as a precise technical assistant with progressive status reporting, software architecture judgment, and clear explanations of the Omega framework. | |
| This repository currently contains a first local demo LoRA adapter. It is intentionally small and should be treated as a prototype checkpoint, not a production-quality assistant yet. | |
| Source material: | |
| - `../Charlie-Skill.md` | |
| - `../spec_maestra_framework_unificado_v0.3.md` | |
| The published adapter fine-tunes `HuggingFaceTB/SmolLM2-135M-Instruct` with LoRA using a compact chat dataset in `data/charlie_omega_sft.jsonl`. | |
| ## Training Run | |
| First published adapter: | |
| - Base model: `HuggingFaceTB/SmolLM2-135M-Instruct` | |
| - Method: LoRA SFT | |
| - Local hardware: Apple Silicon MPS | |
| - Dataset size: 20 chat examples | |
| - Epochs: 1 | |
| - Train loss: 3.502 | |
| - Eval loss: 3.588 | |
| Hugging Face Jobs training was attempted, but the account did not have enough prepaid credit balance at the time. The current adapter was trained locally instead. | |
| ## Intended Use | |
| - Spanish technical chatbot | |
| - Software architecture and code-review assistant behavior | |
| - Omega framework explanation and synthesis | |
| - Prototype agent-persona research | |
| ## Local Dry Run | |
| From this folder: | |
| ```bash | |
| uv run train_sft.py | |
| ``` | |
| This trains locally if your machine has the needed compute. | |
| ## Push To Hugging Face | |
| The default Hub target is `TOKETTER/Omegus`. With a logged-in Hugging Face session: | |
| ```bash | |
| export HUB_MODEL_ID="TOKETTER/Omegus" | |
| uv run train_sft.py | |
| ``` | |
| The script pushes LoRA adapter/checkpoints to the Hub when `HUB_MODEL_ID` is set. | |
| ## Quick Load | |
| ```python | |
| from peft import AutoPeftModelForCausalLM | |
| from transformers import AutoTokenizer | |
| repo = "TOKETTER/Omegus" | |
| tokenizer = AutoTokenizer.from_pretrained(repo) | |
| model = AutoPeftModelForCausalLM.from_pretrained(repo) | |
| ``` | |
| ## Recommended Cloud Job Shape | |
| Default Hub target: | |
| ```bash | |
| TOKETTER/Omegus | |
| ``` | |
| For a cheap demo on Hugging Face Jobs: | |
| - Flavor: `t4-small` or similar low-cost GPU | |
| - Timeout: `1h` | |
| - Base model: `HuggingFaceTB/SmolLM2-135M-Instruct` | |
| For a better small assistant: | |
| - Flavor: `a10g-large` | |
| - Timeout: `2h` | |
| - Increase dataset size before training | |
| ## Next Dataset Upgrade | |
| The included dataset is intentionally small so the training pipeline is easy to inspect. The next quality step is to expand it into 200-500 instruction examples extracted from the two source docs, with separate examples for: | |
| - Charlie activation and progressive logging | |
| - Code review and bug triage behavior | |
| - Ω framework explanations | |
| - Ω6 functional consciousness caveats | |
| - Mathematical definitions and architecture summaries | |