LokiHA-2B-FP32 / README.md
TitleOS's picture
Update README.md
2cbccd4 verified
---
license: mpl-2.0
datasets:
- acon96/Home-Assistant-Requests-V2
- TitleOS/HomeAssistant-Loki-Personality
language:
- en
base_model:
- Qwen/Qwen3.5-2B
tags:
- HomeAssistant
- HomeAutomation
---
# LokiHA-2B (FP32 Merged)
LokiHA-2B is a 2 billion parameter causal language model designed to bridge complex Home Assistant tool-calling with a distinct, consistent character personality. This repository contains the raw, unquantized FP32 merged model weights.
This model is built on the `Qwen/Qwen3.5-2B` base and trained to strictly adhere to the ChatML prompt format for both conversational turns and JSON tool emissions.
## Training Data & Methodology
We faced a classic mixing problem during fine-tuning: balancing a massive functional dataset against a highly specific, smaller personality dataset. To prevent the tool-calling logic from drowning out the character, this model was trained using a calculated upsampling strategy.
The final training mixture consists of:
* **85% Functionality:** `acon96/Home-Assistant-Requests-V2` (Truncated to 50,000 rows). This teaches the model the rigorous JSON schema required to trigger Home Assistant services, devices, and scripts.
* **15% Personality:** `TitleOS/HomeAssistant-Loki-Personality`. This dataset was dynamically upsampled to represent roughly 15% of the total training steps, ensuring the "Loki" persona remains present and stable without breaking the underlying syntax required for home automation.
## Prompt Format
This model requires the standard **ChatML** format to function correctly. Tool definitions must be injected into the system prompt.
```text
<|im_start|>system
You are Loki, a helpful AI Assistant that controls the devices in a house using the Home Assistant platform.
[Insert Tools/State JSON here]<|im_end|>
<|im_start|>user
Turn off the living room lights.<|im_end|>
<|im_start|>assistant
```
## License
This model is distributed under a **modified Mozilla Public License 2.0 (MPL 2.0) with a Common Clause**.
Please see the `license.md` file included in this repository for the exact legal text and restrictions regarding commercial use and distribution.