---
license: mpl-2.0
datasets:
- acon96/Home-Assistant-Requests-V2
- TitleOS/HomeAssistant-Loki-Personality
language:
- en
base_model:
- Qwen/Qwen3.5-2B
tags:
- HomeAssistant
- HomeAutomation
---

# LokiHA-2B (FP32 Merged)

LokiHA-2B is a 2-billion-parameter causal language model designed to bridge complex Home Assistant tool-calling with a distinct, consistent character personality. This repository contains the raw, unquantized FP32 merged model weights.

This model is built on the `Qwen/Qwen3.5-2B` base and trained to strictly adhere to the ChatML prompt format for both conversational turns and JSON tool emissions.

## Training Data & Methodology

We faced a classic mixing problem during fine-tuning: balancing a massive functional dataset against a highly specific, smaller personality dataset. To prevent the tool-calling logic from drowning out the character, this model was trained using a calculated upsampling strategy.

The final training mixture consists of:

* **85% Functionality:** `acon96/Home-Assistant-Requests-V2` (truncated to 50,000 rows). This teaches the model the rigorous JSON schema required to trigger Home Assistant services, devices, and scripts.
* **15% Personality:** `TitleOS/HomeAssistant-Loki-Personality`. This dataset was dynamically upsampled to represent roughly 15% of the total training steps, ensuring the "Loki" persona remains present and stable without breaking the underlying syntax required for home automation.

## Prompt Format

This model requires the standard **ChatML** format to function correctly. Tool definitions must be injected into the system prompt.

```text
<|im_start|>system
You are Loki, a helpful AI Assistant that controls the devices in a house using the Home Assistant platform.
[Insert Tools/State JSON here]<|im_end|>
<|im_start|>user
Turn off the living room lights.<|im_end|>
<|im_start|>assistant
```

## License

This model is distributed under a **modified Mozilla Public License 2.0 (MPL 2.0) with a Commons Clause**.
Please see the `license.md` file included in this repository for the exact legal text and restrictions regarding commercial use and distribution.
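
## Example: Building the Prompt

The ChatML layout described under Prompt Format can be assembled with plain string formatting. The following is a minimal sketch, not an official client: the helper name `build_chatml_prompt`, the shape of the `example_tools` payload, and the choice to place the serialized JSON on its own line after the persona sentence are all assumptions made for illustration. The actual tool and state schema should follow whatever your Home Assistant integration supplies.

```python
import json

IM_START = "<|im_start|>"
IM_END = "<|im_end|>"

# Persona sentence taken from the prompt template in this README.
SYSTEM_PERSONA = (
    "You are Loki, a helpful AI Assistant that controls the devices "
    "in a house using the Home Assistant platform."
)


def build_chatml_prompt(user_message, tools_and_state, persona=SYSTEM_PERSONA):
    """Return a ChatML prompt that ends with an open assistant turn.

    Hypothetical helper: the newline between the persona and the JSON
    payload is an assumption, not something the model card specifies.
    """
    system_content = persona + "\n" + json.dumps(tools_and_state)
    return (
        f"{IM_START}system\n{system_content}{IM_END}\n"
        f"{IM_START}user\n{user_message}{IM_END}\n"
        f"{IM_START}assistant\n"
    )


# Hypothetical tools/state payload, for illustration only.
example_tools = {
    "devices": [{"entity_id": "light.living_room", "state": "on"}],
    "services": ["light.turn_on", "light.turn_off"],
}

prompt = build_chatml_prompt("Turn off the living room lights.", example_tools)
print(prompt)
```

The resulting string can be passed to whichever inference runtime you use. Leaving the final `assistant` turn open is what prompts the model to generate its reply or tool call.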