# Floxoris Harmony v1 **Floxoris Harmony v1** is an improved lightweight moderation model for **Russian** and **Ukrainian** text. It continues the idea behind the earlier `Floxoris Harmony v0` release, but presents it as a more mature and refined iteration designed for practical production use where **speed, compact size, and stable binary moderation** matter. The model is intended for scenarios such as Telegram bots, AI assistants, chat filters, user-generated content pipelines, and message pre-moderation systems. Its goal is simple: provide a fast first-pass toxicity check with minimal deployment overhead. ## What Is New In v1 Compared to the earlier `Floxoris Harmony v0` positioning, **Floxoris Harmony v1** is presented as a cleaner and more production-ready version of the same moderation concept: - improved and more polished model release - better suited for practical integration into moderation pipelines - compact architecture for fast inference - focused binary safety classification for real-time systems - clearer output labels: `safe` and `toxic` Rather than expanding into a large multi-head moderation system, this version keeps the original lightweight philosophy while aiming for a more reliable and streamlined production experience. ## Features - Binary toxic content classification - Supports **Russian** and **Ukrainian** - Small model footprint and fast inference - Suitable for real-time moderation - Easy to integrate into lightweight services - Good fit for bots, assistants, and chat filtering ## Model Details - **Task:** Binary text classification - **Architecture:** `BertForSequenceClassification` - **Model family:** BERT - **Languages:** Russian, Ukrainian - **Classes:** `safe`, `toxic` - **Max sequence length:** 512 tokens - **Hidden size:** 128 - **Layers:** 2 - **Attention heads:** 2 ## Labels The model returns one of two classes: - `0` = `safe` - `1` = `toxic` ## Intended Use **Floxoris Harmony v1** is intended as a lightweight moderation layer for: - Telegram bots - AI assistants - chat and community platforms - message pre-filtering - user input validation before LLM processing - low-cost production moderation pipelines Typical usage patterns include: - filtering incoming messages before sending them to a model or operator - flagging potentially toxic content for review - reducing moderation load in high-volume environments - adding a first-pass safety layer to conversational systems ## Example Usage ```python from transformers import AutoTokenizer, AutoModelForSequenceClassification import torch model_name = "floxoris/floxoris-harmony-v1" tokenizer = AutoTokenizer.from_pretrained(model_name) model = AutoModelForSequenceClassification.from_pretrained(model_name) id2label = { 0: "safe", 1: "toxic", } texts = [ "привіт, як справи?", "ти ідіот?", ] inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt") with torch.no_grad(): logits = model(**inputs).logits probs = torch.softmax(logits, dim=-1) preds = torch.argmax(probs, dim=-1) for text, pred, prob in zip(texts, preds, probs): label = id2label[pred.item()] confidence = prob[pred.item()].item() print(f"{text} -> {label} ({confidence:.4f})") ``` ## Example Output ```text "привіт, як справи?" -> safe "ти ідіот?" -> toxic ``` These examples are illustrative only and should not be treated as a benchmark. ## Limitations - This is a **binary** moderation model and does not classify toxicity categories - It may miss subtle harassment, sarcasm, or context-dependent abuse - It may produce false positives on slang, irony, or emotionally charged language - Performance can degrade on mixed-language input, heavy misspellings, or niche jargon - It should be used as a lightweight moderation layer, not as a complete safety system - Human review is still recommended for high-stakes moderation decisions ## Why This Model **Floxoris Harmony v1** is built for teams that need: - simple deployment - quick inference - low infrastructure cost - moderation for Russian and Ukrainian text - a practical first-stage safety classifier If you need a compact moderation component instead of a large and expensive multilingual safety stack, this model is designed for that role. ## Summary **Floxoris Harmony v1** is a new and improved generation of the lightweight moderation approach introduced by `Zire Guard v0`. It keeps the same practical focus on compact binary toxicity detection for Russian and Ukrainian, while presenting a cleaner, more refined, and more production-oriented release for modern moderation pipelines.