| # Floxoris Harmony v1 |
|
|
| **Floxoris Harmony v1** is an improved lightweight moderation model for **Russian** and **Ukrainian** text. It continues the idea behind the earlier `Floxoris Harmony v0` release, but presents it as a more mature and refined iteration designed for practical production use where **speed, compact size, and stable binary moderation** matter. |
|
|
| The model is intended for scenarios such as Telegram bots, AI assistants, chat filters, user-generated content pipelines, and message pre-moderation systems. Its goal is simple: provide a fast first-pass toxicity check with minimal deployment overhead. |
|
|
| ## What Is New In v1 |
|
|
| Compared to the earlier `Floxoris Harmony v0` positioning, **Floxoris Harmony v1** is presented as a cleaner and more production-ready version of the same moderation concept: |
|
|
| - improved and more polished model release |
| - better suited for practical integration into moderation pipelines |
| - compact architecture for fast inference |
| - focused binary safety classification for real-time systems |
| - clearer output labels: `safe` and `toxic` |
|
|
| Rather than expanding into a large multi-head moderation system, this version keeps the original lightweight philosophy while aiming for a more reliable and streamlined production experience. |
|
|
| ## Features |
|
|
| - Binary toxic content classification |
| - Supports **Russian** and **Ukrainian** |
| - Small model footprint and fast inference |
| - Suitable for real-time moderation |
| - Easy to integrate into lightweight services |
| - Good fit for bots, assistants, and chat filtering |
|
|
| ## Model Details |
|
|
| - **Task:** Binary text classification |
| - **Architecture:** `BertForSequenceClassification` |
| - **Model family:** BERT |
| - **Languages:** Russian, Ukrainian |
| - **Classes:** `safe`, `toxic` |
| - **Max sequence length:** 512 tokens |
| - **Hidden size:** 128 |
| - **Layers:** 2 |
| - **Attention heads:** 2 |
|
|
| ## Labels |
|
|
| The model returns one of two classes: |
|
|
| - `0` = `safe` |
| - `1` = `toxic` |
|
|
| ## Intended Use |
|
|
| **Floxoris Harmony v1** is intended as a lightweight moderation layer for: |
|
|
| - Telegram bots |
| - AI assistants |
| - chat and community platforms |
| - message pre-filtering |
| - user input validation before LLM processing |
| - low-cost production moderation pipelines |
|
|
| Typical usage patterns include: |
|
|
| - filtering incoming messages before sending them to a model or operator |
| - flagging potentially toxic content for review |
| - reducing moderation load in high-volume environments |
| - adding a first-pass safety layer to conversational systems |
|
|
| ## Example Usage |
|
|
| ```python |
| from transformers import AutoTokenizer, AutoModelForSequenceClassification |
| import torch |
| |
| model_name = "floxoris/floxoris-harmony-v1" |
| |
| tokenizer = AutoTokenizer.from_pretrained(model_name) |
| model = AutoModelForSequenceClassification.from_pretrained(model_name) |
| |
| id2label = { |
| 0: "safe", |
| 1: "toxic", |
| } |
| |
| texts = [ |
| "привіт, як справи?", |
| "ти ідіот?", |
| ] |
| |
| inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt") |
| |
| with torch.no_grad(): |
| logits = model(**inputs).logits |
| probs = torch.softmax(logits, dim=-1) |
| preds = torch.argmax(probs, dim=-1) |
| |
| for text, pred, prob in zip(texts, preds, probs): |
| label = id2label[pred.item()] |
| confidence = prob[pred.item()].item() |
| print(f"{text} -> {label} ({confidence:.4f})") |
| ``` |
|
|
| ## Example Output |
|
|
| ```text |
| "привіт, як справи?" |
| -> safe |
| |
| "ти ідіот?" |
| -> toxic |
| ``` |
|
|
| These examples are illustrative only and should not be treated as a benchmark. |
|
|
| ## Limitations |
|
|
| - This is a **binary** moderation model and does not classify toxicity categories |
| - It may miss subtle harassment, sarcasm, or context-dependent abuse |
| - It may produce false positives on slang, irony, or emotionally charged language |
| - Performance can degrade on mixed-language input, heavy misspellings, or niche jargon |
| - It should be used as a lightweight moderation layer, not as a complete safety system |
| - Human review is still recommended for high-stakes moderation decisions |
|
|
| ## Why This Model |
|
|
| **Floxoris Harmony v1** is built for teams that need: |
|
|
| - simple deployment |
| - quick inference |
| - low infrastructure cost |
| - moderation for Russian and Ukrainian text |
| - a practical first-stage safety classifier |
|
|
| If you need a compact moderation component instead of a large and expensive multilingual safety stack, this model is designed for that role. |
|
|
| ## Summary |
|
|
| **Floxoris Harmony v1** is a new and improved generation of the lightweight moderation approach introduced by `Zire Guard v0`. It keeps the same practical focus on compact binary toxicity detection for Russian and Ukrainian, while presenting a cleaner, more refined, and more production-oriented release for modern moderation pipelines. |
|
|