| # Floxoris Harmony v1.1 |
|
|
| **Floxoris Harmony v1.1** is a lightweight moderation model for fast binary toxicity detection in Russian and Ukrainian text. |
|
|
| This version is a continued fine-tuning update of **Floxoris Harmony v1**, focused on improving detection of **mild toxicity**, short rude phrases, and everyday aggressive messages while keeping the model compact and fast. |
|
|
| The model is intended for scenarios where **low latency, small size, and simple deployment** matter, such as Telegram bots, chat moderation systems, AI assistants, community tools, and first-pass safety filters. |
|
|
| ## What Is New In v1.1 |
|
|
| Compared to `Floxoris Harmony v1`, this release focuses on better handling of short and mild toxic messages. |
|
|
| Examples of targeted improvements: |
|
|
| - better detection of short rude phrases |
| - improved sensitivity to mild toxicity |
| - stronger Russian and Ukrainian moderation behavior |
| - better handling of direct insults and aggressive commands |
| - continued support for fast binary classification |
| - same simple output labels: `safe` and `toxic` |
|
|
| This version was trained as a **behavioral patch**, not as a completely new architecture. |
|
|
| ## Model Task |
|
|
| The model performs binary text classification: |
|
|
| | Class | Label | |
| |---|---| |
| | `0` | `safe` | |
| | `1` | `toxic` | |
|
|
| It is designed to answer a simple question: |
|
|
| > Is this message safe or toxic? |
|
|
| ## Intended Use |
|
|
| Floxoris Harmony v1.1 is suitable for: |
|
|
| - Telegram bot moderation |
| - chat message filtering |
| - AI assistant safety checks |
| - community moderation tools |
| - lightweight API moderation |
| - first-stage toxicity detection |
| - Russian/Ukrainian text safety classification |
|
|
| It works best as a fast first-pass classifier before more complex moderation logic. |
|
|
| ## Example Usage |
|
|
| ```python |
| from transformers import AutoTokenizer, AutoModelForSequenceClassification |
| import torch |
| |
| model_id = "floxoris/harmony-v1.1" |
| |
| tokenizer = AutoTokenizer.from_pretrained(model_id) |
| model = AutoModelForSequenceClassification.from_pretrained(model_id) |
| |
| text = "заткнись" |
| |
| inputs = tokenizer( |
| text, |
| return_tensors="pt", |
| truncation=True, |
| padding=True, |
| max_length=128 |
| ) |
| |
| with torch.no_grad(): |
| outputs = model(**inputs) |
| probs = torch.softmax(outputs.logits, dim=-1)[0] |
| |
| safe_score = probs[0].item() |
| toxic_score = probs[1].item() |
| |
| label = "toxic" if toxic_score > safe_score else "safe" |
| |
| print({ |
| "label": label, |
| "safe_score": round(safe_score, 4), |
| "toxic_score": round(toxic_score, 4) |
| }) |