harmony-v1 / README.md
sollamon's picture
release
521d842 verified
# Floxoris Harmony v1
**Floxoris Harmony v1** is an improved lightweight moderation model for **Russian** and **Ukrainian** text. It continues the idea behind the earlier `Floxoris Harmony v0` release, but presents it as a more mature and refined iteration designed for practical production use where **speed, compact size, and stable binary moderation** matter.
The model is intended for scenarios such as Telegram bots, AI assistants, chat filters, user-generated content pipelines, and message pre-moderation systems. Its goal is simple: provide a fast first-pass toxicity check with minimal deployment overhead.
## What Is New In v1
Compared to the earlier `Floxoris Harmony v0` positioning, **Floxoris Harmony v1** is presented as a cleaner and more production-ready version of the same moderation concept:
- improved and more polished model release
- better suited for practical integration into moderation pipelines
- compact architecture for fast inference
- focused binary safety classification for real-time systems
- clearer output labels: `safe` and `toxic`
Rather than expanding into a large multi-head moderation system, this version keeps the original lightweight philosophy while aiming for a more reliable and streamlined production experience.
## Features
- Binary toxic content classification
- Supports **Russian** and **Ukrainian**
- Small model footprint and fast inference
- Suitable for real-time moderation
- Easy to integrate into lightweight services
- Good fit for bots, assistants, and chat filtering
## Model Details
- **Task:** Binary text classification
- **Architecture:** `BertForSequenceClassification`
- **Model family:** BERT
- **Languages:** Russian, Ukrainian
- **Classes:** `safe`, `toxic`
- **Max sequence length:** 512 tokens
- **Hidden size:** 128
- **Layers:** 2
- **Attention heads:** 2
## Labels
The model returns one of two classes:
- `0` = `safe`
- `1` = `toxic`
## Intended Use
**Floxoris Harmony v1** is intended as a lightweight moderation layer for:
- Telegram bots
- AI assistants
- chat and community platforms
- message pre-filtering
- user input validation before LLM processing
- low-cost production moderation pipelines
Typical usage patterns include:
- filtering incoming messages before sending them to a model or operator
- flagging potentially toxic content for review
- reducing moderation load in high-volume environments
- adding a first-pass safety layer to conversational systems
## Example Usage
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
model_name = "floxoris/floxoris-harmony-v1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
id2label = {
0: "safe",
1: "toxic",
}
texts = [
"привіт, як справи?",
"ти ідіот?",
]
inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
logits = model(**inputs).logits
probs = torch.softmax(logits, dim=-1)
preds = torch.argmax(probs, dim=-1)
for text, pred, prob in zip(texts, preds, probs):
label = id2label[pred.item()]
confidence = prob[pred.item()].item()
print(f"{text} -> {label} ({confidence:.4f})")
```
## Example Output
```text
"привіт, як справи?"
-> safe
"ти ідіот?"
-> toxic
```
These examples are illustrative only and should not be treated as a benchmark.
## Limitations
- This is a **binary** moderation model and does not classify toxicity categories
- It may miss subtle harassment, sarcasm, or context-dependent abuse
- It may produce false positives on slang, irony, or emotionally charged language
- Performance can degrade on mixed-language input, heavy misspellings, or niche jargon
- It should be used as a lightweight moderation layer, not as a complete safety system
- Human review is still recommended for high-stakes moderation decisions
## Why This Model
**Floxoris Harmony v1** is built for teams that need:
- simple deployment
- quick inference
- low infrastructure cost
- moderation for Russian and Ukrainian text
- a practical first-stage safety classifier
If you need a compact moderation component instead of a large and expensive multilingual safety stack, this model is designed for that role.
## Summary
**Floxoris Harmony v1** is a new and improved generation of the lightweight moderation approach introduced by `Zire Guard v0`. It keeps the same practical focus on compact binary toxicity detection for Russian and Ukrainian, while presenting a cleaner, more refined, and more production-oriented release for modern moderation pipelines.