harmony-v1 / README.md
sollamon's picture
release
521d842 verified

Floxoris Harmony v1

Floxoris Harmony v1 is an improved lightweight moderation model for Russian and Ukrainian text. It continues the idea behind the earlier Floxoris Harmony v0 release, but presents it as a more mature and refined iteration designed for practical production use where speed, compact size, and stable binary moderation matter.

The model is intended for scenarios such as Telegram bots, AI assistants, chat filters, user-generated content pipelines, and message pre-moderation systems. Its goal is simple: provide a fast first-pass toxicity check with minimal deployment overhead.

What Is New In v1

Compared to the earlier Floxoris Harmony v0 positioning, Floxoris Harmony v1 is presented as a cleaner and more production-ready version of the same moderation concept:

  • improved and more polished model release
  • better suited for practical integration into moderation pipelines
  • compact architecture for fast inference
  • focused binary safety classification for real-time systems
  • clearer output labels: safe and toxic

Rather than expanding into a large multi-head moderation system, this version keeps the original lightweight philosophy while aiming for a more reliable and streamlined production experience.

Features

  • Binary toxic content classification
  • Supports Russian and Ukrainian
  • Small model footprint and fast inference
  • Suitable for real-time moderation
  • Easy to integrate into lightweight services
  • Good fit for bots, assistants, and chat filtering

Model Details

  • Task: Binary text classification
  • Architecture: BertForSequenceClassification
  • Model family: BERT
  • Languages: Russian, Ukrainian
  • Classes: safe, toxic
  • Max sequence length: 512 tokens
  • Hidden size: 128
  • Layers: 2
  • Attention heads: 2

Labels

The model returns one of two classes:

  • 0 = safe
  • 1 = toxic

Intended Use

Floxoris Harmony v1 is intended as a lightweight moderation layer for:

  • Telegram bots
  • AI assistants
  • chat and community platforms
  • message pre-filtering
  • user input validation before LLM processing
  • low-cost production moderation pipelines

Typical usage patterns include:

  • filtering incoming messages before sending them to a model or operator
  • flagging potentially toxic content for review
  • reducing moderation load in high-volume environments
  • adding a first-pass safety layer to conversational systems

Example Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

model_name = "floxoris/floxoris-harmony-v1"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

id2label = {
    0: "safe",
    1: "toxic",
}

texts = [
    "привіт, як справи?",
    "ти ідіот?",
]

inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits
    probs = torch.softmax(logits, dim=-1)
    preds = torch.argmax(probs, dim=-1)

for text, pred, prob in zip(texts, preds, probs):
    label = id2label[pred.item()]
    confidence = prob[pred.item()].item()
    print(f"{text} -> {label} ({confidence:.4f})")

Example Output

"привіт, як справи?"
-> safe

"ти ідіот?"
-> toxic

These examples are illustrative only and should not be treated as a benchmark.

Limitations

  • This is a binary moderation model and does not classify toxicity categories
  • It may miss subtle harassment, sarcasm, or context-dependent abuse
  • It may produce false positives on slang, irony, or emotionally charged language
  • Performance can degrade on mixed-language input, heavy misspellings, or niche jargon
  • It should be used as a lightweight moderation layer, not as a complete safety system
  • Human review is still recommended for high-stakes moderation decisions

Why This Model

Floxoris Harmony v1 is built for teams that need:

  • simple deployment
  • quick inference
  • low infrastructure cost
  • moderation for Russian and Ukrainian text
  • a practical first-stage safety classifier

If you need a compact moderation component instead of a large and expensive multilingual safety stack, this model is designed for that role.

Summary

Floxoris Harmony v1 is a new and improved generation of the lightweight moderation approach introduced by Zire Guard v0. It keeps the same practical focus on compact binary toxicity detection for Russian and Ukrainian, while presenting a cleaner, more refined, and more production-oriented release for modern moderation pipelines.