release

521d842 verified about 1 month ago

4.68 kB

	# Floxoris Harmony v1

	Floxoris Harmony v1 is an improved lightweight moderation model for Russian and Ukrainian text. It continues the idea behind the earlier `Floxoris Harmony v0` release, but presents it as a more mature and refined iteration designed for practical production use where speed, compact size, and stable binary moderation matter.

	The model is intended for scenarios such as Telegram bots, AI assistants, chat filters, user-generated content pipelines, and message pre-moderation systems. Its goal is simple: provide a fast first-pass toxicity check with minimal deployment overhead.

	## What Is New In v1

	Compared to the earlier `Floxoris Harmony v0` positioning, Floxoris Harmony v1 is presented as a cleaner and more production-ready version of the same moderation concept:

	- improved and more polished model release
	- better suited for practical integration into moderation pipelines
	- compact architecture for fast inference
	- focused binary safety classification for real-time systems
	- clearer output labels: `safe` and `toxic`

	Rather than expanding into a large multi-head moderation system, this version keeps the original lightweight philosophy while aiming for a more reliable and streamlined production experience.

	## Features

	- Binary toxic content classification
	- Supports Russian and Ukrainian
	- Small model footprint and fast inference
	- Suitable for real-time moderation
	- Easy to integrate into lightweight services
	- Good fit for bots, assistants, and chat filtering

	## Model Details

	- Task: Binary text classification
	- Architecture: `BertForSequenceClassification`
	- Model family: BERT
	- Languages: Russian, Ukrainian
	- Classes: `safe`, `toxic`
	- Max sequence length: 512 tokens
	- Hidden size: 128
	- Layers: 2
	- Attention heads: 2

	## Labels

	The model returns one of two classes:

	- `0` = `safe`
	- `1` = `toxic`

	## Intended Use

	Floxoris Harmony v1 is intended as a lightweight moderation layer for:

	- Telegram bots
	- AI assistants
	- chat and community platforms
	- message pre-filtering
	- user input validation before LLM processing
	- low-cost production moderation pipelines

	Typical usage patterns include:

	- filtering incoming messages before sending them to a model or operator
	- flagging potentially toxic content for review
	- reducing moderation load in high-volume environments
	- adding a first-pass safety layer to conversational systems

	## Example Usage

	```python
	from transformers import AutoTokenizer, AutoModelForSequenceClassification
	import torch

	model_name = "floxoris/floxoris-harmony-v1"

	tokenizer = AutoTokenizer.from_pretrained(model_name)
	model = AutoModelForSequenceClassification.from_pretrained(model_name)

	id2label = {
	0: "safe",
	1: "toxic",
	}

	texts = [
	"привіт, як справи?",
	"ти ідіот?",
	]

	inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

	with torch.no_grad():
	logits = model(**inputs).logits
	probs = torch.softmax(logits, dim=-1)
	preds = torch.argmax(probs, dim=-1)

	for text, pred, prob in zip(texts, preds, probs):
	label = id2label[pred.item()]
	confidence = prob[pred.item()].item()
	print(f"{text} -> {label} ({confidence:.4f})")
	```

	## Example Output

	```text
	"привіт, як справи?"
	-> safe

	"ти ідіот?"
	-> toxic
	```

	These examples are illustrative only and should not be treated as a benchmark.

	## Limitations

	- This is a binary moderation model and does not classify toxicity categories
	- It may miss subtle harassment, sarcasm, or context-dependent abuse
	- It may produce false positives on slang, irony, or emotionally charged language
	- Performance can degrade on mixed-language input, heavy misspellings, or niche jargon
	- It should be used as a lightweight moderation layer, not as a complete safety system
	- Human review is still recommended for high-stakes moderation decisions

	## Why This Model

	Floxoris Harmony v1 is built for teams that need:

	- simple deployment
	- quick inference
	- low infrastructure cost
	- moderation for Russian and Ukrainian text
	- a practical first-stage safety classifier

	If you need a compact moderation component instead of a large and expensive multilingual safety stack, this model is designed for that role.

	## Summary

	Floxoris Harmony v1 is a new and improved generation of the lightweight moderation approach introduced by `Zire Guard v0`. It keeps the same practical focus on compact binary toxicity detection for Russian and Ukrainian, while presenting a cleaner, more refined, and more production-oriented release for modern moderation pipelines.