Instructions to use IlyaGusev/rubertconv_toxic_clf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use IlyaGusev/rubertconv_toxic_clf with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="IlyaGusev/rubertconv_toxic_clf")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("IlyaGusev/rubertconv_toxic_clf") model = AutoModelForSequenceClassification.from_pretrained("IlyaGusev/rubertconv_toxic_clf") - Notebooks
- Google Colab
- Kaggle
RuBERTConv Toxic Classifier
Model description
Based on rubert-base-cased-conversational model
Intended uses & limitations
How to use
Colab: link
from transformers import pipeline
model_name = "IlyaGusev/rubertconv_toxic_clf"
pipe = pipeline("text-classification", model=model_name, tokenizer=model_name, framework="pt")
text = "Ты придурок из интернета"
pipe([text])
Training data
Datasets:
Augmentations:
- ё -> е
- Remove or add "?" or "!"
- Fix CAPS
- Concatenate toxic and non-toxic texts
- Concatenate two non-toxic texts
- Add toxic words from vocabulary
- Add typos
- Mask toxic words with "*", "@", "$"
Training procedure
TBA
- Downloads last month
- 444