berkeley-nest/Nectar
Viewer • Updated • 183k • 1.34k • 295
How to use M4-ai/TinyMistral-248M-v2-cleaner with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="M4-ai/TinyMistral-248M-v2-cleaner") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("M4-ai/TinyMistral-248M-v2-cleaner")
model = AutoModelForSequenceClassification.from_pretrained("M4-ai/TinyMistral-248M-v2-cleaner")This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training. We recommend to use the sigmoid function to turn the logits into probabilities:
1 / (1 + torch.exp(logits))
Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.