jondurbin/gutenberg-dpo-v0.1
Viewer • Updated • 918 • 1.22k • 164
How to use lucyknada/ifable_gemma-2-Ifable-9B-exl2 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("lucyknada/ifable_gemma-2-Ifable-9B-exl2", dtype="auto")This model ranked first on the Creative Writing Benchmark (https://eqbench.com/creative_writing.html) on September 10, 2024
Training method: SimPO (GitHub - princeton-nlp/SimPO: SimPO: Simple Preference Optimization with a Reference-Free Reward)
It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen | Sft Loss |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1.4444 | 0.9807 | 35 | 1.0163 | -21.6822 | -47.8754 | 0.9167 | 26.1931 | -4.7875 | -2.1682 | -17.0475 | -12.0041 | 0.0184 |
We are looking for product manager and operations managers to build applications through our model, and also open for business cooperation, and also AI engineer to join us, contact with : contact@ifable.ai