metadata
license: apache-2.0
datasets:
- ConicCat/AntiRepV0.2
language:
- en
base_model:
- swiss-ai/Apertus-8B-2509
TODO: improve model card
Trained from the Apertus-8B base model using Liger Kernel, with a two-step recipe: Anchored SFT followed by LD-DPO.
Prompt format: Alpaca template, no system prompt.
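A minimal sketch of building a prompt in this format. It assumes the common Alpaca section headers (`### Instruction:`, `### Input:`, `### Response:`) and simply omits the usual preamble sentence to match "no system"; the exact whitespace used in training is not stated in this card, and `build_alpaca_prompt` is a hypothetical helper name.

```python
def build_alpaca_prompt(instruction: str, user_input: str = "") -> str:
    """Format a single-turn Alpaca-style prompt with no system preamble.

    Assumption: standard Alpaca headers; the card does not spell out the
    exact template text, so adjust if outputs look off.
    """
    if user_input:
        # Variant with an optional input/context section.
        return (
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{user_input}\n\n"
            "### Response:\n"
        )
    # Instruction-only variant.
    return f"### Instruction:\n{instruction}\n\n### Response:\n"


prompt = build_alpaca_prompt("Write a short scene set in a lighthouse.")
print(prompt)
```

The model's reply is expected to continue directly after the `### Response:` header.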
Recommended samplers: temperature 0.7, top_p 0.95, no repetition penalty or DRY.
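As a rough sketch, these settings could be expressed as generation keyword arguments for Hugging Face `transformers`. The name `SAMPLING_KWARGS` is hypothetical, and setting `repetition_penalty` to 1.0 is how "no rep pen" is assumed to map onto that API. DRY is a sampler offered by some inference backends rather than a `generate` argument, so it just stays disabled there.

```python
# Sampler settings from this card, written as transformers generate() kwargs.
SAMPLING_KWARGS = {
    "do_sample": True,          # sampling must be on for temperature/top_p to apply
    "temperature": 0.7,
    "top_p": 0.95,
    "repetition_penalty": 1.0,  # 1.0 means no repetition penalty
}

for name, value in SAMPLING_KWARGS.items():
    print(f"{name} = {value}")
```

With a loaded model and tokenized inputs, these would typically be passed as `model.generate(**inputs, **SAMPLING_KWARGS, max_new_tokens=512)`, where `max_new_tokens=512` is an arbitrary placeholder and not a recommendation from this card.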