---
license: apache-2.0
datasets:
- ConicCat/AntiRepV0.2
language:
- en
base_model:
- swiss-ai/Apertus-8B-2509
---

TODO: improve model card

Trained from Apertus-8B Base using the Liger kernel with a two-step recipe: Anchored SFT followed by LD-DPO.

Prompt format: Alpaca template, no system prompt.

Sampler settings: temperature 0.7, top_p 0.95, no repetition penalty or DRY.
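
A minimal sketch of the prompt format and sampler settings, for illustration only. It assumes "Alpaca template, no system" means the usual `### Instruction:` / `### Response:` headers with the introductory preamble dropped; the exact whitespace used in training is not stated here. `build_alpaca_prompt` and `SAMPLING_KWARGS` are hypothetical names. The dictionary keys follow the keyword names accepted by Hugging Face `transformers` `generate()`; DRY is left out because it is simply not enabled.

```python
def build_alpaca_prompt(instruction: str, user_input: str = "") -> str:
    """Format a single-turn Alpaca-style prompt with no system preamble.

    Assumption: header names and blank-line spacing follow the common
    Alpaca layout; the card does not spell out the exact template.
    """
    parts = [f"### Instruction:\n{instruction.strip()}"]
    if user_input.strip():
        # Optional "Input" block from the original Alpaca format.
        parts.append(f"### Input:\n{user_input.strip()}")
    # Leave the response header open so the model continues from here.
    parts.append("### Response:\n")
    return "\n\n".join(parts)


# Sampler settings from this card, expressed as generation keyword arguments.
SAMPLING_KWARGS = {
    "do_sample": True,
    "temperature": 0.7,
    "top_p": 0.95,
    "repetition_penalty": 1.0,  # 1.0 means no repetition penalty
}

if __name__ == "__main__":
    print(build_alpaca_prompt("Write a haiku about rivers."))
```

The resulting string can be tokenized and passed to a backend of your choice, with `SAMPLING_KWARGS` unpacked into the generation call if that backend uses the same parameter names.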