Safetensors
ernie4_5_moe

This is a Pre-SFT/DPO checkpoint of IRIS, our ERNIE finetune.

These improvements over ERNIE-21B-REAP have been noted

Benchmark Pre-CPT Post-CPT Δ

ARC-Easy 79.6 83.9 +4.3

ARC-Challenge 50.6 60.4 +9.8

HellaSwag 70.5 78.9 +8.4

Winogrande 67.2 72.1 +4.9

Downloads last month
237
Safetensors
Model size
18B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for jerrimu/IRIS-18B-CPT

Finetuned
(1)
this model

Datasets used to train jerrimu/IRIS-18B-CPT