GGUF
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="jerrimu/ERNIE-21B-REAP-GGUFS",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

ERNIE 21B reap 20%

Benchmark Score Notes
ARC-Easy 79.59% acc_norm
ARC-Challenge 50.60% acc_norm
HellaSwag 70.50% acc_norm
Winogrande 67.17% acc
GSM8K 79.00% exact_match (flexible-extract)
MMLU 65.82% acc (average across all subjects)
Downloads last month
28
GGUF
Model size
18B params
Architecture
ernie4_5-moe
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support