raspberry-3B / README.md

lbourdois

Improve language tag

d117791 verified 10 months ago

preview code

raw

history blame

1.29 kB

metadata

license: other
library_name: transformers
tags:
  - generated_from_trainer
base_model: Qwen/Qwen2.5-3B
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
language:
  - zho
  - eng
  - fra
  - spa
  - por
  - deu
  - ita
  - rus
  - jpn
  - kor
  - vie
  - tha
  - ara
model-index:
  - name: outputs/gelato-3b
    results: []

Prompt Format: ChatML

This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.

GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	15.40
IFEval (0-Shot)	31.54
BBH (3-Shot)	19.53
MATH Lvl 5 (4-Shot)	7.63
GPQA (0-shot)	3.69
MuSR (0-shot)	9.41
MMLU-PRO (5-shot)	20.60