CodeBarcenas-7b / README.md
leaderboard-pr-bot's picture
Adding Evaluation Results
02aa992
|
raw
history blame
929 Bytes
metadata
license: llama2
language:
  - en

CodeBarcenas Model specialized in the Python language Based on the model: WizardLM/WizardCoder-Python-7B-V1.0 And trained with the dataset: mlabonne/Evol-Instruct-Python-26k

Made with ❤️ in Guadalupe, Nuevo Leon, Mexico 🇲🇽

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 35.03
ARC (25-shot) 42.32
HellaSwag (10-shot) 63.43
MMLU (5-shot) 33.39
TruthfulQA (0-shot) 38.51
Winogrande (5-shot) 60.38
GSM8K (5-shot) 2.5
DROP (3-shot) 4.71