---
title: README
emoji: ⛰️
colorFrom: gray
colorTo: red
sdk: static
pinned: false
---

---
QIMMA ⛰️ — A quality-first Arabic LLM Leaderboard that evaluates and compares the performance of Arabic Large Language Models.
---

## About

QIMMA قمّة (Summit in Arabic) is a quality-assured Arabic LLM evaluation leaderboard built on 14 carefully chosen benchmarks spanning STEM, legal reasoning, medical knowledge, poetry, cultural understanding, and code generation. QIMMA includes over 52,000 quality-validated samples across multiple-choice, generative, and code evaluation tracks. Over 99% of QIMMA's content is native Arabic, ensuring authentic linguistic and cultural assessment rather than relying on translated materials. 
The leaderboard is powered by a fully automated GPU evaluation pipeline, generously supported by **[TII (Technology Innovation Institute)](https://www.tii.ae)**, running on H100 infrastructure. Every submitted model goes through the same pipeline under the same conditions.

---

## Submit Your Model

Submissions are open to everyone. To submit a model, use the [QIMMA leaderboard](https://huggingface.co/spaces/qimma/leaderboard) Space.
Set `model type` to `base` for base models or `instruct` for chat/instruction-tuned models.

---

## Evaluation Queue & Delays

We run a shared GPU evaluation queue. Jobs are picked up automatically every few minutes, but **evaluation times vary** depending on model size and current queue load — please expect delays of several hours during busy periods.
You can track your submission status at any time by checking the `status` field in the [leaderboard-requests](https://huggingface.co/datasets/qimma/leaderboard-requests) dataset:

| Status | Meaning |
|--------|---------|
| `submitted` | Request received, waiting to be picked up |
| `pending` | Job dispatched, queued on GPU cluster |
| `running` | Evaluation actively in progress |
| `finished` | Results published to the leaderboard |
| `failed` | Something went wrong |

If your submission has been stuck for more than 24 hours or you have any questions, open a thread in the **[Discussion tab](https://huggingface.co/spaces/qimma/leaderboard/discussions)** and we'll look into it.
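
If you check submissions from a script, the status table and the 24-hour rule above can be sketched in a few lines of Python. This is only an illustration, not part of the QIMMA pipeline: the status strings and meanings are copied from the table, while the helper names and the `submitted_at` timestamp are assumptions, since the exact schema of the requests dataset is not described here.

```python
from datetime import datetime, timedelta, timezone

# Status values and meanings, copied from the table above.
STATUS_MEANINGS = {
    "submitted": "Request received, waiting to be picked up",
    "pending": "Job dispatched, queued on GPU cluster",
    "running": "Evaluation actively in progress",
    "finished": "Results published to the leaderboard",
    "failed": "Something went wrong",
}

# Statuses after which no further progress is expected.
TERMINAL_STATUSES = {"finished", "failed"}

# Threshold from the paragraph above: stuck for more than 24 hours.
STUCK_AFTER = timedelta(hours=24)


def describe_status(status):
    """Return the documented meaning of a status, or a note if it is unknown."""
    return STATUS_MEANINGS.get(status, f"Unknown status: {status!r}")


def looks_stuck(status, submitted_at, now=None):
    """Return True if a request is unfinished more than 24 hours after submission.

    `submitted_at` is a hypothetical timezone-aware timestamp; take it from
    whatever submission time your request record actually carries.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return status not in TERMINAL_STATUSES and (now - submitted_at) > STUCK_AFTER


if __name__ == "__main__":
    submitted = datetime(2025, 1, 1, 8, 0, tzinfo=timezone.utc)
    check_time = datetime(2025, 1, 2, 9, 0, tzinfo=timezone.utc)
    print(describe_status("pending"))
    print(looks_stuck("pending", submitted, now=check_time))
```

When `looks_stuck` returns `True`, that is the point at which opening a thread in the Discussion tab makes sense. A `failed` status is not treated as stuck, because the job has already ended.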

---