Update README.md
README.md
CHANGED
@@ -12,10 +12,13 @@ pinned: false
 
 We are a non-profit research lab focused on understanding and building better Korean language models. See below for an overview of our projects.
 
-**
+**Benchmarks**
 We have built _the_ most-widely used korean benchmarks including HAE-RAE Bench (cultural knowledge, [dataset](https://huggingface.co/datasets/HAERAE-HUB/HAE_RAE_BENCH_1.0), [paper](https://arxiv.org/abs/2309.02706)),
 KMMLU (general knowledge, [dataset](https://huggingface.co/datasets/HAERAE-HUB/KMMLU), [paper](https://arxiv.org/abs/2402.11548)), HRM8K (math, [dataset](https://huggingface.co/datasets/HAERAE-HUB/HRM8K), [paper](https://www.arxiv.org/abs/2501.02448)), and KMMLU-Redux/Pro (general knowledge, [dataset](https://huggingface.co/datasets/LGAI-EXAONE/KMMLU-Pro), [paper](https://arxiv.org/abs/2507.08924)).
 
+**Evaluation**
+We developed the [haerae-evaluation-toolkit](https://github.com/HAE-RAE/haerae-evaluation-toolkit), a unified LLM evaluation framework designed to provide consistent and reproducible benchmarking for Korean and multilingual models.
+
 **Reasoning Language Models**
 With cooperation with [KISTI-KONI](https://huggingface.co/KISTI-KONI) we released the [KO-REAson](https://huggingface.co/KOREAson) series, <10B reasoning language models trained for Korean.
 