apeters commited on
Commit
1985f61
Β·
verified Β·
1 Parent(s): ce5e843

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -9,6 +9,7 @@ pinned: false
9
  <p align="center">
10
  <img src="OpenDataArena.PNG" alt="OpenDataArena Banner" width="300">
11
  </p>
 
12
  ## 🌐 About OpenDataArena
13
 
14
  **OpenDataArena (ODA)** is an open research initiative devoted to evaluating, benchmarking, and creating high-value datasets for the post-training era of large language models (LLMs).
@@ -19,9 +20,9 @@ To make **data evaluation scientific, transparent, and community-driven**, while
19
 
20
  ### πŸ”‘ Key Features
21
 
22
- - πŸ† **Dataset Leaderboard** β€” [Leaderboard](https://opendataarena.github.io/leaderboard.html) ranks and visualizes the most valuable datasets across multiple domains, based on unified post-training benchmarks.
23
- - πŸ“Š **Comprehensive Scoring System** β€” [Scoring tool](https://github.com/OpenDataArena/OpenDataArena-Tool/tree/main/data_scorer) measures dataset quality, diversity, difficulty, and learning value using reproducible pipelines.
24
- - 🧰 **Open-Source Toolkit** β€” *[OpenDataArena-Tool](https://github.com/OpenDataArena/OpenDataArena-Tool)* enables dataset curation, scoring, and analysis with a standardized, community-driven workflow.
25
  - 🌱 **High-Value Data Generation** β€” beyond evaluation, ODA continuously produces and shares new, top-quality datasets for fine-tuning and alignment research.
26
 
27
 
 
9
  <p align="center">
10
  <img src="OpenDataArena.PNG" alt="OpenDataArena Banner" width="300">
11
  </p>
12
+
13
  ## 🌐 About OpenDataArena
14
 
15
  **OpenDataArena (ODA)** is an open research initiative devoted to evaluating, benchmarking, and creating high-value datasets for the post-training era of large language models (LLMs).
 
20
 
21
  ### πŸ”‘ Key Features
22
 
23
+ - πŸ† **Dataset Leaderboard** β€” [Leaderboard](https://opendataarena.github.io/leaderboard.html) ranks the most valuable datasets across multiple domains, based on unified post-training benchmarks.
24
+ - πŸ“Š **Comprehensive Scoring System** β€” [Scoring tool](https://github.com/OpenDataArena/OpenDataArena-Tool/tree/main/data_scorer) measures dataset quality, diversity, and learning values using reproducible pipelines.
25
+ - 🧰 **Open-Source Toolkit** β€” *[OpenDataArena-Tool](https://github.com/OpenDataArena/OpenDataArena-Tool)* enables dataset evaluation, scoring with a standardized, community-driven workflow.
26
  - 🌱 **High-Value Data Generation** β€” beyond evaluation, ODA continuously produces and shares new, top-quality datasets for fine-tuning and alignment research.
27
 
28