| | --- |
| | title: README |
| | emoji: 🐢 |
| | colorFrom: blue |
| | colorTo: green |
| | sdk: static |
| | pinned: false |
| | --- |
| | |
| | # Mukayese: Turkish NLP Strikes Back |
| |
|
| | Turkish Natural Language Processing is left behind in developing state-of-the-art systems due to a lack of organized benchmarks and baselines. We fill this gap with __Mukayese__ (Turkish word for "comparison/benchmarking"), an extensive set of datasets and benchmarks for several Turkish NLP tasks. All of the datasets and code have been made public in this repository. |
| |
|
| | --- |
| | ## Updates |
| |
|
| | - (22/03/2022) Summarization models are online on Huggingface! |
| | - (25/02/2022) Datasets have been made available through pre-release [v0.0.1](https://github.com/alisafaya/mukayese/releases/tag/v0.0.1) |
| |
|
| | --- |
| | ## What to do with Mukayese ? |
| |
|
| | With Mukayese, researchers of Turkish NLP will be able to: |
| |
|
| | - Compare the performance of existing methods in leaderboards. |
| | - Access existing implementations of NLP baselines. |
| | - Evaluate their own methods on the relevant test datasets. |
| | - Submit their own work to be enlisted in our leaderboards. |
| |
|
| | ## Mukayese's Mission |
| |
|
| | The most important goal of Mukayese is to standardize the comparison and evaluation of Turkish NLP methods. As a result of the lack of a platform for benchmarking, Turkish NLP researchers struggle with comparing their models to the existing ones. |
| |
|
| |
|