---
title: README
emoji: 🐢
colorFrom: blue
colorTo: green
sdk: static
pinned: false
---

# Mukayese: Turkish NLP Strikes Back

Turkish Natural Language Processing has fallen behind in developing state-of-the-art systems due to a lack of organized benchmarks and baselines. We fill this gap with __Mukayese__ (Turkish for "comparison/benchmarking"), an extensive set of datasets and benchmarks for several Turkish NLP tasks. All of the datasets and code are publicly available in this repository.

--- 
## Updates

- (22/03/2022) Summarization models are online on Hugging Face!
- (25/02/2022) Datasets have been made available through pre-release [v0.0.1](https://github.com/alisafaya/mukayese/releases/tag/v0.0.1)

---
## What to do with Mukayese?

With Mukayese, researchers of Turkish NLP will be able to:

 - Compare the performance of existing methods in leaderboards.
 - Access existing implementations of NLP baselines.
 - Evaluate their own methods on the relevant test datasets.
 - Submit their own work to be listed in our leaderboards.

## Mukayese's Mission

Mukayese's primary goal is to standardize the comparison and evaluation of Turkish NLP methods. Without a shared benchmarking platform, Turkish NLP researchers struggle to compare their models against existing ones.