Spaces:
Sleeping
Sleeping
update readme
Browse files
README.md
CHANGED
@@ -26,10 +26,15 @@ http://localhost:8080
|
|
26 |
The data used to train the classifier comes from the NADI 2021 dataset for Arabic Dialect Identification [(Abdul-Mageed et al., 2021)](#cite-mageed-2021).
|
27 |
It is a corpus of tweets collected using Twitter's API and labeled thanks to the users' locations with the country and region.
|
28 |
|
29 |
-
I used the language model `https://huggingface.co/moussaKam/AraBART` to extract features from the input text by taking the output of its last hidden layer. I used these vector embeddings as the input for a Multinomial Logistic Regression to classify the input text into one of the 21 dialects (Countries).
|
30 |
|
31 |
For more details, you can refer to the docs directory.
|
32 |
|
|
|
|
|
|
|
|
|
|
|
33 |
## References:
|
34 |
- <a name="cite-mageed-2021"></a>
|
35 |
[Abdul-Mageed et al., 2021](https://arxiv.org/abs/2103.08466)
|
|
|
26 |
The data used to train the classifier comes from the NADI 2021 dataset for Arabic Dialect Identification [(Abdul-Mageed et al., 2021)](#cite-mageed-2021).
|
27 |
It is a corpus of tweets collected using Twitter's API and labeled thanks to the users' locations with the country and region.
|
28 |
|
29 |
+
In the current version, I used the language model `https://huggingface.co/moussaKam/AraBART` to extract features from the input text by taking the output of its last hidden layer. I used these vector embeddings as the input for a Multinomial Logistic Regression to classify the input text into one of the 21 dialects (Countries).
|
30 |
|
31 |
For more details, you can refer to the docs directory.
|
32 |
|
33 |
+
## Releases
|
34 |
+
### v0.0.1
|
35 |
+
In the first release, I used the language model `https://huggingface.co/moussaKam/AraBART` to extract features from the input text by taking the output of its last hidden layer. I used these vector embeddings as the input for a Multinomial Logistic Regression to classify the input text into one of the 21 dialects (Countries).
|
36 |
+
### v0.0.2
|
37 |
+
|
38 |
## References:
|
39 |
- <a name="cite-mageed-2021"></a>
|
40 |
[Abdul-Mageed et al., 2021](https://arxiv.org/abs/2103.08466)
|