RapMinerz committed
Commit · e2236c5
1 Parent(s): 57624b2
update readme
README.md CHANGED
@@ -12,7 +12,7 @@ library_name: gensim
 
 ## Overview
 
-
+__Word2Bezbar__ are __Word2Vec__ models trained on __french rap lyrics__ sourced from __Genius__. Tokenization has been done using __NLTK__ french `word_tokenze` function, with a prior processing to remove __french oral contractions__. Used dataset size was __323MB__, corresponding to __77M tokens__.
 
 The model captures the semantic relationships between words in the context of French rap, providing a useful tool for studies associated to french slang and music writing.
 