RapMinerz commited on
Commit
e2236c5
·
1 Parent(s): 57624b2

update readme

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ library_name: gensim
12
 
13
  ## Overview
14
 
15
- Word2Bezbar are Word2Vec models trained on a 323MB dataset of cleaned French rap lyrics sourced from Genius. Tokenization has been done using NLTK french word_tokenze function, with a prior processing to remove french oral contractions.
16
 
17
  The model captures the semantic relationships between words in the context of French rap, providing a useful tool for studies associated to french slang and music writing.
18
 
 
12
 
13
  ## Overview
14
 
15
+ __Word2Bezbar__ are __Word2Vec__ models trained on __french rap lyrics__ sourced from __Genius__. Tokenization has been done using __NLTK__ french `word_tokenze` function, with a prior processing to remove __french oral contractions__. Used dataset size was __323MB__, corresponding to __77M tokens__.
16
 
17
  The model captures the semantic relationships between words in the context of French rap, providing a useful tool for studies associated to french slang and music writing.
18