Updated readme.md
Browse files
README.md
CHANGED
@@ -12,3 +12,50 @@ tags: ["#AncientGreek", "#LanguageModels", "#Linguistics"]
|
|
12 |
---
|
13 |
|
14 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
---
|
13 |
|
14 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
15 |
+
|
16 |
+
## File Descriptions
|
17 |
+
|
18 |
+
### app.py
|
19 |
+
The main application file for the Streamlit app. This file contains the core logic and layout for the Agalma interface.
|
20 |
+
|
21 |
+
### requirements.txt
|
22 |
+
A list of Python dependencies required to run the app.
|
23 |
+
|
24 |
+
### models/
|
25 |
+
A directory containing pre-trained word2vec models. These models are used for generating vector representations of words in the interface.
|
26 |
+
|
27 |
+
### 3d_models/
|
28 |
+
A directory containing pre-trained 3D representations of the models in the ./models directory.
|
29 |
+
Dimensions were reduced using t-SNE technique.
|
30 |
+
|
31 |
+
### images/
|
32 |
+
A directory containing all necessary images for the layout of the interface.
|
33 |
+
|
34 |
+
### corpora/
|
35 |
+
A directory containing compressed corpora files to faster access all the words from the corpora.
|
36 |
+
|
37 |
+
### lemma_list_raw/
|
38 |
+
A directory containing .txt files with all lemmas occuring in each time slice.
|
39 |
+
|
40 |
+
### all_lemmas.pkl.gz
|
41 |
+
A compressed pickle file containing all lemmas that are in the LSJ dictionary.
|
42 |
+
|
43 |
+
### autocomplete.py
|
44 |
+
A Python file that contains functions to optimize the input suggestions given when a user is typing as input.
|
45 |
+
|
46 |
+
### compress_words.py
|
47 |
+
A Python file that contains functions on compressing words. The compressed files obtained are used to faster obtain words that are in the corpora.
|
48 |
+
|
49 |
+
### lsj_dict.json
|
50 |
+
A JSON file that contains relevant info in .xml files of the LSJ dictionary.
|
51 |
+
|
52 |
+
### lsj_dict.py
|
53 |
+
A Python file that contains function to convert .xml file representations of the LSJ dictionary into a JSON file.
|
54 |
+
|
55 |
+
### plots.py
|
56 |
+
A Python file containing functions to create 3D plots.
|
57 |
+
|
58 |
+
### word2vec.py
|
59 |
+
A Python file containing functions needed for computations with vectors.
|
60 |
+
|
61 |
+
|