Files changed (2) hide show
  1. README (3).md +168 -0
  2. vocab (1).txt +0 -0
README (3).md ADDED
@@ -0,0 +1,168 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - multilingual
4
+ - af
5
+ - sq
6
+ - am
7
+ - ar
8
+ - hy
9
+ - as
10
+ - az
11
+ - eu
12
+ - be
13
+ - bn
14
+ - bs
15
+ - bg
16
+ - my
17
+ - ca
18
+ - ceb
19
+ - zh
20
+ - co
21
+ - hr
22
+ - cs
23
+ - da
24
+ - nl
25
+ - en
26
+ - eo
27
+ - et
28
+ - fi
29
+ - fr
30
+ - fy
31
+ - gl
32
+ - ka
33
+ - de
34
+ - el
35
+ - gu
36
+ - ht
37
+ - ha
38
+ - haw
39
+ - he
40
+ - hi
41
+ - hmn
42
+ - hu
43
+ - is
44
+ - ig
45
+ - id
46
+ - ga
47
+ - it
48
+ - ja
49
+ - jv
50
+ - kn
51
+ - kk
52
+ - km
53
+ - rw
54
+ - ko
55
+ - ku
56
+ - ky
57
+ - lo
58
+ - la
59
+ - lv
60
+ - lt
61
+ - lb
62
+ - mk
63
+ - mg
64
+ - ms
65
+ - ml
66
+ - mt
67
+ - mi
68
+ - mr
69
+ - mn
70
+ - ne
71
+ - no
72
+ - ny
73
+ - or
74
+ - fa
75
+ - pl
76
+ - pt
77
+ - pa
78
+ - ro
79
+ - ru
80
+ - sm
81
+ - gd
82
+ - sr
83
+ - st
84
+ - sn
85
+ - si
86
+ - sk
87
+ - sl
88
+ - so
89
+ - es
90
+ - su
91
+ - sw
92
+ - sv
93
+ - tl
94
+ - tg
95
+ - ta
96
+ - tt
97
+ - te
98
+ - th
99
+ - bo
100
+ - tr
101
+ - tk
102
+ - ug
103
+ - uk
104
+ - ur
105
+ - uz
106
+ - vi
107
+ - cy
108
+ - wo
109
+ - xh
110
+ - yi
111
+ - yo
112
+ - zu
113
+ pipeline_tag: sentence-similarity
114
+ tags:
115
+ - sentence-transformers
116
+ - feature-extraction
117
+ - sentence-similarity
118
+ library_name: sentence-transformers
119
+ license: apache-2.0
120
+ ---
121
+
122
+ # LaBSE
123
+ This is a port of the [LaBSE](https://tfhub.dev/google/LaBSE/1) model to PyTorch. It can be used to map 109 languages to a shared vector space.
124
+
125
+
126
+ ## Usage (Sentence-Transformers)
127
+
128
+ Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
129
+
130
+ ```
131
+ pip install -U sentence-transformers
132
+ ```
133
+
134
+ Then you can use the model like this:
135
+
136
+ ```python
137
+ from sentence_transformers import SentenceTransformer
138
+ sentences = ["This is an example sentence", "Each sentence is converted"]
139
+
140
+ model = SentenceTransformer('sentence-transformers/LaBSE')
141
+ embeddings = model.encode(sentences)
142
+ print(embeddings)
143
+ ```
144
+
145
+
146
+
147
+ ## Evaluation Results
148
+
149
+
150
+
151
+ For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name=sentence-transformers/LaBSE)
152
+
153
+
154
+
155
+ ## Full Model Architecture
156
+ ```
157
+ SentenceTransformer(
158
+ (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
159
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
160
+ (2): Dense({'in_features': 768, 'out_features': 768, 'bias': True, 'activation_function': 'torch.nn.modules.activation.Tanh'})
161
+ (3): Normalize()
162
+ )
163
+ ```
164
+
165
+ ## Citing & Authors
166
+
167
+ Have a look at [LaBSE](https://tfhub.dev/google/LaBSE/1) for the respective publication that describes LaBSE.
168
+
vocab (1).txt ADDED
The diff for this file is too large to render. See raw diff