ggml files of bge-small-en

You can use this ggml for https://github.com/skeskinen/bert.cpp

bge-small-en

Data Type STSBenchmark eval time EmotionClassification eval time
f32 0.8654 12.81 0.5111 26.28
f16 0.8654 12.02 0.5112 19.39
q4_0 0.8637 10.07 0.5073 44.53
q4_1 0.8645 11.04 0.5087 39.58

all-MiniLM-L12-v2

Data Type STSBenchmark eval time EmotionClassification eval time
f32 0.8306 13.36 0.4117 21.23
f16 0.8306 11.51 0.4119 20.08
q4_0 0.8310 11.27 0.4183 20.81
q4_1 0.8325 12.37 0.4093 19.38

all-MiniLM-L6-v2

Data Type STSBenchmark eval time EmotionClassification eval time
f32 0.8201 6.83 0.4082 11.34
f16 0.8201 6.17 0.4085 10.28
q4_0 0.8175 5.45 0.3911 10.63
q4_1 0.8223 6.79 0.4027 11.41

bert-base-uncased

Data Type STSBenchmark eval time EmotionClassification eval time
f32 0.4738 52.38 0.3361 88.56
f16 0.4739 33.24 0.3361 55.86
q4_0 0.4940 33.93 0.3375 57.82
q4_1 0.4612 36.86 0.3318 59.63
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.