Update README.md
## Tokenizer Details
We extended the vocabulary of the base Llama model from 32,000 to 57,000 tokens by adding 25,000 non-overlapping tokens from the new language.
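
As a rough illustration of this vocabulary extension (not necessarily the exact procedure used for this model), the sketch below adds tokens from a new-language vocabulary to a Llama tokenizer and resizes the embeddings to match; the base checkpoint name and `new_language_tokens.txt` are placeholders.

```python
# Sketch of the vocabulary-extension step; the checkpoint name and the
# token file are hypothetical placeholders, not artifacts of this repo.
from transformers import AutoModelForCausalLM, AutoTokenizer

base = "meta-llama/Llama-2-7b-hf"  # assumed base model
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Candidate tokens produced by a tokenizer trained on the new language.
with open("new_language_tokens.txt", encoding="utf-8") as f:
    candidate_tokens = [line.rstrip("\n") for line in f if line.strip()]

# add_tokens() skips tokens already in the vocabulary, so only
# non-overlapping tokens actually grow it.
num_added = tokenizer.add_tokens(candidate_tokens)
print(f"Added {num_added} tokens; new vocabulary size: {len(tokenizer)}")

# Resize the input/output embedding matrices so the new rows can be
# learned during continued pre-training.
model.resize_token_embeddings(len(tokenizer))
```
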
## Evaluation

|                              | SambaLingo-Serbian-Base | sr-gpt2 | bloom-7b1 | xglm-7.5B | mGPT-13B |
|------------------------------|-------------------------|---------|-----------|-----------|----------|
| Perplexity (Lower Is Better) | **1.436**               | -       | 2.140     | 2.404     | 2.429    |
| FLORES en->sr (8 shot, CHRF) | **0.448**               | 0.002   | 0.171     | 0.090     | 0.024    |
| FLORES sr->en (8 shot, CHRF) | **0.625**               | 0.071   | 0.206     | 0.257     | 0.026    |
| FLORES en->sr (8 shot, BLEU) | **0.188**               | 0.000   | 0.003     | 0.001     | 0.000    |
| FLORES sr->en (8 shot, BLEU) | **0.352**               | 0.000   | 0.019     | 0.040     | 0.000    |
| Belebele (3 shot)            | **48.33%**              | 23.00%  | 23.89%    | 27.00%    | 25.22%   |
| SIB-200 (3 shot)             | 55.39%                  | -       | 32.35%    | **61.76%**| 39.22%   |
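
For reference, a minimal sketch of how corpus-level CHRF and BLEU scores like the FLORES numbers above can be computed with `sacrebleu`; this is an illustration, not the exact evaluation harness used here, and the file names are placeholders.

```python
# Illustrative scoring of few-shot translations with sacrebleu; not the
# exact harness behind the table above. The two files are hypothetical:
# one model hypothesis and one FLORES reference per line, aligned.
import sacrebleu

with open("hypotheses.sr.txt", encoding="utf-8") as f:
    hypotheses = [line.strip() for line in f]
with open("references.sr.txt", encoding="utf-8") as f:
    references = [line.strip() for line in f]

chrf = sacrebleu.corpus_chrf(hypotheses, [references])
bleu = sacrebleu.corpus_bleu(hypotheses, [references])

# sacrebleu reports scores on a 0-100 scale; the table uses 0-1.
print(f"CHRF: {chrf.score / 100:.3f}  BLEU: {bleu.score / 100:.3f}")
```
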
## Uses
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->