---
language:
- de
tags:
- pytorch
- query-generation
widget:
- text: "Das Lama (Lama glama) ist eine Art der Kamele. Es ist in den südamerikanischen Anden verbreitet und eine vom Guanako abstammende Haustierform."
  example_title: "Article 1"
license: apache-2.0
metrics:
- Rouge-Score
---
# mt5-small-german-query-generation

## Model description:

This model generates possible search queries for a German input article.

We fine-tuned the multilingual T5 model [mt5-small](https://huggingface.co/google/mt5-small) on the [MMARCO dataset](https://huggingface.co/datasets/unicamp-dl/mmarco), the machine-translated version of the MS MARCO dataset.
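A minimal usage sketch with the `transformers` Auto classes. The repo id below is a placeholder for this model's actual Hub path, and the generation settings (beam search, three returned queries) are illustrative assumptions, not the authors' recommended configuration:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder repo id -- substitute the actual Hub path of this model.
model_name = "mt5-small-german-query-generation"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

article = (
    "Das Lama (Lama glama) ist eine Art der Kamele. Es ist in den "
    "südamerikanischen Anden verbreitet und eine vom Guanako abstammende "
    "Haustierform."
)
# Truncate to the lengths used in training (512 input / 64 target tokens).
inputs = tokenizer(article, max_length=512, truncation=True, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_length=64,
    num_beams=5,              # assumption: beam search for query diversity
    num_return_sequences=3,   # assumption: return several candidate queries
)
for ids in outputs:
    print(tokenizer.decode(ids, skip_special_tokens=True))
```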

The model was trained for one epoch on 200,000 unique queries from the dataset, on a single K80 GPU for 25,000 iterations, with the following parameters:
- learning rate: 1e-3
- train batch size: 8
- max input sequence length: 512
- max target sequence length: 64
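Expressed against the `transformers` Trainer API, the reported hyperparameters map roughly onto the configuration below. This is a sketch, not the authors' training script; the argument names and output directory are assumptions:

```python
from transformers import Seq2SeqTrainingArguments

# Reported hyperparameters as Trainer arguments (illustrative only).
training_args = Seq2SeqTrainingArguments(
    output_dir="mt5-small-german-query-generation",  # assumed name
    num_train_epochs=1,
    learning_rate=1e-3,
    per_device_train_batch_size=8,
    predict_with_generate=True,
    generation_max_length=64,   # max target sequence length
)
# The 512-token input limit is applied at tokenization time, e.g.
# tokenizer(..., max_length=512, truncation=True).
```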

## Model Performance:

The model was evaluated on 2,000 evaluation paragraphs from the dataset. Mean [F1 ROUGE scores](https://github.com/pltrdy/rouge) were calculated for the model.

| Rouge-1 | Rouge-2 | Rouge-L |
|---------|---------|---------|
| 0.162   | 0.052   | 0.161   |
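For intuition about the metric: ROUGE-1 F1 is the harmonic mean of unigram precision and recall between a generated query and the reference. A minimal sketch of that computation (the scores above were produced with the linked `rouge` package, not with this simplification):

```python
from collections import Counter

def rouge1_f1(hypothesis: str, reference: str) -> float:
    """Unigram-overlap F1 between two whitespace-tokenized strings."""
    hyp = Counter(hypothesis.split())
    ref = Counter(reference.split())
    overlap = sum((hyp & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

print(rouge1_f1("was ist ein lama", "was ist ein lama"))  # identical -> 1.0
```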