BramVanroy committed · Commit 2020cc8 · Parent(s): d0a42ca

Update README.md

README.md CHANGED

@@ -1,16 +1,16 @@
 ---
 base_model: facebook/mbart-large-cc25
-
-
-
-
-  results: []
+language:
+- en
+- nl
+- es
 ---
 
-
-should probably proofread and complete it, then remove this comment. -->
+# EN, ES and NL to AMR parsing (stratified)
 
-
+This version was trained on a subselection of the data. The AMR 3.0 corpus was translated to all the relevant languages. We then divided the dataset so
+that we only see a third of each language's dataset (so that, in total, we see the full AMR 3.0 corpus in size exactly once). In other words,
+all languages were undersampled for research purposes.
 
 This model is a fine-tuned version of [facebook/mbart-large-cc25](https://huggingface.co/facebook/mbart-large-cc25) on the None dataset.
 It achieves the following results on the evaluation set:
@@ -84,4 +84,4 @@ The following hyperparameters were used during training:
 - Transformers 4.34.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.14.2
-- Tokenizers 0.13.3
+- Tokenizers 0.13.3
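
The added paragraph describes a stratified undersampling scheme: AMR 3.0 is translated into English, Dutch and Spanish, and each language contributes only a third of the corpus, so the combined training set is the size of one full AMR 3.0 corpus. The sketch below is only an illustration of that idea; it is not the authors' released preprocessing code, and the helper names, the index alignment of the translations, and the disjointness of the thirds are assumptions.

```python
# Illustrative sketch of the stratified undersampling described above.
# Assumptions (not taken from the commit): the three translated corpora are
# index-aligned with the original AMR 3.0 corpus, and the thirds are disjoint.
import random

def assign_thirds(corpus_size, languages=("en", "nl", "es"), seed=42):
    """Randomly assign each example index to exactly one language,
    giving every language roughly a third of the corpus."""
    rng = random.Random(seed)
    indices = list(range(corpus_size))
    rng.shuffle(indices)
    per_lang = corpus_size // len(languages)
    assignment = {}
    for i, lang in enumerate(languages):
        start = i * per_lang
        end = (i + 1) * per_lang if i < len(languages) - 1 else corpus_size
        assignment[lang] = sorted(indices[start:end])
    return assignment

# Hypothetical usage: `translated[lang]` would hold the AMR 3.0 examples
# translated into `lang`, index-aligned with the English source.
# splits = assign_thirds(len(translated["en"]))
# train_set = [translated[lang][i] for lang, idxs in splits.items() for i in idxs]
```

Splitting this way keeps the total amount of training data comparable to a single monolingual AMR 3.0 run while still covering all three languages.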
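Since the card documents a fine-tuned mbart-large-cc25 sequence-to-sequence model for EN, ES and NL to AMR parsing, a minimal inference sketch may help readers. It assumes the checkpoint can be loaded as a standard MBart seq2seq model with the Transformers version listed above; the repository id is a placeholder rather than the actual model name, and the exact output format (a linearised AMR graph) depends on how the training targets were serialised.

```python
# Minimal sketch, not the authors' documented usage. The repo id is a placeholder.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "BramVanroy/<this-model>"  # hypothetical; replace with the real repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# mbart-large-cc25 identifies the input language with codes such as
# en_XX, nl_XX and es_XX.
tokenizer.src_lang = "en_XX"
inputs = tokenizer("The boy wants the girl to believe him.", return_tensors="pt")
generated = model.generate(**inputs, num_beams=5, max_length=512)
print(tokenizer.batch_decode(generated, skip_special_tokens=True)[0])
```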