ccdv/lsg-bart-base-4096-arxiv

text2text-generation

ccdv committed on May 9, 2022

Commit 24140c4 · 1 Parent(s): 379f42d

readme

Files changed (1)

README.md +3 -2

@@ -15,6 +15,9 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+**This model relies on a custom modeling file, you need to add trust_remote_code=True**\
+**See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
 # ccdv/lsg-bart-base-4096-arxiv
 This model is a fine-tuned version of [ccdv/lsg-bart-base-4096](https://huggingface.co/ccdv/lsg-bart-base-4096) on the scientific_papers arxiv dataset. \
@@ -37,8 +40,6 @@ The model relies on Local-Sparse-Global attention to handle long sequences:
 The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
 The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned. \
-**This model relies on a custom modeling file, you need to add trust_remote_code=True**\
-**See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
 ## Intended uses & limitations
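
The notice moved to the top of the README by this commit says the model needs `trust_remote_code=True`. A minimal sketch of what that could look like with the `transformers` Auto classes follows. The helper names `load_model_and_tokenizer` and `summarize`, the 4096-token input limit (inferred from the model name), and the generation length are assumptions for illustration, not taken from the README.

```python
# Minimal sketch, not the author's reference usage.
# Assumes `transformers` and `torch` are installed when the helpers are called.

MODEL_ID = "ccdv/lsg-bart-base-4096-arxiv"

# The README notice: the repository ships a custom modeling file, so loading
# must opt in to executing that code.
LOAD_KWARGS = {"trust_remote_code": True}


def load_model_and_tokenizer(model_id=MODEL_ID):
    """Hypothetical helper: load the tokenizer and seq2seq model."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, **LOAD_KWARGS)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id, **LOAD_KWARGS)
    return tokenizer, model


def summarize(text, tokenizer, model, max_input_tokens=4096, max_new_tokens=256):
    """Hypothetical helper: summarize one document.

    max_input_tokens=4096 is an assumption based on the model name;
    max_new_tokens=256 is an arbitrary illustrative value.
    """
    inputs = tokenizer(
        text,
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `load_model_and_tokenizer()` downloads the weights and runs the repository's custom modeling code locally, so it is worth reviewing that file before opting in.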