ccdv commited on
Commit
24140c4
·
1 Parent(s): 379f42d
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -15,6 +15,9 @@ model-index:
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
 
 
 
18
  # ccdv/lsg-bart-base-4096-arxiv
19
 
20
  This model is a fine-tuned version of [ccdv/lsg-bart-base-4096](https://huggingface.co/ccdv/lsg-bart-base-4096) on the scientific_papers arxiv dataset. \
@@ -37,8 +40,6 @@ The model relies on Local-Sparse-Global attention to handle long sequences:
37
 
38
  The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
39
  The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned. \
40
- **This model relies on a custom modeling file, you need to add trust_remote_code=True**\
41
- **See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
42
 
43
  ## Intended uses & limitations
44
 
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
+ **This model relies on a custom modeling file, you need to add trust_remote_code=True**\
19
+ **See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
20
+
21
  # ccdv/lsg-bart-base-4096-arxiv
22
 
23
  This model is a fine-tuned version of [ccdv/lsg-bart-base-4096](https://huggingface.co/ccdv/lsg-bart-base-4096) on the scientific_papers arxiv dataset. \
 
40
 
41
  The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
42
  The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned. \
 
 
43
 
44
  ## Intended uses & limitations
45