justinphan3110 commited on
Commit
eabecbf
1 Parent(s): fb6f40a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: vi
3
+ datasets:
4
+ - cc100
5
+ tags:
6
+ - summarization
7
+ - translation
8
+ - question-answering
9
+
10
+ license: mit
11
+ ---
12
+
13
+ # EnViT5-base
14
+
15
+ State-of-the-art pretrained Transformer-based encoder-decoder model for Vietnamese and English.
16
+
17
+ ## How to use
18
+ For more details, do check out [our Github repo](https://github.com/vietai/mtet).
19
+
20
+ [Finetunning Example can be found here](https://github.com/vietai/ViT5/tree/main/finetunning_huggingface).
21
+
22
+ ```python
23
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
24
+
25
+ tokenizer = AutoTokenizer.from_pretrained("VietAI/envit5-base")
26
+ model = AutoModelForSeq2SeqLM.from_pretrained("VietAI/envit5-base")
27
+ model.cuda()
28
+ ```
29
+
30
+ ## Citation
31
+ ```
32
+ @misc{mtet,
33
+ doi = {10.48550/ARXIV.2210.05610},
34
+ url = {https://arxiv.org/abs/2210.05610},
35
+ author = {Ngo, Chinh and Trinh, Trieu H. and Phan, Long and Tran, Hieu and Dang, Tai and Nguyen, Hieu and Nguyen, Minh and Luong, Minh-Thang},
36
+ keywords = {Computation and Language (cs.CL), Artificial Intelligence (cs.AI), FOS: Computer and information sciences, FOS: Computer and information sciences},
37
+ title = {MTet: Multi-domain Translation for English and Vietnamese},
38
+ publisher = {arXiv},
39
+ year = {2022},
40
+ copyright = {Creative Commons Attribution 4.0 International}
41
+ }
42
+
43
+ ```