metadata
license: openrail
library_name: peft
tags:
- generated_from_trainer
base_model: VietAI/envit5-translation
metrics:
- bleu
model-index:
- name: envit5-MedEV
  results: []
envit5-MedEV
This model is a fine-tuned version of VietAI/envit5-translation; the training dataset was not recorded in the card metadata. It achieves the following results on the evaluation set:
- Loss: 0.0795
- Bleu: 44.8343 -> 47.903 on the MedEV test set
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 32
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 128
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 10
- num_epochs: 5
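
The relationship between these values can be checked with a small sketch. It shows how the effective batch size of 128 follows from the per-device batch size and gradient accumulation, and how a cosine schedule with 10 warmup steps would shape the learning rate. The single-device setting, the `TOTAL_STEPS` value (taken as the last logged step), and the helper name `lr_at` are assumptions for illustration; the formula mirrors the usual linear-warmup-then-cosine-decay shape and is not copied from the training script.

```python
import math

# Values from the hyperparameter list above.
LEARNING_RATE = 2e-5
TRAIN_BATCH_SIZE = 32
GRAD_ACCUM_STEPS = 4
WARMUP_STEPS = 10

# Assumptions (not stated in the card): one device, and the schedule
# ends at the last logged step.
NUM_DEVICES = 1
TOTAL_STEPS = 26_600

# Effective batch size per optimizer update.
effective_batch = TRAIN_BATCH_SIZE * GRAD_ACCUM_STEPS * NUM_DEVICES


def lr_at(step, total_steps=TOTAL_STEPS, base_lr=LEARNING_RATE, warmup=WARMUP_STEPS):
    """Learning rate at an optimizer step: linear warmup, then cosine decay to zero."""
    if step < warmup:
        return base_lr * step / max(1, warmup)
    progress = (step - warmup) / max(1, total_steps - warmup)
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    print("effective batch size:", effective_batch)
    for step in (0, 5, 10, 13_300, 26_600):
        print(step, lr_at(step))
```

With only 10 warmup steps, the peak learning rate is reached almost immediately, so nearly the whole run is spent on the cosine decay.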
Training results
Training Loss | Epoch | Step | Validation Loss | Bleu |
---|---|---|---|---|
33.2165 | 0.1314 | 700 | 0.5906 | 0.0653 |
0.4083 | 0.2628 | 1400 | 0.1096 | 13.8606 |
0.114 | 0.3942 | 2100 | 0.0918 | 14.7674 |
0.1027 | 0.5256 | 2800 | 0.0890 | 14.9410 |
0.0997 | 0.6571 | 3500 | 0.0873 | 15.0741 |
0.0973 | 0.7885 | 4200 | 0.0861 | 15.1717 |
0.0964 | 0.9199 | 4900 | 0.0852 | 15.2362 |
0.0949 | 1.0513 | 5600 | 0.0844 | 15.3131 |
0.0947 | 1.1827 | 6300 | 0.0838 | 15.3815 |
0.0937 | 1.3141 | 7000 | 0.0832 | 15.5075 |
0.0935 | 1.4455 | 7700 | 0.0827 | 15.5932 |
0.092 | 1.5769 | 8400 | 0.0822 | 15.6434 |
0.0924 | 1.7084 | 9100 | 0.0818 | 15.7233 |
0.0915 | 1.8398 | 9800 | 0.0815 | 15.8051 |
0.0915 | 1.9712 | 10500 | 0.0812 | 15.8279 |
0.0906 | 2.1026 | 11200 | 0.0809 | 15.8559 |
0.0904 | 2.2340 | 11900 | 0.0807 | 15.9008 |
0.0908 | 2.3654 | 12600 | 0.0805 | 15.8917 |
0.0904 | 2.4968 | 13300 | 0.0803 | 15.9352 |
0.0895 | 2.6282 | 14000 | 0.0802 | 15.9442 |
0.0896 | 2.7597 | 14700 | 0.0800 | 15.9677 |
0.0894 | 2.8911 | 15400 | 0.0800 | 15.9459 |
0.09 | 3.0225 | 16100 | 0.0799 | 15.9746 |
0.0895 | 3.1539 | 16800 | 0.0798 | 16.0154 |
0.0892 | 3.2853 | 17500 | 0.0797 | 15.9976 |
0.0896 | 3.4167 | 18200 | 0.0797 | 16.0193 |
0.0893 | 3.5481 | 18900 | 0.0796 | 16.0179 |
0.0888 | 3.6795 | 19600 | 0.0796 | 16.0510 |
0.0887 | 3.8110 | 20300 | 0.0796 | 16.0226 |
0.0891 | 3.9424 | 21000 | 0.0796 | 16.0277 |
0.0892 | 4.0738 | 21700 | 0.0796 | 16.0302 |
0.0892 | 4.2052 | 22400 | 0.0795 | 16.0425 |
0.0886 | 4.3366 | 23100 | 0.0795 | 16.0452 |
0.0889 | 4.4680 | 23800 | 0.0795 | 16.0518 |
0.0888 | 4.5994 | 24500 | 0.0795 | 16.0397 |
0.0893 | 4.7308 | 25200 | 0.0795 | 16.0450 |
0.0889 | 4.8623 | 25900 | 0.0795 | 16.0497 |
0.0887 | 4.9937 | 26600 | 0.0795 | 16.0497 |
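
The epoch and step columns allow a rough estimate of how many optimizer steps make up one epoch, and from that an approximate training set size. The sketch below uses only the first and last rows of the table and the total batch size of 128 from the hyperparameter list; the result is an estimate under those assumptions (for example, a final partial batch is ignored), not a figure reported by the authors.

```python
# (epoch, step) pairs copied from the first and last rows of the table above.
LOGGED = [(0.1314, 700), (4.9937, 26600)]

# From the hyperparameter list: total_train_batch_size.
TOTAL_TRAIN_BATCH_SIZE = 128


def steps_per_epoch(epoch, step):
    """Estimate optimizer steps per epoch from one logged (epoch, step) pair."""
    return step / epoch


# Average the per-row estimates to smooth out rounding in the epoch column.
estimates = [steps_per_epoch(epoch, step) for epoch, step in LOGGED]
approx_steps_per_epoch = round(sum(estimates) / len(estimates))

# Each optimizer step consumes one effective batch of examples.
approx_train_examples = approx_steps_per_epoch * TOTAL_TRAIN_BATCH_SIZE

if __name__ == "__main__":
    print("approx. steps per epoch:", approx_steps_per_epoch)
    print("approx. training examples:", approx_train_examples)
```

Both rows give nearly the same steps-per-epoch value, which suggests the logging interval of 700 steps was constant throughout training.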
Framework versions
- PEFT 0.10.0
- Transformers 4.40.2
- Pytorch 2.3.0
- Datasets 2.19.1
- Tokenizers 0.19.1