metadata

license: openrail
library_name: peft
tags:
  - generated_from_trainer
base_model: VietAI/envit5-translation
metrics:
  - bleu
model-index:
  - name: envit5-MedEV
    results: []

envit5-MedEV

This model is a fine-tuned version of VietAI/envit5-translation on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.0795
Bleu: 44.8343 -> 47.903 on MedEV test set

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 32
eval_batch_size: 8
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 128
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 10
num_epochs: 5

Training results

Training Loss	Epoch	Step	Validation Loss	Bleu
33.2165	0.1314	700	0.5906	0.0653
0.4083	0.2628	1400	0.1096	13.8606
0.114	0.3942	2100	0.0918	14.7674
0.1027	0.5256	2800	0.0890	14.9410
0.0997	0.6571	3500	0.0873	15.0741
0.0973	0.7885	4200	0.0861	15.1717
0.0964	0.9199	4900	0.0852	15.2362
0.0949	1.0513	5600	0.0844	15.3131
0.0947	1.1827	6300	0.0838	15.3815
0.0937	1.3141	7000	0.0832	15.5075
0.0935	1.4455	7700	0.0827	15.5932
0.092	1.5769	8400	0.0822	15.6434
0.0924	1.7084	9100	0.0818	15.7233
0.0915	1.8398	9800	0.0815	15.8051
0.0915	1.9712	10500	0.0812	15.8279
0.0906	2.1026	11200	0.0809	15.8559
0.0904	2.2340	11900	0.0807	15.9008
0.0908	2.3654	12600	0.0805	15.8917
0.0904	2.4968	13300	0.0803	15.9352
0.0895	2.6282	14000	0.0802	15.9442
0.0896	2.7597	14700	0.0800	15.9677
0.0894	2.8911	15400	0.0800	15.9459
0.09	3.0225	16100	0.0799	15.9746
0.0895	3.1539	16800	0.0798	16.0154
0.0892	3.2853	17500	0.0797	15.9976
0.0896	3.4167	18200	0.0797	16.0193
0.0893	3.5481	18900	0.0796	16.0179
0.0888	3.6795	19600	0.0796	16.0510
0.0887	3.8110	20300	0.0796	16.0226
0.0891	3.9424	21000	0.0796	16.0277
0.0892	4.0738	21700	0.0796	16.0302
0.0892	4.2052	22400	0.0795	16.0425
0.0886	4.3366	23100	0.0795	16.0452
0.0889	4.4680	23800	0.0795	16.0518
0.0888	4.5994	24500	0.0795	16.0397
0.0893	4.7308	25200	0.0795	16.0450
0.0889	4.8623	25900	0.0795	16.0497
0.0887	4.9937	26600	0.0795	16.0497

Framework versions

PEFT 0.10.0
Transformers 4.40.2
Pytorch 2.3.0
Datasets 2.19.1
Tokenizers 0.19.1