natanmb's picture
update model card README.md
9542754
|
raw
history blame
1.67 kB
metadata
license: apache-2.0
tags:
  - summarization
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-finetuned-multi-news
    results: []

t5-small-finetuned-multi-news

This model is a fine-tuned version of t5-small on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 2.9807
  • Rouge1: 14.227
  • Rouge2: 4.3001
  • Rougel: 10.7052
  • Rougelsum: 12.5784

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5.6e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum
3.489 1.0 250 3.0402 14.1245 4.3869 10.6829 12.78
3.2405 2.0 500 2.9928 14.0471 4.1959 10.6048 12.4612
3.1865 3.0 750 2.9807 14.227 4.3001 10.7052 12.5784

Framework versions

  • Transformers 4.28.1
  • Pytorch 2.0.0+cu118
  • Datasets 2.11.0
  • Tokenizers 0.13.3