benjaminzwhite commited on
Commit
7175d27
1 Parent(s): 82da0b5

doc: update README.md

Browse files

- add ROUGE scores
- add qualitative subjective impressions of the summary quality

Files changed (1) hide show
  1. README.md +21 -5
README.md CHANGED
@@ -5,6 +5,13 @@ tags:
5
  model-index:
6
  - name: pegasus-samsum-nlp-with-transformers-ch06
7
  results: []
 
 
 
 
 
 
 
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -12,21 +19,30 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # pegasus-samsum-nlp-with-transformers-ch06
14
 
15
- This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on an unknown dataset.
 
16
  It achieves the following results on the evaluation set:
17
  - Loss: 1.4839
18
 
 
 
 
 
 
 
 
 
19
  ## Model description
20
 
21
- More information needed
22
 
23
  ## Intended uses & limitations
24
 
25
- More information needed
26
 
27
  ## Training and evaluation data
28
 
29
- More information needed
30
 
31
  ## Training procedure
32
 
@@ -56,4 +72,4 @@ The following hyperparameters were used during training:
56
  - Transformers 4.39.3
57
  - Pytorch 2.1.2
58
  - Datasets 2.18.0
59
- - Tokenizers 0.15.2
 
5
  model-index:
6
  - name: pegasus-samsum-nlp-with-transformers-ch06
7
  results: []
8
+ datasets:
9
+ - samsum
10
+ language:
11
+ - en
12
+ metrics:
13
+ - rouge
14
+ pipeline_tag: summarization
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
19
 
20
  # pegasus-samsum-nlp-with-transformers-ch06
21
 
22
+ This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the [SAMSum](https://huggingface.co/datasets/samsum) dataset.
23
+
24
  It achieves the following results on the evaluation set:
25
  - Loss: 1.4839
26
 
27
+ It achieves the following ROUGE scores on the test set:
28
+ - rouge1: 0.555556
29
+ - rouge2: 0.230769
30
+ - rougeL: 0.518519
31
+ - rougeLsum: 0.518519
32
+
33
+ **Quick human evaluation of summarization quality:** the results are generally good, after visual inspection of the summaries generated on test set conversations. However it seems some entities/attributions are incorrect (saw an example where model confuses peoples' roles in multi-person chat)
34
+
35
  ## Model description
36
 
37
+ PEGASUS doc can be found here: [https://huggingface.co/docs/transformers/model_doc/pegasus](https://huggingface.co/docs/transformers/model_doc/pegasus)
38
 
39
  ## Intended uses & limitations
40
 
41
+ This model was trained while studying the NLP With Transformers book; it is not intended to be used for any real applications.
42
 
43
  ## Training and evaluation data
44
 
45
+ The finetuning data is the SAMSum dataset only.
46
 
47
  ## Training procedure
48
 
 
72
  - Transformers 4.39.3
73
  - Pytorch 2.1.2
74
  - Datasets 2.18.0
75
+ - Tokenizers 0.15.2