ldos
/

text_shortening_model_v78

+---
+license: apache-2.0
+base_model: t5-small
+tags:
+- generated_from_trainer
+model-index:
+- name: text_shortening_model_v78
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# text_shortening_model_v78
+This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.2534
+- Bert precision: 0.8992
+- Bert recall: 0.8971
+- Bert f1-score: 0.8977
+- Average word count: 6.6982
+- Max word count: 16
+- Min word count: 2
+- Average token count: 10.8994
+- % shortened texts with length > 12: 1.4724
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 40
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bert precision | Bert recall | Bert f1-score | Average word count | Max word count | Min word count | Average token count | % shortened texts with length > 12 |
+|:-------------:|:-----:|:----:|:---------------:|:--------------:|:-----------:|:-------------:|:------------------:|:--------------:|:--------------:|:-------------------:|:----------------------------------:|
+| 1.7828        | 1.0   | 30   | 1.2876          | 0.8895         | 0.8866      | 0.8876        | 6.7595             | 16             | 2              | 10.6969             | 1.3497                             |
+| 1.3539        | 2.0   | 60   | 1.1852          | 0.8938         | 0.8896      | 0.8912        | 6.6074             | 16             | 2              | 10.6442             | 1.227                              |
+| 1.2004        | 3.0   | 90   | 1.1374          | 0.8942         | 0.891       | 0.8921        | 6.6896             | 16             | 2              | 10.7534             | 1.3497                             |
+| 1.1048        | 4.0   | 120  | 1.1165          | 0.8929         | 0.8931      | 0.8926        | 6.8025             | 16             | 2              | 10.9178             | 1.7178                             |
+| 1.0322        | 5.0   | 150  | 1.0995          | 0.8953         | 0.896       | 0.8952        | 6.827              | 17             | 2              | 11.0037             | 1.9632                             |
+| 0.9687        | 6.0   | 180  | 1.0822          | 0.894          | 0.8947      | 0.8939        | 6.9043             | 17             | 3              | 11.0147             | 2.2086                             |
+| 0.9076        | 7.0   | 210  | 1.0858          | 0.8976         | 0.8965      | 0.8966        | 6.7951             | 17             | 2              | 10.9804             | 1.8405                             |
+| 0.8602        | 8.0   | 240  | 1.0894          | 0.8977         | 0.897       | 0.8969        | 6.7902             | 17             | 2              | 10.9558             | 2.0859                             |
+| 0.8076        | 9.0   | 270  | 1.0945          | 0.8982         | 0.8974      | 0.8974        | 6.7669             | 17             | 2              | 10.9779             | 1.9632                             |
+| 0.7739        | 10.0  | 300  | 1.0843          | 0.8974         | 0.8959      | 0.8962        | 6.719              | 17             | 2              | 10.8564             | 1.9632                             |
+| 0.7361        | 11.0  | 330  | 1.0982          | 0.8972         | 0.8967      | 0.8965        | 6.789              | 17             | 3              | 10.9779             | 1.5951                             |
+| 0.7052        | 12.0  | 360  | 1.0953          | 0.8977         | 0.8958      | 0.8963        | 6.7055             | 17             | 2              | 10.8798             | 1.8405                             |
+| 0.6749        | 13.0  | 390  | 1.1098          | 0.8981         | 0.8967      | 0.897         | 6.7325             | 17             | 2              | 10.908              | 2.0859                             |
+| 0.6441        | 14.0  | 420  | 1.1158          | 0.8996         | 0.8981      | 0.8984        | 6.7043             | 17             | 2              | 10.8761             | 1.5951                             |
+| 0.623         | 15.0  | 450  | 1.1146          | 0.9009         | 0.8989      | 0.8994        | 6.7006             | 17             | 2              | 10.9104             | 1.3497                             |
+| 0.604         | 16.0  | 480  | 1.1267          | 0.902          | 0.8988      | 0.9           | 6.7104             | 17             | 2              | 10.8491             | 1.3497                             |
+| 0.583         | 17.0  | 510  | 1.1357          | 0.8999         | 0.8989      | 0.8989        | 6.7706             | 17             | 2              | 10.9767             | 2.0859                             |
+| 0.5605        | 18.0  | 540  | 1.1513          | 0.8996         | 0.899       | 0.8989        | 6.7534             | 17             | 2              | 11.0356             | 2.0859                             |
+| 0.5439        | 19.0  | 570  | 1.1643          | 0.8993         | 0.8988      | 0.8986        | 6.816              | 17             | 2              | 11.0638             | 1.8405                             |
+| 0.5281        | 20.0  | 600  | 1.1626          | 0.8998         | 0.8985      | 0.8987        | 6.7399             | 16             | 2              | 11.0025             | 1.4724                             |
+| 0.5165        | 21.0  | 630  | 1.1720          | 0.9002         | 0.8974      | 0.8983        | 6.6417             | 16             | 2              | 10.7816             | 1.5951                             |
+| 0.5002        | 22.0  | 660  | 1.1834          | 0.9005         | 0.8995      | 0.8996        | 6.7607             | 17             | 2              | 10.9693             | 2.2086                             |
+| 0.486         | 23.0  | 690  | 1.2005          | 0.9027         | 0.8979      | 0.8999        | 6.5853             | 17             | 2              | 10.7264             | 1.8405                             |
+| 0.4757        | 24.0  | 720  | 1.1916          | 0.9008         | 0.8991      | 0.8995        | 6.6994             | 16             | 2              | 10.8859             | 2.0859                             |
+| 0.4679        | 25.0  | 750  | 1.2023          | 0.8988         | 0.8985      | 0.8982        | 6.7853             | 16             | 2              | 10.984              | 2.0859                             |
+| 0.4502        | 26.0  | 780  | 1.2108          | 0.9012         | 0.8991      | 0.8997        | 6.7141             | 16             | 2              | 10.8577             | 2.3313                             |
+| 0.4452        | 27.0  | 810  | 1.2188          | 0.9015         | 0.8987      | 0.8996        | 6.6098             | 14             | 2              | 10.7853             | 1.8405                             |
+| 0.4351        | 28.0  | 840  | 1.2187          | 0.9005         | 0.8997      | 0.8996        | 6.7681             | 16             | 2              | 10.9877             | 2.0859                             |
+| 0.4223        | 29.0  | 870  | 1.2319          | 0.9001         | 0.8985      | 0.8989        | 6.6798             | 16             | 2              | 10.9031             | 1.7178                             |
+| 0.4186        | 30.0  | 900  | 1.2386          | 0.8987         | 0.8982      | 0.898         | 6.7509             | 16             | 2              | 10.9718             | 1.8405                             |
+| 0.4146        | 31.0  | 930  | 1.2399          | 0.9001         | 0.8989      | 0.8991        | 6.719              | 16             | 2              | 10.9067             | 1.8405                             |
+| 0.4039        | 32.0  | 960  | 1.2467          | 0.8997         | 0.8992      | 0.899         | 6.7632             | 16             | 2              | 11.0074             | 2.0859                             |
+| 0.3998        | 33.0  | 990  | 1.2425          | 0.8997         | 0.899       | 0.8989        | 6.7497             | 16             | 2              | 10.946              | 1.8405                             |
+| 0.396         | 34.0  | 1020 | 1.2474          | 0.9001         | 0.8982      | 0.8987        | 6.6945             | 16             | 2              | 10.8773             | 1.7178                             |
+| 0.3938        | 35.0  | 1050 | 1.2497          | 0.9003         | 0.8977      | 0.8985        | 6.6454             | 16             | 2              | 10.8245             | 0.9816                             |
+| 0.3828        | 36.0  | 1080 | 1.2511          | 0.9002         | 0.8978      | 0.8986        | 6.6798             | 16             | 2              | 10.8663             | 1.4724                             |
+| 0.3842        | 37.0  | 1110 | 1.2482          | 0.9002         | 0.898       | 0.8987        | 6.6883             | 16             | 2              | 10.8883             | 1.7178                             |
+| 0.391         | 38.0  | 1140 | 1.2507          | 0.9001         | 0.8974      | 0.8983        | 6.6577             | 16             | 2              | 10.8528             | 1.3497                             |
+| 0.3796        | 39.0  | 1170 | 1.2529          | 0.8994         | 0.8972      | 0.8979        | 6.6945             | 16             | 2              | 10.8933             | 1.5951                             |
+| 0.3807        | 40.0  | 1200 | 1.2534          | 0.8992         | 0.8971      | 0.8977        | 6.6982             | 16             | 2              | 10.8994             | 1.4724                             |
+### Framework versions
+- Transformers 4.33.1
+- Pytorch 2.0.1+cu118
+- Datasets 2.14.5
+- Tokenizers 0.13.3

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.33.1"
+}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60a6c4a7ae2dd091ff62189f80f51d12dcc5603e26b559459b01318c7866f493
 size 242071641

 version https://git-lfs.github.com/spec/v1
+oid sha256:08ab7c0349d85e5fa86d1d7f89e0be8dc5d994baa7f1b79e7a43644051583ada
 size 242071641