hieungo1410
commited on
Commit
•
911c8d3
1
Parent(s):
26f828e
End of training
Browse files- README.md +32 -52
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
|
16 |
This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
- Loss: 0.
|
19 |
- Score: 42.9309
|
20 |
- Counts: [2102, 1955, 1808, 1661]
|
21 |
- Totals: [2107, 1960, 1813, 1666]
|
@@ -47,62 +47,42 @@ The following hyperparameters were used during training:
|
|
47 |
- seed: 42
|
48 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
49 |
- lr_scheduler_type: linear
|
50 |
-
- num_epochs:
|
51 |
|
52 |
### Training results
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Score | Counts | Totals | Precisions | Bp | Sys Len | Ref Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:------------------------:|:------------------------:|:----------------------------------------------------------------------------:|:------:|:-------:|:-------:|
|
56 |
-
| No log | 1.0 | 71 | 0.
|
57 |
-
| No log | 2.0 | 142 | 0.
|
58 |
-
| No log | 3.0 | 213 | 0.
|
59 |
-
| No log | 4.0 | 284 | 0.
|
60 |
-
| No log | 5.0 | 355 | 0.
|
61 |
-
| No log | 6.0 | 426 | 0.
|
62 |
-
| No log | 7.0 | 497 | 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.
|
67 |
-
| 0.
|
68 |
-
| 0.
|
69 |
-
| 0.
|
70 |
-
| 0.
|
71 |
-
| 0.
|
72 |
-
| 0.
|
73 |
-
| 0.
|
74 |
-
| 0.
|
75 |
-
| 0.
|
76 |
-
| 0.
|
77 |
-
| 0.
|
78 |
-
| 0.
|
79 |
-
| 0.
|
80 |
-
| 0.
|
81 |
-
| 0.
|
82 |
-
| 0.
|
83 |
-
| 0.
|
84 |
-
| 0.
|
85 |
-
| 0.
|
86 |
-
| 0.0169 | 31.0 | 2201 | 0.0016 | 42.9120 | [2102, 1955, 1807, 1659] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.66905681191396, 99.5798319327731] | 0.4305 | 2107 | 3883 |
|
87 |
-
| 0.0169 | 32.0 | 2272 | 0.0012 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
88 |
-
| 0.0169 | 33.0 | 2343 | 0.0015 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
89 |
-
| 0.0169 | 34.0 | 2414 | 0.0022 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
90 |
-
| 0.0169 | 35.0 | 2485 | 0.0005 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
91 |
-
| 0.0108 | 36.0 | 2556 | 0.0006 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
92 |
-
| 0.0108 | 37.0 | 2627 | 0.0009 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
93 |
-
| 0.0108 | 38.0 | 2698 | 0.0013 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
94 |
-
| 0.0108 | 39.0 | 2769 | 0.0010 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
95 |
-
| 0.0108 | 40.0 | 2840 | 0.0010 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
96 |
-
| 0.0108 | 41.0 | 2911 | 0.0002 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
97 |
-
| 0.0108 | 42.0 | 2982 | 0.0006 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
98 |
-
| 0.0069 | 43.0 | 3053 | 0.0002 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
99 |
-
| 0.0069 | 44.0 | 3124 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
100 |
-
| 0.0069 | 45.0 | 3195 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
101 |
-
| 0.0069 | 46.0 | 3266 | 0.0001 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
102 |
-
| 0.0069 | 47.0 | 3337 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
103 |
-
| 0.0069 | 48.0 | 3408 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
104 |
-
| 0.0069 | 49.0 | 3479 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
105 |
-
| 0.0049 | 50.0 | 3550 | 0.0000 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
106 |
|
107 |
|
108 |
### Framework versions
|
|
|
15 |
|
16 |
This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- Loss: 0.0012
|
19 |
- Score: 42.9309
|
20 |
- Counts: [2102, 1955, 1808, 1661]
|
21 |
- Totals: [2107, 1960, 1813, 1666]
|
|
|
47 |
- seed: 42
|
48 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
49 |
- lr_scheduler_type: linear
|
50 |
+
- num_epochs: 30
|
51 |
|
52 |
### Training results
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Score | Counts | Totals | Precisions | Bp | Sys Len | Ref Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:------------------------:|:------------------------:|:----------------------------------------------------------------------------:|:------:|:-------:|:-------:|
|
56 |
+
| No log | 1.0 | 71 | 0.3122 | 34.4724 | [1905, 1607, 1368, 1146] | [2158, 2011, 1864, 1717] | [88.27618164967562, 79.91049229239185, 73.39055793991416, 66.74432149097262] | 0.4496 | 2158 | 3883 |
|
57 |
+
| No log | 2.0 | 142 | 0.1943 | 38.1918 | [1998, 1748, 1542, 1346] | [2156, 2009, 1862, 1715] | [92.67161410018552, 87.00846192135391, 82.8141783029001, 78.48396501457727] | 0.4489 | 2156 | 3883 |
|
58 |
+
| No log | 3.0 | 213 | 0.1372 | 40.0348 | [2027, 1826, 1649, 1477] | [2133, 1986, 1839, 1692] | [95.03047351148616, 91.9436052366566, 89.66829798803698, 87.29314420803783] | 0.4402 | 2133 | 3883 |
|
59 |
+
| No log | 4.0 | 284 | 0.0938 | 41.0633 | [2055, 1866, 1694, 1526] | [2138, 1991, 1844, 1697] | [96.1178671655753, 93.72174786539428, 91.86550976138828, 89.92339422510312] | 0.4421 | 2138 | 3883 |
|
60 |
+
| No log | 5.0 | 355 | 0.0709 | 41.7562 | [2080, 1906, 1738, 1573] | [2121, 1974, 1827, 1680] | [98.06694955209807, 96.55521783181358, 95.12862616310892, 93.63095238095238] | 0.4357 | 2121 | 3883 |
|
61 |
+
| No log | 6.0 | 426 | 0.0585 | 41.9956 | [2080, 1917, 1758, 1599] | [2113, 1966, 1819, 1672] | [98.43823946994794, 97.50762970498474, 96.64650907091809, 95.63397129186603] | 0.4327 | 2113 | 3883 |
|
62 |
+
| No log | 7.0 | 497 | 0.0467 | 42.3984 | [2091, 1931, 1774, 1617] | [2117, 1970, 1823, 1676] | [98.77184695323571, 98.02030456852792, 97.31212287438288, 96.47971360381861] | 0.4342 | 2117 | 3883 |
|
63 |
+
| 0.3666 | 8.0 | 568 | 0.0431 | 42.2176 | [2083, 1924, 1767, 1610] | [2116, 1969, 1822, 1675] | [98.44045368620039, 97.71457592686643, 96.98133918770581, 96.11940298507463] | 0.4338 | 2116 | 3883 |
|
64 |
+
| 0.3666 | 9.0 | 639 | 0.0361 | 42.5115 | [2095, 1938, 1780, 1622] | [2116, 1969, 1822, 1675] | [99.00756143667297, 98.42559674961909, 97.69484083424808, 96.83582089552239] | 0.4338 | 2116 | 3883 |
|
65 |
+
| 0.3666 | 10.0 | 710 | 0.0269 | 42.4989 | [2091, 1938, 1783, 1627] | [2113, 1966, 1819, 1672] | [98.95882631329863, 98.57578840284842, 98.02089059923034, 97.30861244019138] | 0.4327 | 2113 | 3883 |
|
66 |
+
| 0.3666 | 11.0 | 781 | 0.0227 | 42.4765 | [2092, 1939, 1787, 1636] | [2105, 1958, 1811, 1664] | [99.38242280285036, 99.02962206332992, 98.67476532302595, 98.3173076923077] | 0.4297 | 2105 | 3883 |
|
67 |
+
| 0.3666 | 12.0 | 852 | 0.0233 | 43.0008 | [2106, 1956, 1803, 1650] | [2117, 1970, 1823, 1676] | [99.48039678790741, 99.28934010152284, 98.90290729566648, 98.44868735083533] | 0.4342 | 2117 | 3883 |
|
68 |
+
| 0.3666 | 13.0 | 923 | 0.0185 | 42.8239 | [2103, 1953, 1800, 1646] | [2110, 1963, 1816, 1669] | [99.66824644549763, 99.49057564951605, 99.11894273127753, 98.62192929898143] | 0.4316 | 2110 | 3883 |
|
69 |
+
| 0.3666 | 14.0 | 994 | 0.0184 | 42.8509 | [2099, 1950, 1802, 1654] | [2110, 1963, 1816, 1669] | [99.47867298578198, 99.33774834437087, 99.22907488986785, 99.10125823846614] | 0.4316 | 2110 | 3883 |
|
70 |
+
| 0.0724 | 15.0 | 1065 | 0.0107 | 42.8512 | [2101, 1951, 1800, 1649] | [2112, 1965, 1818, 1671] | [99.47916666666667, 99.28753180661577, 99.00990099009901, 98.68342309994016] | 0.4323 | 2112 | 3883 |
|
71 |
+
| 0.0724 | 16.0 | 1136 | 0.0093 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
72 |
+
| 0.0724 | 17.0 | 1207 | 0.0075 | 42.8327 | [2100, 1950, 1800, 1650] | [2111, 1964, 1817, 1670] | [99.4789199431549, 99.28716904276986, 99.06439185470556, 98.80239520958084] | 0.4320 | 2111 | 3883 |
|
73 |
+
| 0.0724 | 18.0 | 1278 | 0.0106 | 42.8931 | [2102, 1955, 1806, 1657] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.61389961389962, 99.45978391356543] | 0.4305 | 2107 | 3883 |
|
74 |
+
| 0.0724 | 19.0 | 1349 | 0.0051 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
75 |
+
| 0.0724 | 20.0 | 1420 | 0.0045 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
76 |
+
| 0.0724 | 21.0 | 1491 | 0.0026 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
77 |
+
| 0.0341 | 22.0 | 1562 | 0.0036 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
78 |
+
| 0.0341 | 23.0 | 1633 | 0.0024 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
79 |
+
| 0.0341 | 24.0 | 1704 | 0.0021 | 42.8573 | [2102, 1953, 1804, 1655] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.64285714285714, 99.50358521787093, 99.33973589435774] | 0.4305 | 2107 | 3883 |
|
80 |
+
| 0.0341 | 25.0 | 1775 | 0.0026 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
81 |
+
| 0.0341 | 26.0 | 1846 | 0.0023 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
82 |
+
| 0.0341 | 27.0 | 1917 | 0.0017 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
83 |
+
| 0.0341 | 28.0 | 1988 | 0.0014 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
84 |
+
| 0.0193 | 29.0 | 2059 | 0.0013 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
85 |
+
| 0.0193 | 30.0 | 2130 | 0.0012 | 42.9309 | [2102, 1955, 1808, 1661] | [2107, 1960, 1813, 1666] | [99.76269577598481, 99.74489795918367, 99.72421400992829, 99.69987995198079] | 0.4305 | 2107 | 3883 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
86 |
|
87 |
|
88 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 903834408
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f7f35e6718e8e88d27002154afa3a4df7d2d604c1f098f382d2cf7010ffadce0
|
3 |
size 903834408
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4411
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a082ca4ad4336a01567c9742f73a787b2772ac4a9f76669ec79a77080b935dca
|
3 |
size 4411
|