Training in progress, epoch 10
- model.safetensors +1 -1
- tb/events.out.tfevents.1725056887.6b97e535edda.51600.0 +2 -2
- train.log +70 -1
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:6017fad51f5dc5737798ff6d9fd9fbfbc1d8d5e8c58fda4e121890b2bd69b001
 size 496244100
tb/events.out.tfevents.1725056887.6b97e535edda.51600.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:6b64cb2bd4ea4ebc30acb43ec3597dee9e97889efa2097df4b63784bd9dd34ba
+size 9984
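The two files above are stored as Git LFS pointers, so the diff only shows three key/value lines (`version`, `oid`, `size`) rather than the binary contents. A minimal sketch of reading such a pointer follows; the helper name `parse_lfs_pointer` is hypothetical and not part of any tool used in this commit, and the sample text is the new `model.safetensors` pointer shown above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (illustrative helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algorithm": algorithm,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New pointer for model.safetensors, copied from the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:6017fad51f5dc5737798ff6d9fd9fbfbc1d8d5e8c58fda4e121890b2bd69b001
size 496244100
"""

info = parse_lfs_pointer(pointer_text)
print(info["oid_algorithm"], info["size"])
```

Because the size is unchanged (496244100 bytes) while the oid differs, the commit replaced the weights with a file of the same byte length but different contents.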
train.log
CHANGED
@@ -774,4 +774,73 @@ You should probably TRAIN this model on a down-stream task to be able to use it
|
|
774 |
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:34:45,932 >> Special tokens file saved in /content/dissertation/scripts/ner/output/checkpoint-1350/special_tokens_map.json
|
775 |
[INFO|tokenization_utils_base.py:2574] 2024-08-30 22:34:51,508 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/tokenizer_config.json
|
776 |
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:34:51,508 >> Special tokens file saved in /content/dissertation/scripts/ner/output/special_tokens_map.json
|
777 |
-
|
778 |
90%|βββββββββ | 1351/1500 [06:44<09:37, 3.87s/it]
|
779 |
90%|βββββββββ | 1352/1500 [06:44<06:53, 2.80s/it]
|
780 |
90%|βββββββββ | 1353/1500 [06:45<05:01, 2.05s/it]
|
781 |
90%|βββββββββ | 1354/1500 [06:45<03:38, 1.50s/it]
|
782 |
90%|βββββββββ | 1355/1500 [06:45<02:42, 1.12s/it]
|
783 |
90%|βββββββββ | 1356/1500 [06:45<02:03, 1.17it/s]
|
784 |
90%|βββββββββ | 1357/1500 [06:46<01:34, 1.52it/s]
|
785 |
91%|βββββββββ | 1358/1500 [06:46<01:15, 1.89it/s]
|
786 |
91%|βββββββββ | 1359/1500 [06:46<00:59, 2.37it/s]
|
787 |
91%|βββββββββ | 1360/1500 [06:46<00:49, 2.84it/s]
|
788 |
91%|βββββββββ | 1361/1500 [06:46<00:41, 3.36it/s]
|
789 |
91%|βββββββββ | 1362/1500 [06:47<00:43, 3.19it/s]
|
790 |
91%|βββββββββ | 1363/1500 [06:47<00:38, 3.56it/s]
|
791 |
91%|βββββββββ | 1364/1500 [06:47<00:36, 3.75it/s]
|
792 |
91%|βββββββββ | 1365/1500 [06:47<00:34, 3.93it/s]
|
793 |
91%|βββββββββ | 1366/1500 [06:48<00:32, 4.09it/s]
|
794 |
91%|βββββββββ | 1367/1500 [06:48<00:30, 4.43it/s]
|
795 |
91%|βββββββββ | 1368/1500 [06:48<00:28, 4.57it/s]
|
796 |
91%|ββββββββββ| 1369/1500 [06:48<00:27, 4.74it/s]
|
797 |
91%|ββββββββββ| 1370/1500 [06:48<00:28, 4.54it/s]
|
798 |
91%|ββββββββββ| 1371/1500 [06:49<00:32, 4.00it/s]
|
799 |
91%|ββββββββββ| 1372/1500 [06:49<00:32, 3.93it/s]
|
800 |
92%|ββββββββββ| 1373/1500 [06:49<00:31, 4.04it/s]
|
801 |
92%|ββββββββββ| 1374/1500 [06:49<00:30, 4.11it/s]
|
802 |
92%|ββββββββββ| 1375/1500 [06:50<00:28, 4.42it/s]
|
803 |
92%|ββββββββββ| 1376/1500 [06:50<00:27, 4.45it/s]
|
804 |
92%|ββββββββββ| 1377/1500 [06:50<00:27, 4.48it/s]
|
805 |
92%|ββββββββββ| 1378/1500 [06:50<00:25, 4.75it/s]
|
806 |
92%|ββββββββββ| 1379/1500 [06:50<00:24, 4.91it/s]
|
807 |
92%|ββββββββββ| 1380/1500 [06:51<00:25, 4.76it/s]
|
808 |
92%|ββββββββββ| 1381/1500 [06:51<00:24, 4.77it/s]
|
809 |
92%|ββββββββββ| 1382/1500 [06:51<00:24, 4.81it/s]
|
810 |
92%|ββββββββββ| 1383/1500 [06:51<00:24, 4.77it/s]
|
811 |
92%|ββββββββββ| 1384/1500 [06:52<00:26, 4.33it/s]
|
812 |
92%|ββββββββββ| 1385/1500 [06:52<00:25, 4.43it/s]
|
813 |
92%|ββββββββββ| 1386/1500 [06:52<00:27, 4.15it/s]
|
814 |
92%|ββββββββββ| 1387/1500 [06:52<00:24, 4.69it/s]
|
815 |
93%|ββββββββββ| 1388/1500 [06:53<00:26, 4.28it/s]
|
816 |
93%|ββββββββββ| 1389/1500 [06:53<00:25, 4.33it/s]
|
817 |
93%|ββββββββββ| 1390/1500 [06:53<00:24, 4.46it/s]
|
818 |
93%|ββββββββββ| 1391/1500 [06:53<00:29, 3.67it/s]
|
819 |
93%|ββββββββββ| 1392/1500 [06:54<00:27, 3.93it/s]
|
820 |
93%|ββββββββββ| 1393/1500 [06:54<00:26, 3.98it/s]
|
821 |
93%|ββββββββββ| 1394/1500 [06:54<00:24, 4.26it/s]
|
822 |
93%|ββββββββββ| 1395/1500 [06:54<00:23, 4.51it/s]
|
823 |
93%|ββββββββββ| 1396/1500 [06:54<00:22, 4.61it/s]
|
824 |
93%|ββββββββββ| 1397/1500 [06:55<00:24, 4.20it/s]
|
825 |
93%|ββββββββββ| 1398/1500 [06:55<00:22, 4.48it/s]
|
826 |
93%|ββββββββββ| 1399/1500 [06:55<00:24, 4.14it/s]
|
827 |
93%|ββββββββββ| 1400/1500 [06:55<00:25, 3.94it/s]
|
828 |
93%|ββββββββββ| 1401/1500 [06:56<00:24, 4.12it/s]
|
829 |
93%|ββββββββββ| 1402/1500 [06:56<00:24, 4.01it/s]
|
830 |
94%|ββββββββββ| 1403/1500 [06:56<00:25, 3.85it/s]
|
831 |
94%|ββββββββββ| 1404/1500 [06:56<00:23, 4.09it/s]
|
832 |
94%|ββββββββββ| 1405/1500 [06:57<00:23, 4.10it/s]
|
833 |
94%|ββββββββββ| 1406/1500 [06:57<00:21, 4.33it/s]
|
834 |
94%|ββββββββββ| 1407/1500 [06:57<00:24, 3.75it/s]
|
835 |
94%|ββββββββββ| 1408/1500 [06:57<00:22, 4.10it/s]
|
836 |
94%|ββββββββββ| 1409/1500 [06:58<00:24, 3.77it/s]
|
|
|
837 |
90%|βββββββββ | 1351/1500 [06:44<09:37, 3.87s/it]
|
838 |
90%|βββββββββ | 1352/1500 [06:44<06:53, 2.80s/it]
|
839 |
90%|βββββββββ | 1353/1500 [06:45<05:01, 2.05s/it]
|
840 |
90%|βββββββββ | 1354/1500 [06:45<03:38, 1.50s/it]
|
841 |
90%|βββββββββ | 1355/1500 [06:45<02:42, 1.12s/it]
|
842 |
90%|βββββββββ | 1356/1500 [06:45<02:03, 1.17it/s]
|
843 |
90%|βββββββββ | 1357/1500 [06:46<01:34, 1.52it/s]
|
844 |
91%|βββββββββ | 1358/1500 [06:46<01:15, 1.89it/s]
|
845 |
91%|βββββββββ | 1359/1500 [06:46<00:59, 2.37it/s]
|
846 |
91%|βββββββββ | 1360/1500 [06:46<00:49, 2.84it/s]
|
847 |
91%|βββββββββ | 1361/1500 [06:46<00:41, 3.36it/s]
|
848 |
91%|βββββββββ | 1362/1500 [06:47<00:43, 3.19it/s]
|
849 |
91%|βββββββββ | 1363/1500 [06:47<00:38, 3.56it/s]
|
850 |
91%|βββββββββ | 1364/1500 [06:47<00:36, 3.75it/s]
|
851 |
91%|βββββββββ | 1365/1500 [06:47<00:34, 3.93it/s]
|
852 |
91%|βββββββββ | 1366/1500 [06:48<00:32, 4.09it/s]
|
853 |
91%|βββββββββ | 1367/1500 [06:48<00:30, 4.43it/s]
|
854 |
91%|βββββββββ | 1368/1500 [06:48<00:28, 4.57it/s]
|
855 |
91%|ββββββββββ| 1369/1500 [06:48<00:27, 4.74it/s]
|
856 |
91%|ββββββββββ| 1370/1500 [06:48<00:28, 4.54it/s]
|
857 |
91%|ββββββββββ| 1371/1500 [06:49<00:32, 4.00it/s]
|
858 |
91%|ββββββββββ| 1372/1500 [06:49<00:32, 3.93it/s]
|
859 |
92%|ββββββββββ| 1373/1500 [06:49<00:31, 4.04it/s]
|
860 |
92%|ββββββββββ| 1374/1500 [06:49<00:30, 4.11it/s]
|
861 |
92%|ββββββββββ| 1375/1500 [06:50<00:28, 4.42it/s]
|
862 |
92%|ββββββββββ| 1376/1500 [06:50<00:27, 4.45it/s]
|
863 |
92%|ββββββββββ| 1377/1500 [06:50<00:27, 4.48it/s]
|
864 |
92%|ββββββββββ| 1378/1500 [06:50<00:25, 4.75it/s]
|
865 |
92%|ββββββββββ| 1379/1500 [06:50<00:24, 4.91it/s]
|
866 |
92%|ββββββββββ| 1380/1500 [06:51<00:25, 4.76it/s]
|
867 |
92%|ββββββββββ| 1381/1500 [06:51<00:24, 4.77it/s]
|
868 |
92%|ββββββββββ| 1382/1500 [06:51<00:24, 4.81it/s]
|
869 |
92%|ββββββββββ| 1383/1500 [06:51<00:24, 4.77it/s]
|
870 |
92%|ββββββββββ| 1384/1500 [06:52<00:26, 4.33it/s]
|
871 |
92%|ββββββββββ| 1385/1500 [06:52<00:25, 4.43it/s]
|
872 |
92%|ββββββββββ| 1386/1500 [06:52<00:27, 4.15it/s]
|
873 |
92%|ββββββββββ| 1387/1500 [06:52<00:24, 4.69it/s]
|
874 |
93%|ββββββββββ| 1388/1500 [06:53<00:26, 4.28it/s]
|
875 |
93%|ββββββββββ| 1389/1500 [06:53<00:25, 4.33it/s]
|
876 |
93%|ββββββββββ| 1390/1500 [06:53<00:24, 4.46it/s]
|
877 |
93%|ββββββββββ| 1391/1500 [06:53<00:29, 3.67it/s]
|
878 |
93%|ββββββββββ| 1392/1500 [06:54<00:27, 3.93it/s]
|
879 |
93%|ββββββββββ| 1393/1500 [06:54<00:26, 3.98it/s]
|
880 |
93%|ββββββββββ| 1394/1500 [06:54<00:24, 4.26it/s]
|
881 |
93%|ββββββββββ| 1395/1500 [06:54<00:23, 4.51it/s]
|
882 |
93%|ββββββββββ| 1396/1500 [06:54<00:22, 4.61it/s]
|
883 |
93%|ββββββββββ| 1397/1500 [06:55<00:24, 4.20it/s]
|
884 |
93%|ββββββββββ| 1398/1500 [06:55<00:22, 4.48it/s]
|
885 |
93%|ββββββββββ| 1399/1500 [06:55<00:24, 4.14it/s]
|
886 |
93%|ββββββββββ| 1400/1500 [06:55<00:25, 3.94it/s]
|
887 |
93%|ββββββββββ| 1401/1500 [06:56<00:24, 4.12it/s]
|
888 |
93%|ββββββββββ| 1402/1500 [06:56<00:24, 4.01it/s]
|
889 |
94%|ββββββββββ| 1403/1500 [06:56<00:25, 3.85it/s]
|
890 |
94%|ββββββββββ| 1404/1500 [06:56<00:23, 4.09it/s]
|
891 |
94%|ββββββββββ| 1405/1500 [06:57<00:23, 4.10it/s]
|
892 |
94%|ββββββββββ| 1406/1500 [06:57<00:21, 4.33it/s]
|
893 |
94%|ββββββββββ| 1407/1500 [06:57<00:24, 3.75it/s]
|
894 |
94%|ββββββββββ| 1408/1500 [06:57<00:22, 4.10it/s]
|
895 |
94%|ββββββββββ| 1409/1500 [06:58<00:24, 3.77it/s]
|
896 |
94%|ββββββββββ| 1410/1500 [06:58<00:24, 3.70it/s]
|
897 |
94%|ββββββββββ| 1411/1500 [06:58<00:22, 4.04it/s]
|
898 |
94%|ββββββββββ| 1412/1500 [06:58<00:20, 4.20it/s]
|
899 |
94%|ββββββββββ| 1413/1500 [06:59<00:20, 4.26it/s]
|
900 |
94%|ββββββββββ| 1414/1500 [06:59<00:19, 4.35it/s]
|
901 |
94%|ββββββββββ| 1415/1500 [06:59<00:18, 4.61it/s]
|
902 |
94%|ββββββββββ| 1416/1500 [06:59<00:17, 4.69it/s]
|
903 |
94%|ββββββββββ| 1417/1500 [06:59<00:18, 4.58it/s]
|
904 |
95%|ββββββββββ| 1418/1500 [07:00<00:18, 4.49it/s]
|
905 |
95%|ββββββββββ| 1419/1500 [07:00<00:23, 3.52it/s]
|
906 |
95%|ββββββββββ| 1420/1500 [07:00<00:21, 3.80it/s]
|
907 |
95%|ββββββββββ| 1421/1500 [07:01<00:22, 3.55it/s]
|
908 |
95%|ββββββββββ| 1422/1500 [07:01<00:20, 3.83it/s]
|
909 |
95%|ββββββββββ| 1423/1500 [07:01<00:18, 4.06it/s]
|
910 |
95%|ββββββββββ| 1424/1500 [07:01<00:18, 4.02it/s]
|
911 |
95%|ββββββββββ| 1425/1500 [07:02<00:18, 4.06it/s]
|
912 |
95%|ββββββββββ| 1426/1500 [07:02<00:17, 4.11it/s]
|
913 |
95%|ββββββββββ| 1427/1500 [07:02<00:17, 4.29it/s]
|
914 |
95%|ββββββββββ| 1428/1500 [07:02<00:15, 4.55it/s]
|
915 |
95%|ββββββββββ| 1429/1500 [07:02<00:16, 4.29it/s]
|
916 |
95%|ββββββββββ| 1430/1500 [07:03<00:15, 4.38it/s]
|
917 |
95%|ββββββββββ| 1431/1500 [07:03<00:19, 3.60it/s]
|
918 |
95%|ββββββββββ| 1432/1500 [07:03<00:17, 3.92it/s]
|
919 |
96%|ββββββββββ| 1433/1500 [07:03<00:16, 4.12it/s]
|
920 |
96%|ββββββββββ| 1434/1500 [07:04<00:15, 4.30it/s]
|
921 |
96%|ββββββββββ| 1435/1500 [07:04<00:15, 4.31it/s]
|
922 |
96%|ββββββββββ| 1436/1500 [07:04<00:15, 4.19it/s]
|
923 |
96%|ββββββββββ| 1437/1500 [07:04<00:16, 3.93it/s]
|
924 |
96%|ββββββββββ| 1438/1500 [07:05<00:16, 3.84it/s]
|
925 |
96%|ββββββββββ| 1439/1500 [07:05<00:16, 3.77it/s]
|
926 |
96%|ββββββββββ| 1440/1500 [07:05<00:15, 3.80it/s]
|
927 |
96%|ββββββββββ| 1441/1500 [07:06<00:14, 3.99it/s]
|
928 |
96%|ββββββββββ| 1442/1500 [07:06<00:14, 4.01it/s]
|
929 |
96%|ββββββββββ| 1443/1500 [07:06<00:13, 4.29it/s]
|
930 |
96%|ββββββββββ| 1444/1500 [07:06<00:13, 4.08it/s]
|
931 |
96%|ββββββββββ| 1445/1500 [07:07<00:14, 3.83it/s]
|
932 |
96%|ββββββββββ| 1446/1500 [07:07<00:13, 3.97it/s]
|
933 |
96%|ββββββββββ| 1447/1500 [07:07<00:17, 3.09it/s]
|
934 |
97%|ββββββββββ| 1448/1500 [07:07<00:14, 3.57it/s]
|
935 |
97%|ββββββββββ| 1449/1500 [07:08<00:13, 3.85it/s]
|
936 |
97%|ββββββββββ| 1450/1500 [07:08<00:12, 4.11it/s]
|
937 |
97%|ββββββββββ| 1451/1500 [07:08<00:10, 4.46it/s]
|
938 |
97%|ββββββββββ| 1452/1500 [07:08<00:10, 4.38it/s]
|
939 |
97%|ββββββββββ| 1453/1500 [07:09<00:10, 4.31it/s]
|
940 |
97%|ββββββββββ| 1454/1500 [07:09<00:10, 4.44it/s]
|
941 |
97%|ββββββββββ| 1455/1500 [07:09<00:10, 4.29it/s]
|
942 |
97%|ββββββββββ| 1456/1500 [07:09<00:10, 4.28it/s]
|
943 |
97%|ββββββββββ| 1457/1500 [07:09<00:10, 4.27it/s]
|
944 |
97%|ββββββββββ| 1458/1500 [07:10<00:09, 4.49it/s]
|
945 |
97%|ββββββββββ| 1459/1500 [07:10<00:11, 3.73it/s]
|
946 |
97%|ββββββββββ| 1460/1500 [07:10<00:09, 4.11it/s]
|
947 |
97%|ββββββββββ| 1461/1500 [07:10<00:09, 4.08it/s]
|
948 |
97%|ββββββββββ| 1462/1500 [07:11<00:08, 4.30it/s]
|
949 |
98%|ββββββββββ| 1463/1500 [07:11<00:09, 4.03it/s]
|
950 |
98%|ββββββββββ| 1464/1500 [07:11<00:08, 4.28it/s]
|
951 |
98%|ββββββββββ| 1465/1500 [07:11<00:08, 4.37it/s]
|
952 |
98%|ββββββββββ| 1466/1500 [07:12<00:08, 4.23it/s]
|
953 |
98%|ββββββββββ| 1467/1500 [07:12<00:07, 4.23it/s]
|
954 |
98%|ββββββββββ| 1468/1500 [07:12<00:07, 4.35it/s]
|
955 |
98%|ββββββββββ| 1469/1500 [07:12<00:07, 4.23it/s]
|
956 |
98%|ββββββββββ| 1470/1500 [07:13<00:07, 4.15it/s]
|
957 |
98%|ββββββββββ| 1471/1500 [07:13<00:06, 4.58it/s]
|
958 |
98%|ββββββββββ| 1472/1500 [07:13<00:06, 4.31it/s]
|
959 |
98%|ββββββββββ| 1473/1500 [07:13<00:05, 4.54it/s]
|
960 |
98%|ββββββββββ| 1474/1500 [07:13<00:05, 4.59it/s]
|
961 |
98%|ββββββββββ| 1475/1500 [07:14<00:06, 4.05it/s]
|
962 |
98%|ββββββββββ| 1476/1500 [07:14<00:05, 4.09it/s]
|
963 |
98%|ββββββββββ| 1477/1500 [07:14<00:06, 3.68it/s]
|
964 |
99%|ββββββββββ| 1478/1500 [07:14<00:05, 4.05it/s]
|
965 |
99%|ββββββββββ| 1479/1500 [07:15<00:05, 3.96it/s]
|
966 |
99%|ββββββββββ| 1480/1500 [07:15<00:05, 3.97it/s]
|
967 |
99%|ββββββββββ| 1481/1500 [07:15<00:04, 3.98it/s]
|
968 |
99%|ββββββββββ| 1482/1500 [07:15<00:04, 4.07it/s]
|
969 |
99%|ββββββββββ| 1483/1500 [07:16<00:04, 4.22it/s]
|
970 |
99%|ββββββββββ| 1484/1500 [07:16<00:03, 4.54it/s]
|
971 |
99%|ββββββββββ| 1485/1500 [07:16<00:03, 4.71it/s]
|
972 |
99%|ββββββββββ| 1486/1500 [07:16<00:02, 4.75it/s]
|
973 |
99%|ββββββββββ| 1487/1500 [07:16<00:02, 4.95it/s]
|
974 |
99%|ββββββββββ| 1488/1500 [07:17<00:02, 5.11it/s]
|
975 |
99%|ββββββββββ| 1489/1500 [07:17<00:02, 5.07it/s]
|
976 |
99%|ββββββββββ| 1490/1500 [07:17<00:01, 5.08it/s]
|
977 |
99%|ββββββββββ| 1491/1500 [07:17<00:01, 4.71it/s]
|
978 |
99%|ββββββββββ| 1492/1500 [07:18<00:01, 4.23it/s]
|
979 |
|
980 |
0%| | 0/315 [00:00<?, ?it/s][A
|
|
|
981 |
3%|β | 8/315 [00:00<00:04, 73.73it/s][A
|
|
|
982 |
5%|β | 16/315 [00:00<00:04, 73.93it/s][A
|
|
|
983 |
8%|β | 24/315 [00:00<00:03, 74.53it/s][A
|
|
|
984 |
10%|β | 32/315 [00:00<00:03, 73.07it/s][A
|
|
|
985 |
13%|ββ | 40/315 [00:00<00:03, 73.97it/s][A
|
|
|
986 |
15%|ββ | 48/315 [00:00<00:03, 74.36it/s][A
|
|
|
987 |
18%|ββ | 56/315 [00:00<00:03, 74.36it/s][A
|
|
|
988 |
20%|ββ | 64/315 [00:00<00:03, 72.79it/s][A
|
|
|
989 |
23%|βββ | 72/315 [00:00<00:03, 73.95it/s][A
|
|
|
990 |
25%|βββ | 80/315 [00:01<00:03, 72.93it/s][A
|
|
|
991 |
28%|βββ | 88/315 [00:01<00:03, 72.87it/s][A
|
|
|
992 |
30%|βββ | 96/315 [00:01<00:02, 73.78it/s][A
|
|
|
993 |
33%|ββββ | 104/315 [00:01<00:02, 72.07it/s][A
|
|
|
994 |
36%|ββββ | 112/315 [00:01<00:02, 72.26it/s][A
|
|
|
995 |
38%|ββββ | 120/315 [00:01<00:02, 72.12it/s][A
|
|
|
996 |
41%|ββββ | 128/315 [00:01<00:02, 72.59it/s][A
|
|
|
997 |
43%|βββββ | 136/315 [00:01<00:02, 73.04it/s][A
|
|
|
998 |
46%|βββββ | 144/315 [00:01<00:02, 70.92it/s][A
|
|
|
999 |
48%|βββββ | 152/315 [00:02<00:02, 70.72it/s][A
|
|
|
1000 |
51%|βββββ | 160/315 [00:02<00:02, 71.70it/s][A
|
|
|
1001 |
53%|ββββββ | 168/315 [00:02<00:02, 70.87it/s][A
|
|
|
1002 |
56%|ββββββ | 176/315 [00:02<00:01, 70.65it/s][A
|
|
|
1003 |
58%|ββββββ | 184/315 [00:02<00:01, 72.05it/s][A
|
|
|
1004 |
61%|ββββββ | 192/315 [00:02<00:01, 73.71it/s][A
|
|
|
1005 |
63%|βββββββ | 200/315 [00:02<00:01, 73.88it/s][A
|
|
|
1006 |
66%|βββββββ | 208/315 [00:02<00:01, 70.92it/s][A
|
|
|
1007 |
69%|βββββββ | 216/315 [00:02<00:01, 72.48it/s][A
|
|
|
1008 |
71%|βββββββ | 224/315 [00:03<00:01, 74.05it/s][A
|
|
|
1009 |
74%|ββββββββ | 232/315 [00:03<00:01, 75.07it/s][A
|
|
|
1010 |
76%|ββββββββ | 240/315 [00:03<00:01, 71.76it/s][A
|
|
|
1011 |
79%|ββββββββ | 248/315 [00:03<00:00, 71.84it/s][A
|
|
|
1012 |
81%|βββββββββ | 256/315 [00:03<00:00, 72.35it/s][A
|
|
|
1013 |
84%|βββββββββ | 264/315 [00:03<00:00, 72.07it/s][A
|
|
|
1014 |
86%|βββββββββ | 272/315 [00:03<00:00, 72.92it/s][A
|
|
|
1015 |
89%|βββββββββ | 280/315 [00:03<00:00, 74.87it/s][A
|
|
|
1016 |
91%|ββββββββββ| 288/315 [00:03<00:00, 74.75it/s][A
|
|
|
1017 |
94%|ββββββββββ| 296/315 [00:04<00:00, 73.35it/s][A
|
|
|
1018 |
97%|ββββββββββ| 304/315 [00:04<00:00, 74.36it/s][A
|
|
|
1019 |
99%|ββββββββββ| 313/315 [00:04<00:00, 76.22it/s][A
|
1020 |
|
|
|
1021 |
|
|
|
|
|
1022 |
[A[INFO|trainer.py:3478] 2024-08-30 22:35:36,571 >> Saving model checkpoint to /content/dissertation/scripts/ner/output/checkpoint-1500
|
1023 |
|
|
|
|
774 |
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:34:45,932 >> Special tokens file saved in /content/dissertation/scripts/ner/output/checkpoint-1350/special_tokens_map.json
|
775 |
[INFO|tokenization_utils_base.py:2574] 2024-08-30 22:34:51,508 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/tokenizer_config.json
|
776 |
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:34:51,508 >> Special tokens file saved in /content/dissertation/scripts/ner/output/special_tokens_map.json
|
|
|
777 |
90%|βββββββββ | 1351/1500 [06:44<09:37, 3.87s/it]
|
778 |
90%|βββββββββ | 1352/1500 [06:44<06:53, 2.80s/it]
|
779 |
90%|βββββββββ | 1353/1500 [06:45<05:01, 2.05s/it]
|
780 |
90%|βββββββββ | 1354/1500 [06:45<03:38, 1.50s/it]
|
781 |
90%|βββββββββ | 1355/1500 [06:45<02:42, 1.12s/it]
|
782 |
90%|βββββββββ | 1356/1500 [06:45<02:03, 1.17it/s]
|
783 |
90%|βββββββββ | 1357/1500 [06:46<01:34, 1.52it/s]
|
784 |
91%|βββββββββ | 1358/1500 [06:46<01:15, 1.89it/s]
|
785 |
91%|βββββββββ | 1359/1500 [06:46<00:59, 2.37it/s]
|
786 |
91%|βββββββββ | 1360/1500 [06:46<00:49, 2.84it/s]
|
787 |
91%|βββββββββ | 1361/1500 [06:46<00:41, 3.36it/s]
|
788 |
91%|βββββββββ | 1362/1500 [06:47<00:43, 3.19it/s]
|
789 |
91%|βββββββββ | 1363/1500 [06:47<00:38, 3.56it/s]
|
790 |
91%|βββββββββ | 1364/1500 [06:47<00:36, 3.75it/s]
|
791 |
91%|βββββββββ | 1365/1500 [06:47<00:34, 3.93it/s]
|
792 |
91%|βββββββββ | 1366/1500 [06:48<00:32, 4.09it/s]
|
793 |
91%|βββββββββ | 1367/1500 [06:48<00:30, 4.43it/s]
|
794 |
91%|βββββββββ | 1368/1500 [06:48<00:28, 4.57it/s]
|
795 |
91%|ββββββββββ| 1369/1500 [06:48<00:27, 4.74it/s]
|
796 |
91%|ββββββββββ| 1370/1500 [06:48<00:28, 4.54it/s]
|
797 |
91%|ββββββββββ| 1371/1500 [06:49<00:32, 4.00it/s]
|
798 |
91%|ββββββββββ| 1372/1500 [06:49<00:32, 3.93it/s]
|
799 |
92%|ββββββββββ| 1373/1500 [06:49<00:31, 4.04it/s]
|
800 |
92%|ββββββββββ| 1374/1500 [06:49<00:30, 4.11it/s]
|
801 |
92%|ββββββββββ| 1375/1500 [06:50<00:28, 4.42it/s]
|
802 |
92%|ββββββββββ| 1376/1500 [06:50<00:27, 4.45it/s]
|
803 |
92%|ββββββββββ| 1377/1500 [06:50<00:27, 4.48it/s]
|
804 |
92%|ββββββββββ| 1378/1500 [06:50<00:25, 4.75it/s]
|
805 |
92%|ββββββββββ| 1379/1500 [06:50<00:24, 4.91it/s]
|
806 |
92%|ββββββββββ| 1380/1500 [06:51<00:25, 4.76it/s]
|
807 |
92%|ββββββββββ| 1381/1500 [06:51<00:24, 4.77it/s]
|
808 |
92%|ββββββββββ| 1382/1500 [06:51<00:24, 4.81it/s]
|
809 |
92%|ββββββββββ| 1383/1500 [06:51<00:24, 4.77it/s]
|
810 |
92%|ββββββββββ| 1384/1500 [06:52<00:26, 4.33it/s]
|
811 |
92%|ββββββββββ| 1385/1500 [06:52<00:25, 4.43it/s]
|
812 |
92%|ββββββββββ| 1386/1500 [06:52<00:27, 4.15it/s]
|
813 |
92%|ββββββββββ| 1387/1500 [06:52<00:24, 4.69it/s]
|
814 |
93%|ββββββββββ| 1388/1500 [06:53<00:26, 4.28it/s]
|
815 |
93%|ββββββββββ| 1389/1500 [06:53<00:25, 4.33it/s]
|
816 |
93%|ββββββββββ| 1390/1500 [06:53<00:24, 4.46it/s]
|
817 |
93%|ββββββββββ| 1391/1500 [06:53<00:29, 3.67it/s]
|
818 |
93%|ββββββββββ| 1392/1500 [06:54<00:27, 3.93it/s]
|
819 |
93%|ββββββββββ| 1393/1500 [06:54<00:26, 3.98it/s]
|
820 |
93%|ββββββββββ| 1394/1500 [06:54<00:24, 4.26it/s]
|
821 |
93%|ββββββββββ| 1395/1500 [06:54<00:23, 4.51it/s]
|
822 |
93%|ββββββββββ| 1396/1500 [06:54<00:22, 4.61it/s]
|
823 |
93%|ββββββββββ| 1397/1500 [06:55<00:24, 4.20it/s]
|
824 |
93%|ββββββββββ| 1398/1500 [06:55<00:22, 4.48it/s]
|
825 |
93%|ββββββββββ| 1399/1500 [06:55<00:24, 4.14it/s]
|
826 |
93%|ββββββββββ| 1400/1500 [06:55<00:25, 3.94it/s]
|
827 |
93%|ββββββββββ| 1401/1500 [06:56<00:24, 4.12it/s]
|
828 |
93%|ββββββββββ| 1402/1500 [06:56<00:24, 4.01it/s]
|
829 |
94%|ββββββββββ| 1403/1500 [06:56<00:25, 3.85it/s]
|
830 |
94%|ββββββββββ| 1404/1500 [06:56<00:23, 4.09it/s]
|
831 |
94%|ββββββββββ| 1405/1500 [06:57<00:23, 4.10it/s]
|
832 |
94%|ββββββββββ| 1406/1500 [06:57<00:21, 4.33it/s]
|
833 |
94%|ββββββββββ| 1407/1500 [06:57<00:24, 3.75it/s]
|
834 |
94%|ββββββββββ| 1408/1500 [06:57<00:22, 4.10it/s]
|
835 |
94%|ββββββββββ| 1409/1500 [06:58<00:24, 3.77it/s]
|
836 |
+
|
837 |
90%|βββββββββ | 1351/1500 [06:44<09:37, 3.87s/it]
|
838 |
90%|βββββββββ | 1352/1500 [06:44<06:53, 2.80s/it]
|
839 |
90%|βββββββββ | 1353/1500 [06:45<05:01, 2.05s/it]
|
840 |
90%|βββββββββ | 1354/1500 [06:45<03:38, 1.50s/it]
|
841 |
90%|βββββββββ | 1355/1500 [06:45<02:42, 1.12s/it]
|
842 |
90%|βββββββββ | 1356/1500 [06:45<02:03, 1.17it/s]
|
843 |
90%|βββββββββ | 1357/1500 [06:46<01:34, 1.52it/s]
|
844 |
91%|βββββββββ | 1358/1500 [06:46<01:15, 1.89it/s]
|
845 |
91%|βββββββββ | 1359/1500 [06:46<00:59, 2.37it/s]
|
846 |
91%|βββββββββ | 1360/1500 [06:46<00:49, 2.84it/s]
|
847 |
91%|βββββββββ | 1361/1500 [06:46<00:41, 3.36it/s]
|
848 |
91%|βββββββββ | 1362/1500 [06:47<00:43, 3.19it/s]
|
849 |
91%|βββββββββ | 1363/1500 [06:47<00:38, 3.56it/s]
|
850 |
91%|βββββββββ | 1364/1500 [06:47<00:36, 3.75it/s]
|
851 |
91%|βββββββββ | 1365/1500 [06:47<00:34, 3.93it/s]
|
852 |
91%|βββββββββ | 1366/1500 [06:48<00:32, 4.09it/s]
|
853 |
91%|βββββββββ | 1367/1500 [06:48<00:30, 4.43it/s]
|
854 |
91%|βββββββββ | 1368/1500 [06:48<00:28, 4.57it/s]
|
855 |
91%|ββββββββββ| 1369/1500 [06:48<00:27, 4.74it/s]
|
856 |
91%|ββββββββββ| 1370/1500 [06:48<00:28, 4.54it/s]
|
857 |
91%|ββββββββββ| 1371/1500 [06:49<00:32, 4.00it/s]
|
858 |
91%|ββββββββββ| 1372/1500 [06:49<00:32, 3.93it/s]
|
859 |
92%|ββββββββββ| 1373/1500 [06:49<00:31, 4.04it/s]
|
860 |
92%|ββββββββββ| 1374/1500 [06:49<00:30, 4.11it/s]
|
861 |
92%|ββββββββββ| 1375/1500 [06:50<00:28, 4.42it/s]
|
862 |
92%|ββββββββββ| 1376/1500 [06:50<00:27, 4.45it/s]
|
863 |
92%|ββββββββββ| 1377/1500 [06:50<00:27, 4.48it/s]
|
864 |
92%|ββββββββββ| 1378/1500 [06:50<00:25, 4.75it/s]
|
865 |
92%|ββββββββββ| 1379/1500 [06:50<00:24, 4.91it/s]
|
866 |
92%|ββββββββββ| 1380/1500 [06:51<00:25, 4.76it/s]
|
867 |
92%|ββββββββββ| 1381/1500 [06:51<00:24, 4.77it/s]
|
868 |
92%|ββββββββββ| 1382/1500 [06:51<00:24, 4.81it/s]
|
869 |
92%|ββββββββββ| 1383/1500 [06:51<00:24, 4.77it/s]
|
870 |
92%|ββββββββββ| 1384/1500 [06:52<00:26, 4.33it/s]
|
871 |
92%|ββββββββββ| 1385/1500 [06:52<00:25, 4.43it/s]
|
872 |
92%|ββββββββββ| 1386/1500 [06:52<00:27, 4.15it/s]
|
873 |
92%|ββββββββββ| 1387/1500 [06:52<00:24, 4.69it/s]
|
874 |
93%|ββββββββββ| 1388/1500 [06:53<00:26, 4.28it/s]
|
875 |
93%|ββββββββββ| 1389/1500 [06:53<00:25, 4.33it/s]
|
876 |
93%|ββββββββββ| 1390/1500 [06:53<00:24, 4.46it/s]
|
877 |
93%|ββββββββββ| 1391/1500 [06:53<00:29, 3.67it/s]
|
878 |
93%|ββββββββββ| 1392/1500 [06:54<00:27, 3.93it/s]
|
879 |
93%|ββββββββββ| 1393/1500 [06:54<00:26, 3.98it/s]
|
880 |
93%|ββββββββββ| 1394/1500 [06:54<00:24, 4.26it/s]
|
881 |
93%|ββββββββββ| 1395/1500 [06:54<00:23, 4.51it/s]
|
882 |
93%|ββββββββββ| 1396/1500 [06:54<00:22, 4.61it/s]
|
883 |
93%|ββββββββββ| 1397/1500 [06:55<00:24, 4.20it/s]
|
884 |
93%|ββββββββββ| 1398/1500 [06:55<00:22, 4.48it/s]
|
885 |
93%|ββββββββββ| 1399/1500 [06:55<00:24, 4.14it/s]
|
886 |
93%|ββββββββββ| 1400/1500 [06:55<00:25, 3.94it/s]
|
887 |
93%|ββββββββββ| 1401/1500 [06:56<00:24, 4.12it/s]
|
888 |
93%|ββββββββββ| 1402/1500 [06:56<00:24, 4.01it/s]
|
889 |
94%|ββββββββββ| 1403/1500 [06:56<00:25, 3.85it/s]
|
890 |
94%|ββββββββββ| 1404/1500 [06:56<00:23, 4.09it/s]
|
891 |
94%|ββββββββββ| 1405/1500 [06:57<00:23, 4.10it/s]
|
892 |
94%|ββββββββββ| 1406/1500 [06:57<00:21, 4.33it/s]
|
893 |
94%|ββββββββββ| 1407/1500 [06:57<00:24, 3.75it/s]
|
894 |
94%|ββββββββββ| 1408/1500 [06:57<00:22, 4.10it/s]
|
895 |
94%|ββββββββββ| 1409/1500 [06:58<00:24, 3.77it/s]
|
896 |
94%|ββββββββββ| 1410/1500 [06:58<00:24, 3.70it/s]
|
897 |
94%|ββββββββββ| 1411/1500 [06:58<00:22, 4.04it/s]
|
898 |
94%|ββββββββββ| 1412/1500 [06:58<00:20, 4.20it/s]
|
899 |
94%|ββββββββββ| 1413/1500 [06:59<00:20, 4.26it/s]
|
900 |
94%|ββββββββββ| 1414/1500 [06:59<00:19, 4.35it/s]
|
901 |
94%|ββββββββββ| 1415/1500 [06:59<00:18, 4.61it/s]
|
902 |
94%|ββββββββββ| 1416/1500 [06:59<00:17, 4.69it/s]
|
903 |
94%|ββββββββββ| 1417/1500 [06:59<00:18, 4.58it/s]
|
904 |
95%|ββββββββββ| 1418/1500 [07:00<00:18, 4.49it/s]
|
905 |
95%|ββββββββββ| 1419/1500 [07:00<00:23, 3.52it/s]
|
906 |
95%|ββββββββββ| 1420/1500 [07:00<00:21, 3.80it/s]
|
907 |
95%|ββββββββββ| 1421/1500 [07:01<00:22, 3.55it/s]
|
908 |
95%|ββββββββββ| 1422/1500 [07:01<00:20, 3.83it/s]
|
909 |
95%|ββββββββββ| 1423/1500 [07:01<00:18, 4.06it/s]
|
910 |
95%|ββββββββββ| 1424/1500 [07:01<00:18, 4.02it/s]
|
911 |
95%|ββββββββββ| 1425/1500 [07:02<00:18, 4.06it/s]
|
912 |
95%|ββββββββββ| 1426/1500 [07:02<00:17, 4.11it/s]
|
913 |
95%|ββββββββββ| 1427/1500 [07:02<00:17, 4.29it/s]
|
914 |
95%|ββββββββββ| 1428/1500 [07:02<00:15, 4.55it/s]
|
915 |
95%|ββββββββββ| 1429/1500 [07:02<00:16, 4.29it/s]
|
916 |
95%|ββββββββββ| 1430/1500 [07:03<00:15, 4.38it/s]
|
917 |
95%|ββββββββββ| 1431/1500 [07:03<00:19, 3.60it/s]
|
918 |
95%|ββββββββββ| 1432/1500 [07:03<00:17, 3.92it/s]
|
919 |
96%|ββββββββββ| 1433/1500 [07:03<00:16, 4.12it/s]
|
920 |
96%|ββββββββββ| 1434/1500 [07:04<00:15, 4.30it/s]
|
921 |
96%|ββββββββββ| 1435/1500 [07:04<00:15, 4.31it/s]
|
922 |
96%|ββββββββββ| 1436/1500 [07:04<00:15, 4.19it/s]
|
923 |
96%|ββββββββββ| 1437/1500 [07:04<00:16, 3.93it/s]
|
924 |
96%|ββββββββββ| 1438/1500 [07:05<00:16, 3.84it/s]
|
925 |
96%|ββββββββββ| 1439/1500 [07:05<00:16, 3.77it/s]
|
926 |
96%|ββββββββββ| 1440/1500 [07:05<00:15, 3.80it/s]
|
927 |
96%|ββββββββββ| 1441/1500 [07:06<00:14, 3.99it/s]
|
928 |
96%|ββββββββββ| 1442/1500 [07:06<00:14, 4.01it/s]
|
929 |
96%|ββββββββββ| 1443/1500 [07:06<00:13, 4.29it/s]
|
930 |
96%|ββββββββββ| 1444/1500 [07:06<00:13, 4.08it/s]
|
931 |
96%|ββββββββββ| 1445/1500 [07:07<00:14, 3.83it/s]
|
932 |
96%|ββββββββββ| 1446/1500 [07:07<00:13, 3.97it/s]
|
933 |
96%|ββββββββββ| 1447/1500 [07:07<00:17, 3.09it/s]
|
934 |
97%|ββββββββββ| 1448/1500 [07:07<00:14, 3.57it/s]
|
935 |
97%|ββββββββββ| 1449/1500 [07:08<00:13, 3.85it/s]
|
936 |
97%|ββββββββββ| 1450/1500 [07:08<00:12, 4.11it/s]
|
937 |
97%|ββββββββββ| 1451/1500 [07:08<00:10, 4.46it/s]
|
938 |
97%|ββββββββββ| 1452/1500 [07:08<00:10, 4.38it/s]
|
939 |
97%|ββββββββββ| 1453/1500 [07:09<00:10, 4.31it/s]
|
940 |
97%|ββββββββββ| 1454/1500 [07:09<00:10, 4.44it/s]
|
941 |
97%|ββββββββββ| 1455/1500 [07:09<00:10, 4.29it/s]
|
942 |
97%|ββββββββββ| 1456/1500 [07:09<00:10, 4.28it/s]
|
943 |
97%|ββββββββββ| 1457/1500 [07:09<00:10, 4.27it/s]
|
944 |
97%|ββββββββββ| 1458/1500 [07:10<00:09, 4.49it/s]
|
945 |
97%|ββββββββββ| 1459/1500 [07:10<00:11, 3.73it/s]
|
946 |
97%|ββββββββββ| 1460/1500 [07:10<00:09, 4.11it/s]
|
947 |
97%|ββββββββββ| 1461/1500 [07:10<00:09, 4.08it/s]
|
948 |
97%|ββββββββββ| 1462/1500 [07:11<00:08, 4.30it/s]
|
949 |
98%|ββββββββββ| 1463/1500 [07:11<00:09, 4.03it/s]
|
950 |
98%|ββββββββββ| 1464/1500 [07:11<00:08, 4.28it/s]
|
951 |
98%|ββββββββββ| 1465/1500 [07:11<00:08, 4.37it/s]
|
952 |
98%|ββββββββββ| 1466/1500 [07:12<00:08, 4.23it/s]
|
953 |
98%|ββββββββββ| 1467/1500 [07:12<00:07, 4.23it/s]
|
954 |
98%|ββββββββββ| 1468/1500 [07:12<00:07, 4.35it/s]
|
955 |
98%|ββββββββββ| 1469/1500 [07:12<00:07, 4.23it/s]
|
956 |
98%|ββββββββββ| 1470/1500 [07:13<00:07, 4.15it/s]
|
957 |
98%|ββββββββββ| 1471/1500 [07:13<00:06, 4.58it/s]
|
958 |
98%|ββββββββββ| 1472/1500 [07:13<00:06, 4.31it/s]
|
959 |
98%|ββββββββββ| 1473/1500 [07:13<00:05, 4.54it/s]
|
960 |
98%|ββββββββββ| 1474/1500 [07:13<00:05, 4.59it/s]
|
961 |
98%|ββββββββββ| 1475/1500 [07:14<00:06, 4.05it/s]
|
962 |
98%|ββββββββββ| 1476/1500 [07:14<00:05, 4.09it/s]
|
963 |
98%|ββββββββββ| 1477/1500 [07:14<00:06, 3.68it/s]
|
964 |
99%|ββββββββββ| 1478/1500 [07:14<00:05, 4.05it/s]
|
965 |
99%|ββββββββββ| 1479/1500 [07:15<00:05, 3.96it/s]
|
966 |
99%|ββββββββββ| 1480/1500 [07:15<00:05, 3.97it/s]
|
967 |
99%|ββββββββββ| 1481/1500 [07:15<00:04, 3.98it/s]
|
968 |
99%|ββββββββββ| 1482/1500 [07:15<00:04, 4.07it/s]
|
969 |
99%|ββββββββββ| 1483/1500 [07:16<00:04, 4.22it/s]
|
970 |
99%|ββββββββββ| 1484/1500 [07:16<00:03, 4.54it/s]
|
971 |
99%|ββββββββββ| 1485/1500 [07:16<00:03, 4.71it/s]
|
972 |
99%|ββββββββββ| 1486/1500 [07:16<00:02, 4.75it/s]
|
973 |
99%|ββββββββββ| 1487/1500 [07:16<00:02, 4.95it/s]
|
974 |
99%|ββββββββββ| 1488/1500 [07:17<00:02, 5.11it/s]
|
975 |
99%|ββββββββββ| 1489/1500 [07:17<00:02, 5.07it/s]
|
976 |
99%|ββββββββββ| 1490/1500 [07:17<00:01, 5.08it/s]
|
977 |
99%|ββββββββββ| 1491/1500 [07:17<00:01, 4.71it/s]
|
978 |
99%|ββββββββββ| 1492/1500 [07:18<00:01, 4.23it/s]
|
979 |
|
980 |
+
[INFO|configuration_utils.py:472] 2024-08-30 22:35:27,187 >> Configuration saved in /content/dissertation/scripts/ner/output/checkpoint-1500/config.json
|
981 |
+
[INFO|modeling_utils.py:2690] 2024-08-30 22:35:28,204 >> Model weights saved in /content/dissertation/scripts/ner/output/checkpoint-1500/model.safetensors
|
982 |
+
[INFO|tokenization_utils_base.py:2574] 2024-08-30 22:35:28,205 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/checkpoint-1500/tokenizer_config.json
|
983 |
+
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:35:28,205 >> Special tokens file saved in /content/dissertation/scripts/ner/output/checkpoint-1500/special_tokens_map.json
|
984 |
+
[INFO|tokenization_utils_base.py:2574] 2024-08-30 22:35:30,427 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/tokenizer_config.json
|
985 |
+
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:35:30,427 >> Special tokens file saved in /content/dissertation/scripts/ner/output/special_tokens_map.json
|
986 |
+
[INFO|trainer.py:805] 2024-08-30 22:35:30,484 >> The following columns in the evaluation set don't have a corresponding argument in `RobertaForTokenClassification.forward` and have been ignored: tokens, id, ner_tags. If tokens, id, ner_tags are not expected by `RobertaForTokenClassification.forward`, you can safely ignore this message.
|
987 |
+
[INFO|trainer.py:3788] 2024-08-30 22:35:30,487 >>
|
988 |
+
***** Running Evaluation *****
|
989 |
+
[INFO|trainer.py:3790] 2024-08-30 22:35:30,487 >> Num examples = 2519
|
990 |
+
[INFO|trainer.py:3793] 2024-08-30 22:35:30,487 >> Batch size = 8
|
991 |
+
{'eval_loss': 0.27473828196525574, 'eval_precision': 0.6675139806812405, 'eval_recall': 0.7186644772851669, 'eval_f1': 0.6921454928835002, 'eval_accuracy': 0.9483461131252205, 'eval_runtime': 5.4389, 'eval_samples_per_second': 463.146, 'eval_steps_per_second': 57.916, 'epoch': 9.0}
|
992 |
+
{'loss': 0.0091, 'grad_norm': 0.17641158401966095, 'learning_rate': 0.0, 'epoch': 10.0}
|
993 |
+
|
994 |
+
|
995 |
0%| | 0/315 [00:00<?, ?it/s][A
|
996 |
+
|
997 |
3%|β | 8/315 [00:00<00:04, 73.73it/s][A
|
998 |
+
|
999 |
5%|β | 16/315 [00:00<00:04, 73.93it/s][A
|
1000 |
+
|
1001 |
8%|β | 24/315 [00:00<00:03, 74.53it/s][A
|
1002 |
+
|
1003 |
10%|β | 32/315 [00:00<00:03, 73.07it/s][A
|
1004 |
+
|
1005 |
13%|ββ | 40/315 [00:00<00:03, 73.97it/s][A
|
1006 |
+
|
1007 |
15%|ββ | 48/315 [00:00<00:03, 74.36it/s][A
|
1008 |
+
|
1009 |
18%|ββ | 56/315 [00:00<00:03, 74.36it/s][A
|
1010 |
+
|
1011 |
20%|ββ | 64/315 [00:00<00:03, 72.79it/s][A
|
1012 |
+
|
1013 |
23%|βββ | 72/315 [00:00<00:03, 73.95it/s][A
|
1014 |
+
|
1015 |
25%|βββ | 80/315 [00:01<00:03, 72.93it/s][A
|
1016 |
+
|
1017 |
28%|βββ | 88/315 [00:01<00:03, 72.87it/s][A
|
1018 |
+
|
1019 |
30%|βββ | 96/315 [00:01<00:02, 73.78it/s][A
|
1020 |
+
|
1021 |
33%|ββββ | 104/315 [00:01<00:02, 72.07it/s][A
|
1022 |
+
|
1023 |
36%|ββββ | 112/315 [00:01<00:02, 72.26it/s][A
|
1024 |
+
|
1025 |
38%|ββββ | 120/315 [00:01<00:02, 72.12it/s][A
|
1026 |
+
|
1027 |
41%|ββββ | 128/315 [00:01<00:02, 72.59it/s][A
|
1028 |
+
|
1029 |
43%|βββββ | 136/315 [00:01<00:02, 73.04it/s][A
|
1030 |
+
|
1031 |
46%|βββββ | 144/315 [00:01<00:02, 70.92it/s][A
|
1032 |
+
|
1033 |
48%|βββββ | 152/315 [00:02<00:02, 70.72it/s][A
|
1034 |
+
|
1035 |
51%|βββββ | 160/315 [00:02<00:02, 71.70it/s][A
|
1036 |
+
|
1037 |
53%|ββββββ | 168/315 [00:02<00:02, 70.87it/s][A
|
1038 |
+
|
1039 |
56%|ββββββ | 176/315 [00:02<00:01, 70.65it/s][A
|
1040 |
+
|
1041 |
58%|ββββββ | 184/315 [00:02<00:01, 72.05it/s][A
|
1042 |
+
|
1043 |
61%|ββββββ | 192/315 [00:02<00:01, 73.71it/s][A
|
1044 |
+
|
1045 |
63%|βββββββ | 200/315 [00:02<00:01, 73.88it/s][A
|
1046 |
+
|
1047 |
66%|βββββββ | 208/315 [00:02<00:01, 70.92it/s][A
|
1048 |
+
|
1049 |
69%|βββββββ | 216/315 [00:02<00:01, 72.48it/s][A
|
1050 |
+
|
1051 |
71%|βββββββ | 224/315 [00:03<00:01, 74.05it/s][A
|
1052 |
+
|
1053 |
74%|ββββββββ | 232/315 [00:03<00:01, 75.07it/s][A
|
1054 |
+
|
1055 |
76%|ββββββββ | 240/315 [00:03<00:01, 71.76it/s][A
|
1056 |
+
|
1057 |
79%|ββββββββ | 248/315 [00:03<00:00, 71.84it/s][A
|
1058 |
+
|
1059 |
81%|βββββββββ | 256/315 [00:03<00:00, 72.35it/s][A
|
1060 |
+
|
1061 |
84%|βββββββββ | 264/315 [00:03<00:00, 72.07it/s][A
|
1062 |
+
|
1063 |
86%|βββββββββ | 272/315 [00:03<00:00, 72.92it/s][A
|
1064 |
+
|
1065 |
89%|βββββββββ | 280/315 [00:03<00:00, 74.87it/s][A
|
1066 |
+
|
1067 |
91%|ββββββββββ| 288/315 [00:03<00:00, 74.75it/s][A
|
1068 |
+
|
1069 |
94%|ββββββββββ| 296/315 [00:04<00:00, 73.35it/s][A
|
1070 |
+
|
1071 |
97%|ββββββββββ| 304/315 [00:04<00:00, 74.36it/s][A
|
1072 |
+
|
1073 |
99%|ββββββββββ| 313/315 [00:04<00:00, 76.22it/s][A
|
1074 |
|
1075 |
+
|
1076 |
|
1077 |
+
|
1078 |
+
|
1079 |
[A[INFO|trainer.py:3478] 2024-08-30 22:35:36,571 >> Saving model checkpoint to /content/dissertation/scripts/ner/output/checkpoint-1500
|
1080 |
+
[INFO|configuration_utils.py:472] 2024-08-30 22:35:36,573 >> Configuration saved in /content/dissertation/scripts/ner/output/checkpoint-1500/config.json
|
1081 |
+
[INFO|modeling_utils.py:2690] 2024-08-30 22:35:37,986 >> Model weights saved in /content/dissertation/scripts/ner/output/checkpoint-1500/model.safetensors
|
1082 |
+
[INFO|tokenization_utils_base.py:2574] 2024-08-30 22:35:37,987 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/checkpoint-1500/tokenizer_config.json
|
1083 |
+
[INFO|tokenization_utils_base.py:2583] 2024-08-30 22:35:37,987 >> Special tokens file saved in /content/dissertation/scripts/ner/output/checkpoint-1500/special_tokens_map.json
|
1084 |
+
[INFO|trainer.py:2383] 2024-08-30 22:35:40,063 >>
|
1085 |
+
|
1086 |
+
Training completed. Do not forget to share your model on huggingface.co/models =)
|
1087 |
+
|
1088 |
+
|
1089 |
+
[INFO|trainer.py:2621] 2024-08-30 22:35:40,063 >> Loading best model from /content/dissertation/scripts/ner/output/checkpoint-1350 (score: 0.6921454928835002).
|
1090 |
+
|
1091 |
|
1092 |
+
[INFO|trainer.py:4239] 2024-08-30 22:35:40,264 >> Waiting for the current checkpoint push to be finished, this might take a couple of minutes.
|
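The numbers logged above can be cross-checked with a little arithmetic. The sketch below is not part of the training script. It assumes that `eval_f1` is the harmonic mean of the logged precision and recall, that the 1500 training steps are split evenly over 10 epochs, and that the evaluation progress bar counts batches. All values are copied from the log lines above.

```python
import math

# Evaluation metrics logged at epoch 9.0.
precision = 0.6675139806812405
recall = 0.7186644772851669
logged_f1 = 0.6921454928835002

# F1 as the harmonic mean of precision and recall (assumed definition).
f1 = 2 * precision * recall / (precision + recall)

# Evaluation set: 2519 examples with batch size 8.
num_examples = 2519
batch_size = 8
eval_batches = math.ceil(num_examples / batch_size)

# Throughput derived from the logged (rounded) runtime in seconds.
eval_runtime = 5.4389
samples_per_second = num_examples / eval_runtime
steps_per_second = eval_batches / eval_runtime

# Training schedule: 1500 steps in total, assumed to cover 10 equal epochs.
total_steps = 1500
num_epochs = 10
steps_per_epoch = total_steps // num_epochs
best_checkpoint_step = 1350
best_epoch = best_checkpoint_step / steps_per_epoch

print(f"f1={f1:.6f} eval_batches={eval_batches} best_epoch={best_epoch}")
```

Under these assumptions the recomputed F1 agrees with the logged `eval_f1`, the batch count matches the `315` in the evaluation progress bar, and `checkpoint-1350` corresponds to epoch 9, whose F1 is the score reported when the best model is loaded.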