jialicheng commited on
Commit
722bbe3
·
verified ·
1 Parent(s): d30ce0e

Upload folder using huggingface_hub

Browse files
README.md ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: google/vit-large-patch16-224-in21k
4
+ tags:
5
+ - image-classification
6
+ - vision
7
+ - generated_from_trainer
8
+ metrics:
9
+ - accuracy
10
+ model-index:
11
+ - name: vit-large
12
+ results: []
13
+ ---
14
+
15
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
+ should probably proofread and complete it, then remove this comment. -->
17
+
18
+ # vit-large
19
+
20
+ This model is a fine-tuned version of [google/vit-large-patch16-224-in21k](https://huggingface.co/google/vit-large-patch16-224-in21k) on the cifar100 dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 0.3301
23
+ - Accuracy: 0.9309
24
+
25
+ ## Model description
26
+
27
+ More information needed
28
+
29
+ ## Intended uses & limitations
30
+
31
+ More information needed
32
+
33
+ ## Training and evaluation data
34
+
35
+ More information needed
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - learning_rate: 1e-05
43
+ - train_batch_size: 64
44
+ - eval_batch_size: 256
45
+ - seed: 42
46
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
+ - lr_scheduler_type: linear
48
+ - num_epochs: 100
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
54
+ | 1.2884 | 1.0 | 665 | 0.8752 | 0.8834 |
55
+ | 0.7958 | 2.0 | 1330 | 0.4724 | 0.9142 |
56
+ | 0.743 | 3.0 | 1995 | 0.3750 | 0.9207 |
57
+ | 0.6935 | 4.0 | 2660 | 0.3198 | 0.9236 |
58
+ | 0.6159 | 5.0 | 3325 | 0.2945 | 0.9289 |
59
+ | 0.4423 | 6.0 | 3990 | 0.2876 | 0.925 |
60
+ | 0.5506 | 7.0 | 4655 | 0.2617 | 0.9302 |
61
+ | 0.5673 | 8.0 | 5320 | 0.2576 | 0.9324 |
62
+ | 0.4613 | 9.0 | 5985 | 0.2586 | 0.9311 |
63
+ | 0.4179 | 10.0 | 6650 | 0.2555 | 0.9285 |
64
+ | 0.4438 | 11.0 | 7315 | 0.2554 | 0.9316 |
65
+ | 0.4869 | 12.0 | 7980 | 0.2564 | 0.9298 |
66
+ | 0.4289 | 13.0 | 8645 | 0.2713 | 0.9288 |
67
+ | 0.4003 | 14.0 | 9310 | 0.2617 | 0.932 |
68
+ | 0.3227 | 15.0 | 9975 | 0.2567 | 0.9335 |
69
+ | 0.386 | 16.0 | 10640 | 0.2571 | 0.931 |
70
+ | 0.3688 | 17.0 | 11305 | 0.2576 | 0.9346 |
71
+ | 0.3985 | 18.0 | 11970 | 0.2532 | 0.9356 |
72
+ | 0.3213 | 19.0 | 12635 | 0.2728 | 0.9321 |
73
+ | 0.3046 | 20.0 | 13300 | 0.2702 | 0.9334 |
74
+ | 0.3676 | 21.0 | 13965 | 0.2700 | 0.9319 |
75
+ | 0.3329 | 22.0 | 14630 | 0.2720 | 0.9333 |
76
+ | 0.4089 | 23.0 | 15295 | 0.2764 | 0.9325 |
77
+ | 0.3196 | 24.0 | 15960 | 0.2735 | 0.9305 |
78
+ | 0.2982 | 25.0 | 16625 | 0.2771 | 0.9312 |
79
+ | 0.1884 | 26.0 | 17290 | 0.2943 | 0.9304 |
80
+ | 0.3624 | 27.0 | 17955 | 0.2866 | 0.9316 |
81
+ | 0.2957 | 28.0 | 18620 | 0.2708 | 0.932 |
82
+ | 0.3013 | 29.0 | 19285 | 0.2881 | 0.932 |
83
+ | 0.2811 | 30.0 | 19950 | 0.2940 | 0.9304 |
84
+ | 0.2031 | 31.0 | 20615 | 0.2802 | 0.9335 |
85
+ | 0.3268 | 32.0 | 21280 | 0.2803 | 0.9312 |
86
+ | 0.218 | 33.0 | 21945 | 0.2883 | 0.9307 |
87
+ | 0.217 | 34.0 | 22610 | 0.2866 | 0.9356 |
88
+ | 0.2032 | 35.0 | 23275 | 0.2905 | 0.9317 |
89
+ | 0.2539 | 36.0 | 23940 | 0.2818 | 0.9313 |
90
+ | 0.2104 | 37.0 | 24605 | 0.2907 | 0.9329 |
91
+ | 0.264 | 38.0 | 25270 | 0.3030 | 0.9298 |
92
+ | 0.3343 | 39.0 | 25935 | 0.3030 | 0.9299 |
93
+ | 0.2252 | 40.0 | 26600 | 0.2960 | 0.9313 |
94
+ | 0.2453 | 41.0 | 27265 | 0.2977 | 0.9302 |
95
+ | 0.2467 | 42.0 | 27930 | 0.3034 | 0.9293 |
96
+ | 0.2208 | 43.0 | 28595 | 0.3022 | 0.9316 |
97
+ | 0.1808 | 44.0 | 29260 | 0.3067 | 0.9304 |
98
+ | 0.2477 | 45.0 | 29925 | 0.3073 | 0.9289 |
99
+ | 0.2059 | 46.0 | 30590 | 0.3010 | 0.931 |
100
+ | 0.2156 | 47.0 | 31255 | 0.2920 | 0.9318 |
101
+ | 0.2719 | 48.0 | 31920 | 0.3057 | 0.9311 |
102
+ | 0.2156 | 49.0 | 32585 | 0.3127 | 0.9292 |
103
+ | 0.2562 | 50.0 | 33250 | 0.3115 | 0.93 |
104
+ | 0.1847 | 51.0 | 33915 | 0.3058 | 0.9311 |
105
+ | 0.2453 | 52.0 | 34580 | 0.3180 | 0.9308 |
106
+ | 0.2763 | 53.0 | 35245 | 0.3076 | 0.932 |
107
+ | 0.1876 | 54.0 | 35910 | 0.3097 | 0.9318 |
108
+ | 0.1774 | 55.0 | 36575 | 0.3105 | 0.9321 |
109
+ | 0.2011 | 56.0 | 37240 | 0.3108 | 0.9337 |
110
+ | 0.2142 | 57.0 | 37905 | 0.3191 | 0.9312 |
111
+ | 0.1931 | 58.0 | 38570 | 0.3219 | 0.9299 |
112
+ | 0.2328 | 59.0 | 39235 | 0.3155 | 0.9316 |
113
+ | 0.145 | 60.0 | 39900 | 0.3216 | 0.9295 |
114
+ | 0.2804 | 61.0 | 40565 | 0.3253 | 0.9298 |
115
+ | 0.1696 | 62.0 | 41230 | 0.3086 | 0.9315 |
116
+ | 0.2194 | 63.0 | 41895 | 0.3170 | 0.9313 |
117
+ | 0.2297 | 64.0 | 42560 | 0.3231 | 0.9293 |
118
+ | 0.2108 | 65.0 | 43225 | 0.3161 | 0.9313 |
119
+ | 0.1696 | 66.0 | 43890 | 0.3269 | 0.929 |
120
+ | 0.1946 | 67.0 | 44555 | 0.3307 | 0.9302 |
121
+ | 0.1492 | 68.0 | 45220 | 0.3248 | 0.9296 |
122
+ | 0.223 | 69.0 | 45885 | 0.3316 | 0.9293 |
123
+ | 0.1738 | 70.0 | 46550 | 0.3248 | 0.9295 |
124
+ | 0.2251 | 71.0 | 47215 | 0.3297 | 0.9305 |
125
+ | 0.1518 | 72.0 | 47880 | 0.3322 | 0.9311 |
126
+ | 0.1914 | 73.0 | 48545 | 0.3263 | 0.931 |
127
+ | 0.2097 | 74.0 | 49210 | 0.3367 | 0.9294 |
128
+ | 0.1423 | 75.0 | 49875 | 0.3286 | 0.9299 |
129
+ | 0.1953 | 76.0 | 50540 | 0.3337 | 0.9307 |
130
+ | 0.1599 | 77.0 | 51205 | 0.3295 | 0.9313 |
131
+ | 0.2077 | 78.0 | 51870 | 0.3285 | 0.9312 |
132
+ | 0.2053 | 79.0 | 52535 | 0.3278 | 0.9309 |
133
+ | 0.1846 | 80.0 | 53200 | 0.3291 | 0.9307 |
134
+ | 0.1909 | 81.0 | 53865 | 0.3417 | 0.9291 |
135
+ | 0.1971 | 82.0 | 54530 | 0.3323 | 0.9289 |
136
+ | 0.1739 | 83.0 | 55195 | 0.3266 | 0.9323 |
137
+ | 0.1537 | 84.0 | 55860 | 0.3313 | 0.9294 |
138
+ | 0.1706 | 85.0 | 56525 | 0.3395 | 0.928 |
139
+ | 0.199 | 86.0 | 57190 | 0.3344 | 0.9303 |
140
+ | 0.2013 | 87.0 | 57855 | 0.3360 | 0.9294 |
141
+ | 0.1495 | 88.0 | 58520 | 0.3371 | 0.9307 |
142
+ | 0.1042 | 89.0 | 59185 | 0.3302 | 0.9316 |
143
+ | 0.1681 | 90.0 | 59850 | 0.3304 | 0.9295 |
144
+ | 0.1802 | 91.0 | 60515 | 0.3351 | 0.9298 |
145
+ | 0.268 | 92.0 | 61180 | 0.3332 | 0.9305 |
146
+ | 0.1807 | 93.0 | 61845 | 0.3300 | 0.9307 |
147
+ | 0.1855 | 94.0 | 62510 | 0.3315 | 0.9303 |
148
+ | 0.1747 | 95.0 | 63175 | 0.3324 | 0.9295 |
149
+ | 0.1783 | 96.0 | 63840 | 0.3313 | 0.9315 |
150
+ | 0.1256 | 97.0 | 64505 | 0.3327 | 0.9308 |
151
+ | 0.0984 | 98.0 | 65170 | 0.3291 | 0.9317 |
152
+ | 0.1525 | 99.0 | 65835 | 0.3307 | 0.9311 |
153
+ | 0.1471 | 100.0 | 66500 | 0.3301 | 0.9309 |
154
+
155
+
156
+ ### Framework versions
157
+
158
+ - Transformers 4.39.3
159
+ - Pytorch 2.2.2+cu118
160
+ - Datasets 2.18.0
161
+ - Tokenizers 0.15.2
all_results.json ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "dr_accuracy": 0.9972470588235294,
3
+ "dr_loss": 0.020220542326569557,
4
+ "dr_runtime": 353.6004,
5
+ "dr_samples_per_second": 120.192,
6
+ "dr_steps_per_second": 0.472,
7
+ "epoch": 100.0,
8
+ "test_accuracy": 0.9356,
9
+ "test_loss": 0.25322866439819336,
10
+ "test_runtime": 84.4915,
11
+ "test_samples_per_second": 118.355,
12
+ "test_steps_per_second": 0.473,
13
+ "train_loss": 0.2947179788530321,
14
+ "train_runtime": 117326.6726,
15
+ "train_samples_per_second": 36.224,
16
+ "train_steps_per_second": 0.567
17
+ }
checkpoint-11970/config.json ADDED
@@ -0,0 +1,229 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "google/vit-large-patch16-224-in21k",
3
+ "architectures": [
4
+ "ViTForImageClassification"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "encoder_stride": 16,
8
+ "finetuning_task": "image-classification",
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.0,
11
+ "hidden_size": 1024,
12
+ "id2label": {
13
+ "0": "apple",
14
+ "1": "aquarium_fish",
15
+ "10": "bowl",
16
+ "11": "boy",
17
+ "12": "bridge",
18
+ "13": "bus",
19
+ "14": "butterfly",
20
+ "15": "camel",
21
+ "16": "can",
22
+ "17": "castle",
23
+ "18": "caterpillar",
24
+ "19": "cattle",
25
+ "2": "baby",
26
+ "20": "chair",
27
+ "21": "chimpanzee",
28
+ "22": "clock",
29
+ "23": "cloud",
30
+ "24": "cockroach",
31
+ "25": "couch",
32
+ "26": "cra",
33
+ "27": "crocodile",
34
+ "28": "cup",
35
+ "29": "dinosaur",
36
+ "3": "bear",
37
+ "30": "dolphin",
38
+ "31": "elephant",
39
+ "32": "flatfish",
40
+ "33": "forest",
41
+ "34": "fox",
42
+ "35": "girl",
43
+ "36": "hamster",
44
+ "37": "house",
45
+ "38": "kangaroo",
46
+ "39": "keyboard",
47
+ "4": "beaver",
48
+ "40": "lamp",
49
+ "41": "lawn_mower",
50
+ "42": "leopard",
51
+ "43": "lion",
52
+ "44": "lizard",
53
+ "45": "lobster",
54
+ "46": "man",
55
+ "47": "maple_tree",
56
+ "48": "motorcycle",
57
+ "49": "mountain",
58
+ "5": "bed",
59
+ "50": "mouse",
60
+ "51": "mushroom",
61
+ "52": "oak_tree",
62
+ "53": "orange",
63
+ "54": "orchid",
64
+ "55": "otter",
65
+ "56": "palm_tree",
66
+ "57": "pear",
67
+ "58": "pickup_truck",
68
+ "59": "pine_tree",
69
+ "6": "bee",
70
+ "60": "plain",
71
+ "61": "plate",
72
+ "62": "poppy",
73
+ "63": "porcupine",
74
+ "64": "possum",
75
+ "65": "rabbit",
76
+ "66": "raccoon",
77
+ "67": "ray",
78
+ "68": "road",
79
+ "69": "rocket",
80
+ "7": "beetle",
81
+ "70": "rose",
82
+ "71": "sea",
83
+ "72": "seal",
84
+ "73": "shark",
85
+ "74": "shrew",
86
+ "75": "skunk",
87
+ "76": "skyscraper",
88
+ "77": "snail",
89
+ "78": "snake",
90
+ "79": "spider",
91
+ "8": "bicycle",
92
+ "80": "squirrel",
93
+ "81": "streetcar",
94
+ "82": "sunflower",
95
+ "83": "sweet_pepper",
96
+ "84": "table",
97
+ "85": "tank",
98
+ "86": "telephone",
99
+ "87": "television",
100
+ "88": "tiger",
101
+ "89": "tractor",
102
+ "9": "bottle",
103
+ "90": "train",
104
+ "91": "trout",
105
+ "92": "tulip",
106
+ "93": "turtle",
107
+ "94": "wardrobe",
108
+ "95": "whale",
109
+ "96": "willow_tree",
110
+ "97": "wolf",
111
+ "98": "woman",
112
+ "99": "worm"
113
+ },
114
+ "image_size": 224,
115
+ "initializer_range": 0.02,
116
+ "intermediate_size": 4096,
117
+ "label2id": {
118
+ "apple": "0",
119
+ "aquarium_fish": "1",
120
+ "baby": "2",
121
+ "bear": "3",
122
+ "beaver": "4",
123
+ "bed": "5",
124
+ "bee": "6",
125
+ "beetle": "7",
126
+ "bicycle": "8",
127
+ "bottle": "9",
128
+ "bowl": "10",
129
+ "boy": "11",
130
+ "bridge": "12",
131
+ "bus": "13",
132
+ "butterfly": "14",
133
+ "camel": "15",
134
+ "can": "16",
135
+ "castle": "17",
136
+ "caterpillar": "18",
137
+ "cattle": "19",
138
+ "chair": "20",
139
+ "chimpanzee": "21",
140
+ "clock": "22",
141
+ "cloud": "23",
142
+ "cockroach": "24",
143
+ "couch": "25",
144
+ "cra": "26",
145
+ "crocodile": "27",
146
+ "cup": "28",
147
+ "dinosaur": "29",
148
+ "dolphin": "30",
149
+ "elephant": "31",
150
+ "flatfish": "32",
151
+ "forest": "33",
152
+ "fox": "34",
153
+ "girl": "35",
154
+ "hamster": "36",
155
+ "house": "37",
156
+ "kangaroo": "38",
157
+ "keyboard": "39",
158
+ "lamp": "40",
159
+ "lawn_mower": "41",
160
+ "leopard": "42",
161
+ "lion": "43",
162
+ "lizard": "44",
163
+ "lobster": "45",
164
+ "man": "46",
165
+ "maple_tree": "47",
166
+ "motorcycle": "48",
167
+ "mountain": "49",
168
+ "mouse": "50",
169
+ "mushroom": "51",
170
+ "oak_tree": "52",
171
+ "orange": "53",
172
+ "orchid": "54",
173
+ "otter": "55",
174
+ "palm_tree": "56",
175
+ "pear": "57",
176
+ "pickup_truck": "58",
177
+ "pine_tree": "59",
178
+ "plain": "60",
179
+ "plate": "61",
180
+ "poppy": "62",
181
+ "porcupine": "63",
182
+ "possum": "64",
183
+ "rabbit": "65",
184
+ "raccoon": "66",
185
+ "ray": "67",
186
+ "road": "68",
187
+ "rocket": "69",
188
+ "rose": "70",
189
+ "sea": "71",
190
+ "seal": "72",
191
+ "shark": "73",
192
+ "shrew": "74",
193
+ "skunk": "75",
194
+ "skyscraper": "76",
195
+ "snail": "77",
196
+ "snake": "78",
197
+ "spider": "79",
198
+ "squirrel": "80",
199
+ "streetcar": "81",
200
+ "sunflower": "82",
201
+ "sweet_pepper": "83",
202
+ "table": "84",
203
+ "tank": "85",
204
+ "telephone": "86",
205
+ "television": "87",
206
+ "tiger": "88",
207
+ "tractor": "89",
208
+ "train": "90",
209
+ "trout": "91",
210
+ "tulip": "92",
211
+ "turtle": "93",
212
+ "wardrobe": "94",
213
+ "whale": "95",
214
+ "willow_tree": "96",
215
+ "wolf": "97",
216
+ "woman": "98",
217
+ "worm": "99"
218
+ },
219
+ "layer_norm_eps": 1e-12,
220
+ "model_type": "vit",
221
+ "num_attention_heads": 16,
222
+ "num_channels": 3,
223
+ "num_hidden_layers": 24,
224
+ "patch_size": 16,
225
+ "problem_type": "single_label_classification",
226
+ "qkv_bias": true,
227
+ "torch_dtype": "float32",
228
+ "transformers_version": "4.39.3"
229
+ }
checkpoint-11970/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be07959d6fa0cb775a5bdf4c782097bae5869db4a8a50f76cddb16c7f9f0e70f
3
+ size 1213663080
checkpoint-11970/optimizer.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0d2556096ab91ff4895301d930b2047efdf5cba5251cea69bf66da3267f84af
3
+ size 2427561130
checkpoint-11970/preprocessor_config.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_valid_processor_keys": [
3
+ "images",
4
+ "do_resize",
5
+ "size",
6
+ "resample",
7
+ "do_rescale",
8
+ "rescale_factor",
9
+ "do_normalize",
10
+ "image_mean",
11
+ "image_std",
12
+ "return_tensors",
13
+ "data_format",
14
+ "input_data_format"
15
+ ],
16
+ "do_normalize": true,
17
+ "do_rescale": true,
18
+ "do_resize": true,
19
+ "image_mean": [
20
+ 0.5,
21
+ 0.5,
22
+ 0.5
23
+ ],
24
+ "image_processor_type": "ViTImageProcessor",
25
+ "image_std": [
26
+ 0.5,
27
+ 0.5,
28
+ 0.5
29
+ ],
30
+ "resample": 2,
31
+ "rescale_factor": 0.00392156862745098,
32
+ "size": {
33
+ "height": 224,
34
+ "width": 224
35
+ }
36
+ }
checkpoint-11970/rng_state.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f86f749dc73bc16a2502fef2f98f5c00b4400cb2c67fbe62653b7ed104d13779
3
+ size 14244
checkpoint-11970/scheduler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c72e2edf32b13d9c7ec4c05a3c391fe83fd09b0ae94a7e4dfccdd4f4f4969486
3
+ size 1064
checkpoint-11970/trainer_state.json ADDED
The diff for this file is too large to render. See raw diff
 
checkpoint-11970/training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61511df09538e6804e936fb432e7b68b550ca72e3cbebb2760b47753bbd886cc
3
+ size 4920
config.json ADDED
@@ -0,0 +1,229 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "google/vit-large-patch16-224-in21k",
3
+ "architectures": [
4
+ "ViTForImageClassification"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "encoder_stride": 16,
8
+ "finetuning_task": "image-classification",
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.0,
11
+ "hidden_size": 1024,
12
+ "id2label": {
13
+ "0": "apple",
14
+ "1": "aquarium_fish",
15
+ "10": "bowl",
16
+ "11": "boy",
17
+ "12": "bridge",
18
+ "13": "bus",
19
+ "14": "butterfly",
20
+ "15": "camel",
21
+ "16": "can",
22
+ "17": "castle",
23
+ "18": "caterpillar",
24
+ "19": "cattle",
25
+ "2": "baby",
26
+ "20": "chair",
27
+ "21": "chimpanzee",
28
+ "22": "clock",
29
+ "23": "cloud",
30
+ "24": "cockroach",
31
+ "25": "couch",
32
+ "26": "cra",
33
+ "27": "crocodile",
34
+ "28": "cup",
35
+ "29": "dinosaur",
36
+ "3": "bear",
37
+ "30": "dolphin",
38
+ "31": "elephant",
39
+ "32": "flatfish",
40
+ "33": "forest",
41
+ "34": "fox",
42
+ "35": "girl",
43
+ "36": "hamster",
44
+ "37": "house",
45
+ "38": "kangaroo",
46
+ "39": "keyboard",
47
+ "4": "beaver",
48
+ "40": "lamp",
49
+ "41": "lawn_mower",
50
+ "42": "leopard",
51
+ "43": "lion",
52
+ "44": "lizard",
53
+ "45": "lobster",
54
+ "46": "man",
55
+ "47": "maple_tree",
56
+ "48": "motorcycle",
57
+ "49": "mountain",
58
+ "5": "bed",
59
+ "50": "mouse",
60
+ "51": "mushroom",
61
+ "52": "oak_tree",
62
+ "53": "orange",
63
+ "54": "orchid",
64
+ "55": "otter",
65
+ "56": "palm_tree",
66
+ "57": "pear",
67
+ "58": "pickup_truck",
68
+ "59": "pine_tree",
69
+ "6": "bee",
70
+ "60": "plain",
71
+ "61": "plate",
72
+ "62": "poppy",
73
+ "63": "porcupine",
74
+ "64": "possum",
75
+ "65": "rabbit",
76
+ "66": "raccoon",
77
+ "67": "ray",
78
+ "68": "road",
79
+ "69": "rocket",
80
+ "7": "beetle",
81
+ "70": "rose",
82
+ "71": "sea",
83
+ "72": "seal",
84
+ "73": "shark",
85
+ "74": "shrew",
86
+ "75": "skunk",
87
+ "76": "skyscraper",
88
+ "77": "snail",
89
+ "78": "snake",
90
+ "79": "spider",
91
+ "8": "bicycle",
92
+ "80": "squirrel",
93
+ "81": "streetcar",
94
+ "82": "sunflower",
95
+ "83": "sweet_pepper",
96
+ "84": "table",
97
+ "85": "tank",
98
+ "86": "telephone",
99
+ "87": "television",
100
+ "88": "tiger",
101
+ "89": "tractor",
102
+ "9": "bottle",
103
+ "90": "train",
104
+ "91": "trout",
105
+ "92": "tulip",
106
+ "93": "turtle",
107
+ "94": "wardrobe",
108
+ "95": "whale",
109
+ "96": "willow_tree",
110
+ "97": "wolf",
111
+ "98": "woman",
112
+ "99": "worm"
113
+ },
114
+ "image_size": 224,
115
+ "initializer_range": 0.02,
116
+ "intermediate_size": 4096,
117
+ "label2id": {
118
+ "apple": "0",
119
+ "aquarium_fish": "1",
120
+ "baby": "2",
121
+ "bear": "3",
122
+ "beaver": "4",
123
+ "bed": "5",
124
+ "bee": "6",
125
+ "beetle": "7",
126
+ "bicycle": "8",
127
+ "bottle": "9",
128
+ "bowl": "10",
129
+ "boy": "11",
130
+ "bridge": "12",
131
+ "bus": "13",
132
+ "butterfly": "14",
133
+ "camel": "15",
134
+ "can": "16",
135
+ "castle": "17",
136
+ "caterpillar": "18",
137
+ "cattle": "19",
138
+ "chair": "20",
139
+ "chimpanzee": "21",
140
+ "clock": "22",
141
+ "cloud": "23",
142
+ "cockroach": "24",
143
+ "couch": "25",
144
+ "cra": "26",
145
+ "crocodile": "27",
146
+ "cup": "28",
147
+ "dinosaur": "29",
148
+ "dolphin": "30",
149
+ "elephant": "31",
150
+ "flatfish": "32",
151
+ "forest": "33",
152
+ "fox": "34",
153
+ "girl": "35",
154
+ "hamster": "36",
155
+ "house": "37",
156
+ "kangaroo": "38",
157
+ "keyboard": "39",
158
+ "lamp": "40",
159
+ "lawn_mower": "41",
160
+ "leopard": "42",
161
+ "lion": "43",
162
+ "lizard": "44",
163
+ "lobster": "45",
164
+ "man": "46",
165
+ "maple_tree": "47",
166
+ "motorcycle": "48",
167
+ "mountain": "49",
168
+ "mouse": "50",
169
+ "mushroom": "51",
170
+ "oak_tree": "52",
171
+ "orange": "53",
172
+ "orchid": "54",
173
+ "otter": "55",
174
+ "palm_tree": "56",
175
+ "pear": "57",
176
+ "pickup_truck": "58",
177
+ "pine_tree": "59",
178
+ "plain": "60",
179
+ "plate": "61",
180
+ "poppy": "62",
181
+ "porcupine": "63",
182
+ "possum": "64",
183
+ "rabbit": "65",
184
+ "raccoon": "66",
185
+ "ray": "67",
186
+ "road": "68",
187
+ "rocket": "69",
188
+ "rose": "70",
189
+ "sea": "71",
190
+ "seal": "72",
191
+ "shark": "73",
192
+ "shrew": "74",
193
+ "skunk": "75",
194
+ "skyscraper": "76",
195
+ "snail": "77",
196
+ "snake": "78",
197
+ "spider": "79",
198
+ "squirrel": "80",
199
+ "streetcar": "81",
200
+ "sunflower": "82",
201
+ "sweet_pepper": "83",
202
+ "table": "84",
203
+ "tank": "85",
204
+ "telephone": "86",
205
+ "television": "87",
206
+ "tiger": "88",
207
+ "tractor": "89",
208
+ "train": "90",
209
+ "trout": "91",
210
+ "tulip": "92",
211
+ "turtle": "93",
212
+ "wardrobe": "94",
213
+ "whale": "95",
214
+ "willow_tree": "96",
215
+ "wolf": "97",
216
+ "woman": "98",
217
+ "worm": "99"
218
+ },
219
+ "layer_norm_eps": 1e-12,
220
+ "model_type": "vit",
221
+ "num_attention_heads": 16,
222
+ "num_channels": 3,
223
+ "num_hidden_layers": 24,
224
+ "patch_size": 16,
225
+ "problem_type": "single_label_classification",
226
+ "qkv_bias": true,
227
+ "torch_dtype": "float32",
228
+ "transformers_version": "4.39.3"
229
+ }
dr_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "dr_accuracy": 0.9972470588235294,
3
+ "dr_loss": 0.020220542326569557,
4
+ "dr_runtime": 353.6004,
5
+ "dr_samples_per_second": 120.192,
6
+ "dr_steps_per_second": 0.472,
7
+ "epoch": 100.0
8
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be07959d6fa0cb775a5bdf4c782097bae5869db4a8a50f76cddb16c7f9f0e70f
3
+ size 1213663080
preprocessor_config.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_valid_processor_keys": [
3
+ "images",
4
+ "do_resize",
5
+ "size",
6
+ "resample",
7
+ "do_rescale",
8
+ "rescale_factor",
9
+ "do_normalize",
10
+ "image_mean",
11
+ "image_std",
12
+ "return_tensors",
13
+ "data_format",
14
+ "input_data_format"
15
+ ],
16
+ "do_normalize": true,
17
+ "do_rescale": true,
18
+ "do_resize": true,
19
+ "image_mean": [
20
+ 0.5,
21
+ 0.5,
22
+ 0.5
23
+ ],
24
+ "image_processor_type": "ViTImageProcessor",
25
+ "image_std": [
26
+ 0.5,
27
+ 0.5,
28
+ 0.5
29
+ ],
30
+ "resample": 2,
31
+ "rescale_factor": 0.00392156862745098,
32
+ "size": {
33
+ "height": 224,
34
+ "width": 224
35
+ }
36
+ }
test_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 100.0,
3
+ "test_accuracy": 0.9356,
4
+ "test_loss": 0.25322866439819336,
5
+ "test_runtime": 84.4915,
6
+ "test_samples_per_second": 118.355,
7
+ "test_steps_per_second": 0.473
8
+ }
train_results.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 100.0,
3
+ "train_loss": 0.2947179788530321,
4
+ "train_runtime": 117326.6726,
5
+ "train_samples_per_second": 36.224,
6
+ "train_steps_per_second": 0.567
7
+ }
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff
 
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61511df09538e6804e936fb432e7b68b550ca72e3cbebb2760b47753bbd886cc
3
+ size 4920