jiuhai committed on
Commit 10c1d05 · verified · 1 Parent(s): a8b1dcb

Upload folder using huggingface_hub

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. .gitattributes +5 -0
  2. 333891.err +4 -0
  3. 333891.out +0 -0
  4. 333893.err +0 -0
  5. 333893.out +0 -0
  6. 334413.err +0 -0
  7. 334413.out +0 -0
  8. 334457.err +0 -0
  9. 334457.out +0 -0
  10. 335101.err +0 -0
  11. 335101.out +0 -0
  12. 335261.err +0 -0
  13. 335261.out +385 -0
  14. checkpoint-100000/added_tokens.json +27 -0
  15. checkpoint-100000/config.json +69 -0
  16. checkpoint-100000/generation_config.json +15 -0
  17. checkpoint-100000/latest +1 -0
  18. checkpoint-100000/merges.txt +0 -0
  19. checkpoint-100000/model-00001-of-00006.safetensors +3 -0
  20. checkpoint-100000/model-00002-of-00006.safetensors +3 -0
  21. checkpoint-100000/model-00003-of-00006.safetensors +3 -0
  22. checkpoint-100000/model-00004-of-00006.safetensors +3 -0
  23. checkpoint-100000/model-00005-of-00006.safetensors +3 -0
  24. checkpoint-100000/model-00006-of-00006.safetensors +3 -0
  25. checkpoint-100000/model.safetensors.index.json +0 -0
  26. checkpoint-100000/rng_state_0.pth +3 -0
  27. checkpoint-100000/rng_state_1.pth +3 -0
  28. checkpoint-100000/rng_state_10.pth +3 -0
  29. checkpoint-100000/rng_state_100.pth +3 -0
  30. checkpoint-100000/rng_state_101.pth +3 -0
  31. checkpoint-100000/rng_state_102.pth +3 -0
  32. checkpoint-100000/rng_state_103.pth +3 -0
  33. checkpoint-100000/rng_state_104.pth +3 -0
  34. checkpoint-100000/rng_state_105.pth +3 -0
  35. checkpoint-100000/rng_state_106.pth +3 -0
  36. checkpoint-100000/rng_state_107.pth +3 -0
  37. checkpoint-100000/rng_state_108.pth +3 -0
  38. checkpoint-100000/rng_state_109.pth +3 -0
  39. checkpoint-100000/rng_state_11.pth +3 -0
  40. checkpoint-100000/rng_state_110.pth +3 -0
  41. checkpoint-100000/rng_state_111.pth +3 -0
  42. checkpoint-100000/rng_state_112.pth +3 -0
  43. checkpoint-100000/rng_state_113.pth +3 -0
  44. checkpoint-100000/rng_state_114.pth +3 -0
  45. checkpoint-100000/rng_state_115.pth +3 -0
  46. checkpoint-100000/rng_state_116.pth +3 -0
  47. checkpoint-100000/rng_state_117.pth +3 -0
  48. checkpoint-100000/rng_state_118.pth +3 -0
  49. checkpoint-100000/rng_state_119.pth +3 -0
  50. checkpoint-100000/rng_state_12.pth +3 -0
.gitattributes CHANGED
@@ -33,3 +33,8 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ checkpoint-100000/tokenizer.json filter=lfs diff=lfs merge=lfs -text
+ checkpoint-100000/trainer_state.json filter=lfs diff=lfs merge=lfs -text
+ logs/335933.out filter=lfs diff=lfs merge=lfs -text
+ logs/336337.out filter=lfs diff=lfs merge=lfs -text
+ logs/338184.out filter=lfs diff=lfs merge=lfs -text
333891.err ADDED
@@ -0,0 +1,4 @@
+ Traceback (most recent call last):
+   File "/opt/hpcaas/.mounts/fs-036153e63d56f4dc2/home/zhaojiang/interleaved-llava/load_caption.py", line 1, in <module>
+     from datasets import load_dataset
+ ModuleNotFoundError: No module named 'datasets'
333891.out ADDED
File without changes
333893.err ADDED
The diff for this file is too large to render. See raw diff
 
333893.out ADDED
File without changes
334413.err ADDED
The diff for this file is too large to render. See raw diff
 
334413.out ADDED
The diff for this file is too large to render. See raw diff
 
334457.err ADDED
The diff for this file is too large to render. See raw diff
 
334457.out ADDED
The diff for this file is too large to render. See raw diff
 
335101.err ADDED
The diff for this file is too large to render. See raw diff
 
335101.out ADDED
The diff for this file is too large to render. See raw diff
 
335261.err ADDED
The diff for this file is too large to render. See raw diff
 
335261.out ADDED
@@ -0,0 +1,385 @@
1
+ OpenCLIP not installed
2
+ OpenCLIP not installed
3
+ OpenCLIP not installed
4
+ OpenCLIP not installed
5
+ OpenCLIP not installed
6
+ OpenCLIP not installed
7
+ OpenCLIP not installed
8
+ OpenCLIP not installed
9
+ OpenCLIP not installed
10
+ OpenCLIP not installed
11
+ OpenCLIP not installed
12
+ OpenCLIP not installed
13
+ OpenCLIP not installed
14
+ OpenCLIP not installed
15
+ OpenCLIP not installed
16
+ OpenCLIP not installed
17
+ OpenCLIP not installed
18
+ OpenCLIP not installed
19
+ OpenCLIP not installed
20
+ OpenCLIP not installed
21
+ OpenCLIP not installed
22
+ OpenCLIP not installed
23
+ OpenCLIP not installed
24
+ OpenCLIP not installed
25
+ OpenCLIP not installedOpenCLIP not installed
26
+
27
+ OpenCLIP not installedOpenCLIP not installed
28
+
29
+ OpenCLIP not installedOpenCLIP not installed
30
+
31
+ OpenCLIP not installed
32
+ OpenCLIP not installed
33
+ OpenCLIP not installedOpenCLIP not installed
34
+
35
+ OpenCLIP not installedOpenCLIP not installedOpenCLIP not installed
36
+ OpenCLIP not installed
37
+
38
+
39
+ OpenCLIP not installed
40
+ OpenCLIP not installed
41
+ OpenCLIP not installedOpenCLIP not installed
42
+ OpenCLIP not installed
43
+
44
+ OpenCLIP not installedOpenCLIP not installedOpenCLIP not installed
45
+
46
+
47
+ OpenCLIP not installed
48
+ OpenCLIP not installed
49
+ OpenCLIP not installedOpenCLIP not installed
50
+
51
+ OpenCLIP not installed
52
+ OpenCLIP not installedOpenCLIP not installed
53
+
54
+ OpenCLIP not installedOpenCLIP not installed
55
+
56
+ OpenCLIP not installed
57
+ OpenCLIP not installed
58
+ OpenCLIP not installed
59
+ OpenCLIP not installed
60
+ OpenCLIP not installedOpenCLIP not installed
61
+ OpenCLIP not installed
62
+ OpenCLIP not installed
63
+
64
+ OpenCLIP not installed
65
+ OpenCLIP not installed
66
+ OpenCLIP not installed
67
+ OpenCLIP not installed
68
+ OpenCLIP not installed
69
+ OpenCLIP not installed
70
+ OpenCLIP not installed
71
+ OpenCLIP not installed
72
+ OpenCLIP not installed
73
+ OpenCLIP not installedOpenCLIP not installed
74
+
75
+ OpenCLIP not installed
76
+ OpenCLIP not installed
77
+ OpenCLIP not installedOpenCLIP not installed
78
+
79
+ OpenCLIP not installed
80
+ OpenCLIP not installed
81
+ OpenCLIP not installed
82
+ OpenCLIP not installed
83
+ OpenCLIP not installed
84
+ OpenCLIP not installed
85
+ OpenCLIP not installed
86
+ OpenCLIP not installed
87
+ OpenCLIP not installed
88
+ OpenCLIP not installed
89
+ OpenCLIP not installed
90
+ OpenCLIP not installedOpenCLIP not installedOpenCLIP not installedOpenCLIP not installed
91
+
92
+
93
+
94
+ OpenCLIP not installed
95
+ OpenCLIP not installed
96
+ OpenCLIP not installed
97
+ OpenCLIP not installed
98
+ OpenCLIP not installed
99
+ OpenCLIP not installed
100
+ OpenCLIP not installed
101
+ OpenCLIP not installed
102
+ OpenCLIP not installedOpenCLIP not installed
103
+
104
+ OpenCLIP not installed
105
+ OpenCLIP not installed
106
+ OpenCLIP not installed
107
+ OpenCLIP not installed
108
+ OpenCLIP not installed
109
+ OpenCLIP not installed
110
+ OpenCLIP not installed
111
+ OpenCLIP not installed
112
+ OpenCLIP not installed
113
+ OpenCLIP not installed
114
+ OpenCLIP not installed
115
+ OpenCLIP not installed
116
+ OpenCLIP not installed
117
+ OpenCLIP not installed
118
+ OpenCLIP not installed
119
+ OpenCLIP not installedOpenCLIP not installed
120
+
121
+ OpenCLIP not installed
122
+ OpenCLIP not installed
123
+ OpenCLIP not installed
124
+ OpenCLIP not installed
125
+ OpenCLIP not installed
126
+ OpenCLIP not installedOpenCLIP not installed
127
+
128
+ OpenCLIP not installed
129
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
130
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
131
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
132
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
133
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
134
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
135
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
136
+ [2025-02-16 02:18:02,555] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
137
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
138
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
139
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
140
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
141
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
142
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
143
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
144
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
145
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
146
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
147
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
148
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
149
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
150
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
151
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
152
+ [2025-02-16 02:18:02,558] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
153
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
154
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
155
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
156
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
157
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
158
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
159
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
160
+ [2025-02-16 02:18:02,676] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
161
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
162
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
163
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
164
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
165
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
166
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
167
+ [2025-02-16 02:18:02,765] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
168
+ [2025-02-16 02:18:02,766] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
169
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
170
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
171
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
172
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
173
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
174
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
175
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
176
+ [2025-02-16 02:18:02,884] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
177
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
178
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
179
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
180
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
181
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
182
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
183
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
184
+ [2025-02-16 02:18:03,072] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
185
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
186
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
187
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
188
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
189
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
190
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
191
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
192
+ [2025-02-16 02:18:03,202] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
193
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
194
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
195
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
196
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
197
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
198
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
199
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
200
+ [2025-02-16 02:18:03,234] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
201
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
202
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
203
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
204
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
205
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
206
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
207
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
208
+ [2025-02-16 02:18:03,273] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
209
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
210
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
211
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
212
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
213
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
214
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
215
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
216
+ [2025-02-16 02:18:03,298] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
217
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
218
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
219
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
220
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
221
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
222
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
223
+ [2025-02-16 02:18:03,432] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
224
+ [2025-02-16 02:18:03,433] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
225
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
226
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
227
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
228
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
229
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
230
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
231
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
232
+ [2025-02-16 02:18:03,961] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
233
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
234
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
235
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
236
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
237
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
238
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
239
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
240
+ [2025-02-16 02:18:04,019] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
241
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
242
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
243
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
244
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
245
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
246
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
247
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
248
+ [2025-02-16 02:18:04,036] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
249
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
250
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
251
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
252
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
253
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
254
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
255
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
256
+ [2025-02-16 02:18:05,276] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
257
+ [2025-02-16 02:18:08,126] [INFO] [comm.py:652:init_distributed] cdb=None
258
+ [2025-02-16 02:18:08,126] [INFO] [comm.py:652:init_distributed] cdb=None
259
+ [2025-02-16 02:18:08,126] [INFO] [comm.py:652:init_distributed] cdb=None
260
+ [2025-02-16 02:18:08,127] [INFO] [comm.py:652:init_distributed] cdb=None
261
+ [2025-02-16 02:18:08,127] [INFO] [comm.py:652:init_distributed] cdb=None
262
+ [2025-02-16 02:18:08,127] [INFO] [comm.py:652:init_distributed] cdb=None
263
+ [2025-02-16 02:18:08,127] [INFO] [comm.py:652:init_distributed] cdb=None
264
+ [2025-02-16 02:18:08,128] [INFO] [comm.py:652:init_distributed] cdb=None
265
+ [2025-02-16 02:18:08,128] [INFO] [comm.py:652:init_distributed] cdb=None
266
+ [2025-02-16 02:18:08,129] [INFO] [comm.py:652:init_distributed] cdb=None
267
+ [2025-02-16 02:18:08,131] [INFO] [comm.py:652:init_distributed] cdb=None
268
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
269
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
270
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
271
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
272
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
273
+ [2025-02-16 02:18:08,132] [INFO] [comm.py:652:init_distributed] cdb=None
274
+ [2025-02-16 02:18:08,133] [INFO] [comm.py:652:init_distributed] cdb=None
275
+ [2025-02-16 02:18:08,133] [INFO] [comm.py:652:init_distributed] cdb=None
276
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
277
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
278
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
279
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
280
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
281
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
282
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
283
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
284
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
285
+ [2025-02-16 02:18:08,134] [INFO] [comm.py:652:init_distributed] cdb=None
286
+ [2025-02-16 02:18:08,135] [INFO] [comm.py:652:init_distributed] cdb=None
287
+ [2025-02-16 02:18:08,135] [INFO] [comm.py:652:init_distributed] cdb=None
288
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
289
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
290
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
291
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
292
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
293
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
294
+ [2025-02-16 02:18:08,136] [INFO] [comm.py:652:init_distributed] cdb=None
295
+ [2025-02-16 02:18:08,138] [INFO] [comm.py:652:init_distributed] cdb=None
296
+ [2025-02-16 02:18:08,140] [INFO] [comm.py:652:init_distributed] cdb=None
297
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
298
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
299
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
300
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
301
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
302
+ [2025-02-16 02:18:08,157] [INFO] [comm.py:652:init_distributed] cdb=None
303
+ [2025-02-16 02:18:08,164] [INFO] [comm.py:652:init_distributed] cdb=None
304
+ [2025-02-16 02:18:08,164] [INFO] [comm.py:652:init_distributed] cdb=None
305
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
306
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
307
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
308
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
309
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
310
+ [2025-02-16 02:18:08,394] [INFO] [comm.py:652:init_distributed] cdb=None
311
+ [2025-02-16 02:18:08,400] [INFO] [comm.py:652:init_distributed] cdb=None
312
+ [2025-02-16 02:18:08,401] [INFO] [comm.py:652:init_distributed] cdb=None
313
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
314
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
315
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
316
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
317
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
318
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
319
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
320
+ [2025-02-16 02:18:08,483] [INFO] [comm.py:652:init_distributed] cdb=None
321
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
322
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
323
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
324
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
325
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
326
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
327
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
328
+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None
329
+ [2025-02-16 02:18:08,530] [INFO] [comm.py:652:init_distributed] cdb=None
330
+ [2025-02-16 02:18:08,530] [INFO] [comm.py:652:init_distributed] cdb=None
331
+ [2025-02-16 02:18:08,530] [INFO] [comm.py:652:init_distributed] cdb=None
332
+ [2025-02-16 02:18:08,530] [INFO] [comm.py:652:init_distributed] cdb=None
333
+ [2025-02-16 02:18:08,531] [INFO] [comm.py:652:init_distributed] cdb=None
334
+ [2025-02-16 02:18:08,531] [INFO] [comm.py:652:init_distributed] cdb=None
335
+ [2025-02-16 02:18:08,531] [INFO] [comm.py:652:init_distributed] cdb=None
336
+ [2025-02-16 02:18:08,531] [INFO] [comm.py:652:init_distributed] cdb=None
337
+ [2025-02-16 02:18:08,536] [INFO] [comm.py:652:init_distributed] cdb=None
338
+ [2025-02-16 02:18:08,536] [INFO] [comm.py:652:init_distributed] cdb=None
339
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
340
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
341
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
342
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
343
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
344
+ [2025-02-16 02:18:08,537] [INFO] [comm.py:652:init_distributed] cdb=None
345
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
346
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
347
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
348
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
349
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
350
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
351
+ [2025-02-16 02:18:08,577] [INFO] [comm.py:652:init_distributed] cdb=None
352
+ [2025-02-16 02:18:08,578] [INFO] [comm.py:652:init_distributed] cdb=None
353
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
354
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
355
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
356
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
357
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
358
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
359
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
360
+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None
361
+ [2025-02-16 02:18:09,256] [INFO] [comm.py:652:init_distributed] cdb=None
362
+ [2025-02-16 02:18:09,256] [INFO] [comm.py:652:init_distributed] cdb=None
363
+ [2025-02-16 02:18:09,256] [INFO] [comm.py:652:init_distributed] cdb=None
364
+ [2025-02-16 02:18:09,257] [INFO] [comm.py:652:init_distributed] cdb=None
365
+ [2025-02-16 02:18:09,257] [INFO] [comm.py:652:init_distributed] cdb=None
366
+ [2025-02-16 02:18:09,257] [INFO] [comm.py:652:init_distributed] cdb=None
367
+ [2025-02-16 02:18:09,257] [INFO] [comm.py:652:init_distributed] cdb=None
368
+ [2025-02-16 02:18:09,257] [INFO] [comm.py:652:init_distributed] cdb=None
369
+ [2025-02-16 02:18:09,267] [INFO] [comm.py:652:init_distributed] cdb=None
370
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
371
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
372
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
373
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
374
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
375
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
376
+ [2025-02-16 02:18:09,268] [INFO] [comm.py:652:init_distributed] cdb=None
377
+ [2025-02-16 02:18:10,589] [INFO] [comm.py:652:init_distributed] cdb=None
378
+ [2025-02-16 02:18:10,589] [INFO] [comm.py:652:init_distributed] cdb=None
379
+ [2025-02-16 02:18:10,589] [INFO] [comm.py:652:init_distributed] cdb=None
380
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:652:init_distributed] cdb=None
381
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:652:init_distributed] cdb=None
382
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:652:init_distributed] cdb=None
383
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:652:init_distributed] cdb=None
384
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:652:init_distributed] cdb=None
385
+ [2025-02-16 02:18:10,590] [INFO] [comm.py:683:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl
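Note: the log lines above share one bracketed layout (timestamp, level, `file:line:function`, message). A minimal sketch of how such lines could be parsed and grouped by second follows. The regular expression and the helper name `parse_log_line` are assumptions inferred from the lines shown here, not part of DeepSpeed.

```python
import re
from collections import Counter
from datetime import datetime

# Pattern inferred from the log lines in this diff (an assumption, not a DeepSpeed API).
LOG_LINE = re.compile(
    r"\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\] "
    r"\[(?P<level>[A-Z]+)\] "
    r"\[(?P<file>[\w.]+):(?P<lineno>\d+):(?P<func>\w+)\] "
    r"(?P<msg>.*)"
)

def parse_log_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    m = LOG_LINE.search(line)  # search() tolerates the leading "+ " diff marker
    if m is None:
        return None
    rec = m.groupdict()
    rec["ts"] = datetime.strptime(rec["ts"], "%Y-%m-%d %H:%M:%S,%f")
    rec["lineno"] = int(rec["lineno"])
    return rec

sample = [
    "+ [2025-02-16 02:18:08,512] [INFO] [comm.py:652:init_distributed] cdb=None",
    "+ [2025-02-16 02:18:09,084] [INFO] [comm.py:652:init_distributed] cdb=None",
    "+ [2025-02-16 02:18:10,590] [INFO] [comm.py:683:init_distributed] "
    "Initializing TorchBackend in DeepSpeed with backend nccl",
]
records = [parse_log_line(line) for line in sample]

# Count how many messages were logged in each wall-clock second.
per_second = Counter(r["ts"].strftime("%H:%M:%S") for r in records if r)
```

Grouping by second is one simple way to see how far apart the ranks reached `init_distributed`.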
checkpoint-100000/added_tokens.json ADDED
@@ -0,0 +1,27 @@
+ {
+   "</tool_call>": 151658,
+   "<image>": 151667,
+   "<tool_call>": 151657,
+   "<|box_end|>": 151649,
+   "<|box_start|>": 151648,
+   "<|endoftext|>": 151643,
+   "<|file_sep|>": 151664,
+   "<|fim_middle|>": 151660,
+   "<|fim_pad|>": 151662,
+   "<|fim_prefix|>": 151659,
+   "<|fim_suffix|>": 151661,
+   "<|im_end|>": 151645,
+   "<|im_start|>": 151644,
+   "<|image_pad|>": 151655,
+   "<|object_ref_end|>": 151647,
+   "<|object_ref_start|>": 151646,
+   "<|quad_end|>": 151651,
+   "<|quad_start|>": 151650,
+   "<|repo_name|>": 151663,
+   "<|video_pad|>": 151656,
+   "<|vision_end|>": 151653,
+   "<|vision_pad|>": 151654,
+   "<|vision_start|>": 151652,
+   "[/IMG]": 151666,
+   "[IMG]": 151665
+ }
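Note: the 25 IDs in `added_tokens.json` appear to fill the range 151643 to 151667 with no gaps, which lines up with `"vocab_size": 151668` in `checkpoint-100000/config.json`. The sketch below is one way to check that. The helper name `find_id_gaps` is hypothetical and the dictionary is copied from the file above.

```python
# Token-to-ID map copied from checkpoint-100000/added_tokens.json in this diff.
ADDED_TOKENS = {
    "</tool_call>": 151658, "<image>": 151667, "<tool_call>": 151657,
    "<|box_end|>": 151649, "<|box_start|>": 151648, "<|endoftext|>": 151643,
    "<|file_sep|>": 151664, "<|fim_middle|>": 151660, "<|fim_pad|>": 151662,
    "<|fim_prefix|>": 151659, "<|fim_suffix|>": 151661, "<|im_end|>": 151645,
    "<|im_start|>": 151644, "<|image_pad|>": 151655,
    "<|object_ref_end|>": 151647, "<|object_ref_start|>": 151646,
    "<|quad_end|>": 151651, "<|quad_start|>": 151650, "<|repo_name|>": 151663,
    "<|video_pad|>": 151656, "<|vision_end|>": 151653, "<|vision_pad|>": 151654,
    "<|vision_start|>": 151652, "[/IMG]": 151666, "[IMG]": 151665,
}

def find_id_gaps(tokens):
    """Return (lowest_id, highest_id, sorted list of IDs missing in between)."""
    ids = set(tokens.values())
    lo, hi = min(ids), max(ids)
    missing = sorted(set(range(lo, hi + 1)) - ids)
    return lo, hi, missing

lo, hi, missing = find_id_gaps(ADDED_TOKENS)
VOCAB_SIZE = 151668  # value of "vocab_size" in checkpoint-100000/config.json

# The highest added ID should be the last valid index of the embedding table.
ids_fit_vocab = hi == VOCAB_SIZE - 1 and not missing
```

A check like this is useful before resizing embeddings, because a gap or an ID at or above `vocab_size` would point at a tokenizer and model mismatch.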
checkpoint-100000/config.json ADDED
@@ -0,0 +1,69 @@
+ {
+   "_name_or_path": "Qwen/Qwen2.5-VL-7B-Instruct",
+   "architectures": [
+     "LlavaQwenForCausalLM"
+   ],
+   "attention_dropout": 0.0,
+   "bos_token_id": 151643,
+   "eos_token_id": 151645,
+   "freeze_mm_mlp_adapter": false,
+   "gen_hidden_size": 1792,
+   "gen_pooling": "early_pool2d_4",
+   "gen_vision_tower": "eva-clip-E-14-plus",
+   "hidden_act": "silu",
+   "hidden_size": 3584,
+   "image_aspect_ratio": "square",
+   "image_token_id": 151655,
+   "initializer_range": 0.02,
+   "intermediate_size": 18944,
+   "max_position_embeddings": 128000,
+   "max_window_layers": 28,
+   "mm_patch_merge_type": "flat",
+   "mm_projector_lr": null,
+   "mm_projector_type": "mlp2x_gelu",
+   "mm_use_im_patch_token": false,
+   "mm_use_im_start_end": false,
+   "mm_vision_select_feature": "patch",
+   "mm_vision_select_layer": -2,
+   "model_type": "llava_qwen",
+   "n_query": 64,
+   "num_attention_heads": 28,
+   "num_hidden_layers": 28,
+   "num_key_value_heads": 4,
+   "pad_token_id": 151643,
+   "rms_norm_eps": 1e-06,
+   "rope_scaling": {
+     "mrope_section": [
+       16,
+       24,
+       24
+     ],
+     "rope_type": "default",
+     "type": "default"
+   },
+   "rope_theta": 1000000.0,
+   "sliding_window": 32768,
+   "tie_word_embeddings": false,
+   "tokenizer_model_max_length": 2048,
+   "tokenizer_padding_side": "right",
+   "torch_dtype": "bfloat16",
+   "transformers_version": "4.49.0.dev0",
+   "tune_mm_mlp_adapter": false,
+   "use_cache": false,
+   "use_mm_proj": true,
+   "use_sliding_window": false,
+   "video_token_id": 151656,
+   "vision_config": {
+     "hidden_size": 1280,
+     "in_chans": 3,
+     "model_type": "qwen2_5_vl",
+     "spatial_patch_size": 14,
+     "tokens_per_second": 2,
+     "torch_dtype": "bfloat16"
+   },
+   "vision_end_token_id": 151653,
+   "vision_start_token_id": 151652,
+   "vision_token_id": 151654,
+   "vision_tower_pretrained": null,
+   "vocab_size": 151668
+ }
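Note: `config.json` does not state the per-head dimension, but it can be derived from the fields it does give. The sketch below assumes the common convention `head_dim = hidden_size / num_attention_heads`. The helper name `derived_attention_shapes` is hypothetical and only the four fields it needs are copied from the file above.

```python
# Fields copied from checkpoint-100000/config.json in this diff.
cfg = {
    "hidden_size": 3584,
    "num_attention_heads": 28,
    "num_key_value_heads": 4,
    "rope_scaling": {"mrope_section": [16, 24, 24]},
}

def derived_attention_shapes(cfg):
    """Derive attention shapes, assuming head_dim = hidden_size // num_attention_heads."""
    head_dim = cfg["hidden_size"] // cfg["num_attention_heads"]
    return {
        "head_dim": head_dim,
        # Grouped-query attention: how many query heads share one key/value head.
        "queries_per_kv_head": cfg["num_attention_heads"] // cfg["num_key_value_heads"],
        # Output width of the key (or value) projection under that assumption.
        "kv_proj_width": cfg["num_key_value_heads"] * head_dim,
        # Total number of rotary frequency slots split across the mrope sections.
        "mrope_total": sum(cfg["rope_scaling"]["mrope_section"]),
    }

shapes = derived_attention_shapes(cfg)
```

Under this assumption the three `mrope_section` entries sum to half of `head_dim`, which is consistent with rotary embeddings that rotate dimensions in pairs.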
checkpoint-100000/generation_config.json ADDED
@@ -0,0 +1,15 @@
+ {
+   "attn_implementation": "flash_attention_2",
+   "bos_token_id": 151643,
+   "do_sample": true,
+   "eos_token_id": [
+     151645,
+     151643
+   ],
+   "pad_token_id": 151643,
+   "repetition_penalty": 1.05,
+   "temperature": 0.1,
+   "top_k": 1,
+   "top_p": 0.001,
+   "transformers_version": "4.49.0.dev0"
+ }
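Note: `generation_config.json` sets `"do_sample": true` together with `"top_k": 1`. With only one candidate left after top-k filtering, sampling should reduce to picking the highest-scoring token. The sketch below is a simplified illustration of that effect in plain Python. It is not the `transformers` implementation, it omits `repetition_penalty` and `top_p`, and the function name `sample_top_k` is hypothetical.

```python
import math
import random

def sample_top_k(logits, top_k, temperature, rng):
    """Sample one index after temperature scaling and top-k filtering (simplified)."""
    scaled = [x / temperature for x in logits]
    # Keep the indices of the top_k largest scaled logits.
    keep = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)[:top_k]
    # Softmax weights over the kept candidates, shifted by the max for stability.
    peak = max(scaled[i] for i in keep)
    weights = [math.exp(scaled[i] - peak) for i in keep]
    return rng.choices(keep, weights=weights, k=1)[0]

logits = [1.2, 3.4, 0.5, 3.1]  # made-up scores for illustration
picks = {
    sample_top_k(logits, top_k=1, temperature=0.1, rng=random.Random(seed))
    for seed in range(20)
}
greedy = max(range(len(logits)), key=lambda i: logits[i])
```

Every seed selects the same index as the greedy choice here, so the `temperature` value has no visible effect on this example while `top_k` is 1.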
checkpoint-100000/latest ADDED
@@ -0,0 +1 @@
+ global_step100000
checkpoint-100000/merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
checkpoint-100000/model-00001-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f1a60bf2f791b5de6ce91ff9728b4f236541fb2fd8b990868ff4365a1cb60e1f
+ size 4965863624
checkpoint-100000/model-00002-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b1830db6908dcc76df3a71492acbcf2b8cac130114cf1f3c2d9edae8de8c6de3
+ size 4991495816
checkpoint-100000/model-00003-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:09c1807c6d00d7cab94f7db39d4c02ebb8537225ccde383861ac48db97945aa6
+ size 4932751040
checkpoint-100000/model-00004-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:eedd8405c6cad3ff8b7990cfde9905f8ba17a01780dfa63a1f0827096ff02145
+ size 4999900758
checkpoint-100000/model-00005-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:85caeb523023e201267f25d483f5b9aa32a41165fcdb2c4af0504892defc9302
+ size 4971582792
checkpoint-100000/model-00006-of-00006.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f9eb53d377a698ed095f465913024fde162304954a2cd8a5e6403f78467d6ad7
+ size 3315894576
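Note: each `.safetensors` entry above is a Git LFS pointer (three `key value` lines), not the weights themselves. A minimal sketch of reading one pointer and totalling the six shard sizes follows. The helper name `parse_lfs_pointer` is hypothetical, and the pointer text and sizes are copied from this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")  # each line is "key value"
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }

# Pointer for model-00001-of-00006.safetensors, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f1a60bf2f791b5de6ce91ff9728b4f236541fb2fd8b990868ff4365a1cb60e1f
size 4965863624
"""
info = parse_lfs_pointer(POINTER)

# "size" values of the six shards, in shard order, copied from the diff above.
SHARD_SIZES = [
    4965863624, 4991495816, 4932751040,
    4999900758, 4971582792, 3315894576,
]
total_bytes = sum(SHARD_SIZES)
total_gb = total_bytes / 1e9  # decimal gigabytes
```

Summing the `size` fields gives the download footprint of the sharded weights without fetching any of the LFS objects.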
checkpoint-100000/model.safetensors.index.json ADDED
The diff for this file is too large to render. See raw diff
 
checkpoint-100000/rng_state_0.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d1b1162c88d3342dc201b3c192ea0257b89a4fb3c4ee89ea4e8ab42cf98603c5
+ size 15984
checkpoint-100000/rng_state_1.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:50f37287fafe7dcc3c94e9940022e2a929d40874bc4958d425ff8f0f390d8757
+ size 15984
checkpoint-100000/rng_state_10.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e03bfd601d219b0acc04c7922a1ffc7c3bfee6650468d7ae04b5266d6f514865
+ size 15997
checkpoint-100000/rng_state_100.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:796be6a13bc606c94e0fc03fd3e00d19b828c6232fd41aa7a57668f5a2f72e0c
+ size 16010
checkpoint-100000/rng_state_101.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:93cf899394a868b64668dbf8a3274817098b69b23cbe004720cf19f9b055bafa
+ size 16010
checkpoint-100000/rng_state_102.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:67f6a64df759b6c906db795b8d548588959824f1feb51e5156ae9eb2d7f40ed6
+ size 16010
checkpoint-100000/rng_state_103.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ef5199fc0cf7bfc0aa39f4484d8bf74a8758486687a3132ed2595e69c888b724
+ size 16010
checkpoint-100000/rng_state_104.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4bdbd657ae24de990bf0b37b8fc8dd0ca7c88c50aa0ec79eee9021094af2fbab
+ size 16010
checkpoint-100000/rng_state_105.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e585856bab0237c9defb0a84d114cb221d084e3730c871d0bb346e0a137d0059
+ size 16010
checkpoint-100000/rng_state_106.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c758b81a993c0012da33af3b913c9195b3c565456336d645dd3bf85901cb280f
+ size 16010
checkpoint-100000/rng_state_107.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c626b8b62231936d59b432f927e38b3195fc2fbb09f6f4e206ca60e3bea615b8
+ size 16010
checkpoint-100000/rng_state_108.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:df26203aee375c3effe4393ab48801d1426172aa16d3c78f3efffc716c010038
+ size 16010
checkpoint-100000/rng_state_109.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:99bed2535dc3f02f710c2adb30e2d1914df4235851d0227042d2a4ddb6db30ab
+ size 16010
checkpoint-100000/rng_state_11.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:79fbde036773c9b9f413d71593f20695867859c4d95dff728d9e33d40f594560
+ size 15997
checkpoint-100000/rng_state_110.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:724c0d940ed94141cc9123e3feb50dd1a84d302b6993ad893cde2013100ddd6a
+ size 16010
checkpoint-100000/rng_state_111.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b13506b1017ca21d834784ce6fa25cff72bb8d580e05d3895f66c46c46c9898b
+ size 16010
checkpoint-100000/rng_state_112.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2b112b4aedac1e7e9e452f098de33f0c3bef156e61ac253e4b33b03100c18d79
+ size 16010
checkpoint-100000/rng_state_113.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:423ca176406f06a4c9ce108150581efc156be3f56562117c706e9ed776c04fd1
+ size 16010
checkpoint-100000/rng_state_114.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:814d4897702c3fdb5b8fa6a3a714490d0e0224fc3730c022fd26f794311a331a
+ size 16010
checkpoint-100000/rng_state_115.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:da429d73c2dd0f77d610a2342f75f122c23be3c5e75264f830d96a63dd9a4cd0
+ size 16010
checkpoint-100000/rng_state_116.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0d3eb6dbcfea3335a02991452c5dc08ae0752051da1b96e5979f8613c501e429
+ size 16010
checkpoint-100000/rng_state_117.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f76ef359e5b6cefad13dbe4a96656eb9c2ebfede4b8ae1b31af839096868898c
+ size 16010
checkpoint-100000/rng_state_118.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8822f268ee4d50547b3fdab7e0a6d44eff0242b946c89c51adf9f7ab36627bd4
+ size 16010
checkpoint-100000/rng_state_119.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:87ec1b100408b81612b47e9d5cfa3a6750f241341e94216b64d2f33cf697cda9
+ size 16010
checkpoint-100000/rng_state_12.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5b10e2e464d440da4fead19b4ff2cfc49466d7dabac6100c90a6f2139e37f8d5
+ size 15997