ThucPD commited on
Commit
3ee14cd
Β·
verified Β·
1 Parent(s): 5def718

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +84 -8
README.md CHANGED
@@ -61,14 +61,90 @@ One standing-out feature of **EraX-VL-2B-V2.0** is the capability to do multi-tu
61
 
62
  ## Benchmarks πŸ“Š
63
 
64
- <!-- <!-- ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/66e93d483745423cbb14c5ff/-OYkSDVyAcAcLLgO2N5XT.jpeg) -->
65
- Below is the evaluation benchmark of **global open-source and proprietary Multimodal Models** on the [MTVQA](https://huggingface.co/datasets/ByteDance/MTVQA) Vietnamese test set conducted by [VinBigdata](https://www.linkedin.com/feed/update/urn:li:activity:7243887708966641664/). We plan to conduct more detailed and diverse evaluations in the near future.
66
- <div align="left">
67
- <img src="https://cdn-uploads.huggingface.co/production/uploads/66e93d483745423cbb14c5ff/-OYkSDVyAcAcLLgO2N5XT.jpeg" width="500"/>
68
- <a href="https://www.linkedin.com/feed/update/urn:li:activity:7243887708966641664/" target="_blank">Source: VinBigData</a>
69
- <br>(20:00 23rd Sept 2024)
70
- </div>
71
- -->
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
72
 
73
  ## API trial πŸŽ‰
74
  Please contact **[email protected]** for API access inquiry.
 
61
 
62
  ## Benchmarks πŸ“Š
63
 
64
+ ## πŸ† LeaderBoard
65
+
66
+ <table style="width:75%;">
67
+ <tr>
68
+ <th>Models</th>
69
+ <td><b>Open-Source</b></td>
70
+ <td><b>VI</b></td>
71
+ </tr>
72
+ <tr>
73
+ <th align="left"><font color=darkred>Qwen2-VL 72BπŸ₯‡</font></th>
74
+ <td align="middle">✘</td>
75
+ <td>41.6 </td>
76
+ </tr>
77
+ <tr>
78
+ <th align="left"><font color=darkred>ViGPT-VL πŸ₯ˆ </font></th>
79
+ <td align="middle">✘</td>
80
+ <td>39.1 </td>
81
+ </tr>
82
+ <tr>
83
+ <th align="left"><font color=darkred>EraX-VL-2B-V2.0 πŸ₯‰ </font></th>
84
+ <td align="middle"> βœ… </td>
85
+ <td>38.2 </td>
86
+ </tr>
87
+ <tr>
88
+ <th align="left"><font color=darkred>EraX-VL-7B-V1 </font></th>
89
+ <td align="middle"> βœ… </td>
90
+ <td>37.6 </td>
91
+ </tr>
92
+ <tr>
93
+ <th align="left"><font color=darkred>Vintern-1B-V2</font></th>
94
+ <td align="middle"> βœ… </td>
95
+ <td>37.4 </td>
96
+ </tr>
97
+ <tr>
98
+ <th align="left"><font color=darkred>Qwen2-VL 7B </font></th>
99
+ <td align="middle"> βœ… </td>
100
+ <td>30.0 </td>
101
+ </tr>
102
+ <tr>
103
+ <th align="left"><font color=darkred>Claude3 Opus</font></th>
104
+ <td align="middle">✘</td>
105
+ <td>29.1 </td>
106
+ </tr>
107
+ <tr>
108
+ <th align="left"><font color=darkred>GPT-4o mini </font></th>
109
+ <td align="middle"> ✘ </td>
110
+ <td>29.1 </td>
111
+ </tr>
112
+ <tr>
113
+ <th align="left">GPT-4V</th>
114
+ <td align="middle">✘</td>
115
+ <td>28.9 </td>
116
+ </tr>
117
+ <tr>
118
+ <th align="left"><font color=darkred>Gemini Ultra </font></th>
119
+ <td align="middle">✘</td>
120
+ <td>28.6 </td>
121
+ </tr>
122
+ <tr>
123
+ <th align="left">InternVL2 76B</th>
124
+ <td align="middle"> βœ… </td>
125
+ <td>26.9 </td>
126
+ </tr>
127
+ <tr>
128
+ <th align="left">QwenVL Max</th>
129
+ <td align="middle">✘</td>
130
+ <td>23.5 </td>
131
+ </tr>
132
+ <tr>
133
+ <th align="left">Claude3 Sonnet</th>
134
+ <td align="middle">✘</td>
135
+ <td>20.8 </td>
136
+ </tr>
137
+ <tr>
138
+ <th align="left">QwenVL Plus</th>
139
+ <td align="middle">✘</td>
140
+ <td>18.1 </td>
141
+ </tr>
142
+ <tr>
143
+ <th align="left"><font color=blue>MiniCPM-V2.5</font></th>
144
+ <td align="middle">βœ…</td>
145
+ <td>15.3 </td>
146
+ </tr>
147
+ </table>
148
 
149
  ## API trial πŸŽ‰
150
  Please contact **[email protected]** for API access inquiry.