Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ We present SPHINX, a versatile multi-modal large language model (MLLM) with a mi
 <img src="figs/pipeline1.png"/ width="100%"> <br>
 </p>
 
-On top of SPHINX, we propose to further
+On top of SPHINX, we propose to further mix visual scales and sub-images for better capture fine-grained semantics on high-resolution images.
 <p align="left">
 <img src="figs/pipeline2.png"/ width="100%"> <br>
 </p>
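
The added README line describes mixing visual scales and sub-images for high-resolution inputs. As a rough illustration only, the sketch below enumerates the crop boxes that such a scheme might feed to a visual encoder at each scale. The function names, the 224-pixel tile size, and the 224/448 scales are assumptions chosen for the example; the diff does not confirm them, and this is not SPHINX's actual implementation.

```python
from typing import Dict, List, Tuple

# (left, top, right, bottom) in pixels
Box = Tuple[int, int, int, int]


def sub_image_boxes(width: int, height: int, tile: int) -> List[Box]:
    """Split a width x height image into non-overlapping tile x tile crop boxes."""
    if width % tile or height % tile:
        raise ValueError("image size must be a multiple of the tile size")
    return [
        (left, top, left + tile, top + tile)
        for top in range(0, height, tile)   # rows, top to bottom
        for left in range(0, width, tile)   # columns, left to right
    ]


def mixed_scale_views(scales: List[int], tile: int) -> Dict[int, List[Box]]:
    """For each (hypothetical) square scale, list the sub-image crop boxes."""
    return {s: sub_image_boxes(s, s, tile) for s in scales}


# Assumed setup: one low-resolution global view plus a 2x2 grid at twice the size.
views = mixed_scale_views(scales=[224, 448], tile=224)
for scale, boxes in views.items():
    print(scale, len(boxes), boxes)
```

With these assumed values, the smaller scale yields a single full-image view and the larger scale yields four sub-images, so fine detail is seen at higher resolution while the global view keeps overall context.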