Update README.md
README.md
CHANGED
@@ -17,7 +17,7 @@ pipeline_tag: visual-question-answering
 ## Model Details
 - **Model Type:** vision large language model, multimodal chatbot
 - **Model Stats:**
-- Architecture: [InternViT-6B-448px-V1-5](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-
+- Architecture: [InternViT-6B-448px-V1-5](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-5) + MLP + [InternLM2-Chat-20B](https://huggingface.co/internlm/internlm2-chat-20b)
 - Params: 25.5B
 - Image size: dynamic resolution, max to 40 tiles of 448 x 448 during inference.
 - Number of visual tokens: 256 * (number of tiles + 1)
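The visual-token formula in the README, `256 * (number of tiles + 1)`, can be checked with a small sketch. The function name `visual_token_count` and the constants below are hypothetical helpers, not part of the model's code; the lower bound of one tile is an assumption, while the cap of 40 tiles and the factor of 256 come from the README text.

```python
# Hypothetical helper illustrating the README's visual-token formula.
TOKENS_PER_TILE = 256  # from the README: 256 visual tokens per tile
MAX_TILES = 40         # from the README: at most 40 tiles of 448 x 448 at inference
MIN_TILES = 1          # assumption: at least one tile per image


def visual_token_count(num_tiles: int) -> int:
    """Return 256 * (num_tiles + 1), the count stated in the README."""
    if not MIN_TILES <= num_tiles <= MAX_TILES:
        raise ValueError(
            f"num_tiles must be between {MIN_TILES} and {MAX_TILES}, got {num_tiles}"
        )
    # The "+ 1" term is taken directly from the README's formula.
    return TOKENS_PER_TILE * (num_tiles + 1)


if __name__ == "__main__":
    for tiles in (1, 12, MAX_TILES):
        print(tiles, visual_token_count(tiles))
```

At the 40-tile maximum the formula gives 256 * 41 = 10496 visual tokens per image, which is worth keeping in mind when budgeting the language model's context length.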