InternVL 1.0 - a OpenGVLab Collection

OpenGVLab 's Collections

PVT

All-Seeing Project

InternVL 1.0

updated 20 days ago

Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

OpenGVLab/InternViT-6B-224px

Image Feature Extraction • Updated Aug 23 • 1.9k • 19
OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated Aug 23 • 1.88k • 33
OpenGVLab/InternVL-Chat-V1-2-Plus

Image-Text-to-Text • Updated Sep 24 • 329 • 33

Note Relased at 2024.02.21 | 40B parameters | More SFT data and stronger.
OpenGVLab/InternVL-Chat-V1-2

Image-Text-to-Text • Updated Sep 24 • 437 • 17

Note Released at 2024.02.11 | 40B parameters | scaling up LLM to 34B.
OpenGVLab/InternVL-Chat-V1-1

Image-Text-to-Text • Updated Sep 24 • 437 • 12

Note Released at 2024.01.24 | 19B parameters | support Chinese and stronger OCR
OpenGVLab/InternViT-6B-448px-V1-2

Image Feature Extraction • Updated Aug 23 • 1.03k • 25

Note Released at 2024.02.11 | Vision Foundation Model | 448 resolution
OpenGVLab/InternViT-6B-448px-V1-0

Image Feature Extraction • Updated Aug 23 • 24 • 8

Note Released at 2024.01.30 | Vision Foundation Model | 448 resolution
OpenGVLab/InternVL-14B-Flickr30K-FT-364px

Feature Extraction • Updated Aug 24 • 7 • 6
OpenGVLab/InternVL-14B-FlickrCN-FT-364px

Updated Aug 24 • 7 • 3
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

Visual Question Answering • Updated Aug 24 • 42 • 8
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B

Visual Question Answering • Updated Aug 24 • 170 • 7
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

Visual Question Answering • Updated Aug 24 • 32 • 3
OpenGVLab/InternVL

Updated 16 days ago • 20
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Paper • 2312.14238 • Published Dec 21, 2023 • 14

Note CVPR 2024, Oral