Update README.md
README.md CHANGED
@@ -17,7 +17,7 @@ LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-gener
 It is an auto-regressive language model, based on the transformer architecture.
 
 **Model date:**
-LLaVA-
+LLaVA-v1.5-7B was trained in September 2023.
 
 **Paper or resources for more information:**
 https://llava-vl.github.io/
@@ -38,12 +38,12 @@ The primary intended users of the model are researchers and hobbyists in compute
 
 ## Training dataset
 - 558K filtered image-text pairs from LAION/CC/SBU, captioned by BLIP.
-
+- 158K GPT-generated multimodal instruction-following data.
+- 450K academic-task-oriented VQA data mixture.
+- 40K ShareGPT data.
 
 ## Evaluation dataset
-A
-We also evaluate our model on the ScienceQA dataset. Our synergy with GPT-4 sets a new state-of-the-art on the dataset.
-See https://llava-vl.github.io/ for more details.
+A collection of 12 benchmarks, including 5 academic VQA benchmarks and 7 recent benchmarks specifically proposed for instruction-following LMMs.
 
 ## Usage
 usage is as follows
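The hunk ends at the `## Usage` heading, so the model card's actual usage instructions are not visible in this diff. As a rough sketch only: the snippet below shows one common way to load and query LLaVA-v1.5-7B through the Hugging Face `transformers` LLaVA integration. The checkpoint id `llava-hf/llava-1.5-7b-hf`, the prompt template, and the image path are assumptions for illustration, not taken from this README.

```python
# Minimal sketch: querying LLaVA-v1.5-7B via the transformers LLaVA integration.
# Assumes transformers >= 4.36 and the community-converted checkpoint
# "llava-hf/llava-1.5-7b-hf"; neither is specified by this README diff.
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # assumed checkpoint id
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16
).to("cuda")
processor = AutoProcessor.from_pretrained(model_id)

# LLaVA-v1.5 conversation template: "USER: <image>\n<question> ASSISTANT:"
prompt = "USER: <image>\nWhat is shown in this image? ASSISTANT:"
image = Image.open("example.jpg")  # replace with any local image

inputs = processor(images=image, text=prompt, return_tensors="pt").to("cuda", torch.float16)
output_ids = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output_ids[0], skip_special_tokens=True))
```

The upstream LLaVA repository also ships its own loading utilities and CLI entry points; the `transformers` route above is just one conservative way to exercise the released checkpoint.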