Update README.md
README.md
CHANGED
@@ -11,6 +11,7 @@ metrics:

Logic2Vision is a [LLaVA-1.5-13B](https://huggingface.co/llava-hf/llava-1.5-13b-hf) model finetuned on [VisReas dataset](https://arxiv.org/abs/2403.10534) for complex visual reasoning tasks.

## Model Details

@@ -81,7 +82,9 @@ processor.batch_decode(generate_ids, skip_special_tokens=True)

## Bias, Risks, and Limitations

-

## Training / Evaluation Details

@@ -90,7 +93,7 @@ LoRA modules were attached to `["q_proj", "v_proj"]`. We use DDP for distributed

### Results

-

## Citation

@@ -108,4 +111,5 @@ TBD

## Model Card Authors

-

Logic2Vision is a [LLaVA-1.5-13B](https://huggingface.co/llava-hf/llava-1.5-13b-hf) model finetuned on [VisReas dataset](https://arxiv.org/abs/2403.10534) for complex visual reasoning tasks.

+![results](https://huggingface.co/RE-N-Y/logic2vision/resolve/main/Code2Vision.png)

## Model Details

## Bias, Risks, and Limitations

+The model has been mostly trained on the VisReas dataset, which is generated from the [Visual Genome](https://homes.cs.washington.edu/~ranjay/visualgenome/index.html) dataset.
+Furthermore, the VLM was mostly finetuned to solve visual reasoning tasks by "generating python pseudocode" outputs provided by the user.
+Hence, it may struggle to adapt to different prompt styles and code formats.

## Training / Evaluation Details

### Results

+![results](https://huggingface.co/RE-N-Y/logic2vision/resolve/main/results.png)

## Citation

## Model Card Authors

+- Sangwu Lee - [Google Scholar](https://scholar.google.com/citations?user=FBJeGpAAAAAJ) - [Github](https://github.com/RE-N-Y) - [LinkedIn](https://www.linkedin.com/in/sangwulee/)
+- Syeda Akter - [Google Scholar](https://scholar.google.com/citations?hl=en&user=tZFFHYcAAAAJ) - [Github](https://github.com/snat1505027) - [LinkedIn](https://www.linkedin.com/in/syeda-nahida-akter-989770114/)