Commit 9f21b15 (verified) by RE-N-Y · 1 Parent(s): d4cf0af

Update README.md

Files changed (1)
  1. README.md +7 -3
README.md CHANGED
@@ -11,6 +11,7 @@ metrics:

  Logic2Vision is a [LLaVA-1.5-13B](https://huggingface.co/llava-hf/llava-1.5-13b-hf) model finetuned on [VisReas dataset](https://arxiv.org/abs/2403.10534) for complex visual reasoning tasks.

+ ![results](https://huggingface.co/RE-N-Y/logic2vision/resolve/main/Code2Vision.png)

  ## Model Details

@@ -81,7 +82,9 @@ processor.batch_decode(generate_ids, skip_special_tokens=True)

  ## Bias, Risks, and Limitations

- TBD
+ The model has been trained mostly on the VisReas dataset, which is generated from the [Visual Genome](https://homes.cs.washington.edu/~ranjay/visualgenome/index.html) dataset.
+ Furthermore, the VLM was mostly finetuned to solve visual reasoning tasks by "generating python pseudocode" outputs provided by the user.
+ Hence, it may struggle to adapt to different prompt styles and code formats.

  ## Training / Evaluation Details

@@ -90,7 +93,7 @@ LoRA modules were attached to `["q_proj", "v_proj"]`. We use DDP for distributed

  ### Results

- TBD
+ ![results](https://huggingface.co/RE-N-Y/logic2vision/resolve/main/results.png)

  ## Citation

@@ -108,4 +111,5 @@ TBD

  ## Model Card Authors

- TBD
+ - Sangwu Lee - [Google Scholar](https://scholar.google.com/citations?user=FBJeGpAAAAAJ) - [Github](https://github.com/RE-N-Y) - [LinkedIn](https://www.linkedin.com/in/sangwulee/)
+ - Syeda Akter - [Google Scholar](https://scholar.google.com/citations?hl=en&user=tZFFHYcAAAAJ) - [Github](https://github.com/snat1505027) - [LinkedIn](https://www.linkedin.com/in/syeda-nahida-akter-989770114/)