X2FD

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

menglc authored a paper 28 days ago

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

menglc authored a paper 7 months ago

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

menglc authored a paper 7 months ago

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

View all activity

X2FD's activity

menglc

authored a paper 28 days ago

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Paper • 2412.03565 • Published Dec 4, 2024 • 11

menglc

authored 2 papers 7 months ago

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Paper • 2311.14671 • Published Nov 24, 2023

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Paper • 2406.04334 • Published Jun 6, 2024

Daniel0724

authored a paper 12 months ago

MouSi: Poly-Visual-Expert Vision-Language Models

Paper • 2401.17221 • Published Jan 30, 2024 • 8

Daniel0724

updated 3 datasets about 1 year ago

X2FD/LVIS-Instruct4V-Nodetail-mix619k

Updated Nov 28, 2023 • 12

X2FD/LVIS-Instruct4V-LLaVA-Instruct-mix880k

Updated Nov 28, 2023 • 10 • 1

X2FD/LVIS-Instruct4V-mix730k

Updated Nov 21, 2023 • 12 • 3

menglc

authored a paper about 1 year ago

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Paper • 2311.07574 • Published Nov 13, 2023 • 14

Daniel0724

authored a paper about 1 year ago

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Paper • 2311.07574 • Published Nov 13, 2023 • 14

Daniel0724

updated a dataset about 1 year ago

X2FD/LVIS-Instruct4V

Viewer • Updated Nov 13, 2023 • 223k • 17 • 83