Mu Cai's picture

6 7 3

Mu Cai

mucai

·

https://pages.cs.wisc.edu/~mucai/

AI & ML interests

Computer Vision, Deep Learning, 3D Vision, Vision and Language,

Recent Activity

upvoted a paper about 1 month ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

authored a paper 5 months ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

commented on a paper 5 months ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

View all activity

Organizations

Collections 3

Papers 4

arxiv:2410.10818

arxiv:2410.02763

arxiv:2406.20095

arxiv:2405.17430

spaces 2

No application file

Matryoshka Multimodal Models

ViP-Bench Evaluator

models 14

mucai/vip-llava-7b-refcocog-ft

Text Generation • Updated Jun 28, 2024 • 11

mucai/llava-next-vicuna-7b-m3

Text Generation • Updated Jun 3, 2024 • 210 • 3

mucai/llava-v1.5-7b-m3

Text Generation • Updated Jun 3, 2024 • 1.43k • 2

mucai/vip-llava-phi-3-mini-3.8B

Text Generation • Updated Apr 26, 2024 • 8

mucai/vip-llava-llama-3-8b

Text Generation • Updated Apr 26, 2024 • 11

mucai/llava-1.5-phi-3-mini-3.8B

Text Generation • Updated Apr 26, 2024 • 12

mucai/llava-1.5-llama-3-8b

Text Generation • Updated Apr 26, 2024 • 8

mucai/vip-llava-13b-base

Text Generation • Updated Apr 21, 2024 • 24

mucai/vip-llava-7b-base-vcr-ft

Text Generation • Updated Feb 23, 2024 • 22

mucai/vip-llava-7b-base

Text Generation • Updated Jan 23, 2024 • 24

datasets 2

mucai/ViP-LLaVA-Instruct

Preview • Updated Feb 26, 2024 • 116 • 10

mucai/ViP-Bench

Viewer • Updated Dec 15, 2023 • 888 • 1.31k • 4