Mu Cai
mucai
AI & ML interests
Computer Vision, Deep Learning, 3D Vision, Vision and Language,
Recent Activity
upvoted
a
paper
about 1 month ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
authored
a paper
5 months ago
TemporalBench: Benchmarking Fine-grained Temporal Understanding for
Multimodal Video Models
commented on
a paper
5 months ago
TemporalBench: Benchmarking Fine-grained Temporal Understanding for
Multimodal Video Models
Organizations
Collections
3
models
14

mucai/vip-llava-7b-refcocog-ft
Text Generation
•
Updated
•
11

mucai/llava-next-vicuna-7b-m3
Text Generation
•
Updated
•
210
•
3

mucai/llava-v1.5-7b-m3
Text Generation
•
Updated
•
1.43k
•
2

mucai/vip-llava-phi-3-mini-3.8B
Text Generation
•
Updated
•
8

mucai/vip-llava-llama-3-8b
Text Generation
•
Updated
•
11

mucai/llava-1.5-phi-3-mini-3.8B
Text Generation
•
Updated
•
12

mucai/llava-1.5-llama-3-8b
Text Generation
•
Updated
•
8

mucai/vip-llava-13b-base
Text Generation
•
Updated
•
24

mucai/vip-llava-7b-base-vcr-ft
Text Generation
•
Updated
•
22

mucai/vip-llava-7b-base
Text Generation
•
Updated
•
24