Zesen Cheng
ClownRat
AI & ML interests
multi-modal foundation model; Segmentation, Detection, and Tracking;
Recent Activity
authored
a paper
1 day ago
GraCo: Granularity-Controllable Interactive Segmentation
authored
a paper
1 day ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video
Understanding