7 22 94

Zijian Zhou PRO

franciszzj

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

upvoted a paper 5 days ago

Qwen2.5-VL Technical Report

liked a Space 11 days ago

ameerazam08/Diffusion-Eraser

View all activity

Organizations

None yet

franciszzj's activity

upvoted 2 papers 5 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 6 days ago • 115

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 7 days ago • 145

upvoted a paper about 2 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection about 2 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 207

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 346

upvoted a collection 2 months ago

AI Paper of the Day

Collection

A collection of papers that I think are interesting, one added each day • 300 items • Updated 4 days ago • 37

upvoted 2 papers 3 months ago

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published Dec 11, 2024 • 45

Learning Flow Fields in Attention for Controllable Person Image Generation

Paper • 2412.08486 • Published Dec 11, 2024 • 33

upvoted 2 papers 4 months ago

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 93

upvoted an article 6 months ago

Article

Breaking resolution curse of vision-language models

•

Feb 24, 2024

• 14

upvoted a collection 6 months ago

Playground v2

Collection

Collection of Playground v2 models • 4 items • Updated Dec 6, 2023 • 7

upvoted 2 papers 7 months ago

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Paper • 2407.11213 • Published Jul 15, 2024 • 3

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Paper • 2407.16224 • Published Jul 23, 2024 • 27

upvoted 3 papers 9 months ago

SF-V: Single Forward Video Generation Model

Paper • 2406.04324 • Published Jun 6, 2024 • 24

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 73

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Paper • 2403.06728 • Published Mar 11, 2024 • 2

upvoted 3 papers about 1 year ago