Cheng-Yu Hsieh's picture

1 4

Cheng-Yu Hsieh

cydhsieh01

·

https://chengyuhsieh.github.io/

cydhsieh

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

updated a model 2 months ago

vila-molmo/molmo-dense-captioner-v22-qwen2

upvoted a paper 6 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

View all activity

Organizations

cydhsieh01's activity

upvoted a paper 6 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 22

upvoted 3 papers 7 months ago

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Paper • 2407.06723 • Published Jul 9, 2024 • 11

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps

Paper • 2407.07071 • Published Jul 9, 2024 • 12

Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

Paper • 2406.16008 • Published Jun 23, 2024 • 6