OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper ā¢ 2501.09751 ā¢ Published 3 days ago ā¢ 34 ā¢ 2
Multimodal LLMs Can Reason about Aesthetics in Zero-Shot Paper ā¢ 2501.09012 ā¢ Published 4 days ago ā¢ 10 ā¢ 2
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper ā¢ 2501.08828 ā¢ Published 4 days ago ā¢ 26 ā¢ 2
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Paper ā¢ 2501.09019 ā¢ Published 4 days ago ā¢ 10 ā¢ 2
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Paper ā¢ 2501.08292 ā¢ Published 5 days ago ā¢ 16 ā¢ 2
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper ā¢ 2501.01427 ā¢ Published 17 days ago ā¢ 49 ā¢ 3