VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper β’ 2501.13106 β’ Published 4 days ago β’ 66
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 4 days ago β’ 185
DONOTUSE Collection A look into hate speech in LLM data and how to combat it. β’ 5 items β’ Updated Oct 1, 2024
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models Paper β’ 2409.13592 β’ Published Sep 20, 2024 β’ 49 β’ 9