Collections
Collections including paper arxiv:2408.00874

- RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
  Paper • 2404.07839 • Published • 41
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 60
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
  Paper • 2404.05674 • Published • 13
- Agentless: Demystifying LLM-based Software Engineering Agents
  Paper • 2407.01489 • Published • 42

- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
  Paper • 2403.09611 • Published • 124
- Evolutionary Optimization of Model Merging Recipes
  Paper • 2403.13187 • Published • 50
- MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
  Paper • 2402.03766 • Published • 12
- LLM Agent Operating System
  Paper • 2403.16971 • Published • 65

- EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
  Paper • 2311.02077 • Published • 14
- System 2 Attention (is something you might need too)
  Paper • 2311.11829 • Published • 39
- Large Language Models for Mathematicians
  Paper • 2312.04556 • Published • 11
- VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
  Paper • 2403.00522 • Published • 44

- Tracking Anything with Decoupled Video Segmentation
  Paper • 2309.03903 • Published • 27
- ProPainter: Improving Propagation and Transformer for Video Inpainting
  Paper • 2309.03897 • Published • 26
- UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
  Paper • 2312.15715 • Published • 19
- SAM 2: Segment Anything in Images and Videos
  Paper • 2408.00714 • Published • 107