SRMT: Shared Memory for Multi-agent Lifelong Pathfinding Paper • 2501.13200 • Published 2 days ago • 45
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 2 days ago • 61
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning Paper • 2501.12570 • Published 3 days ago • 17
IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems Paper • 2501.11067 • Published 5 days ago • 9
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 3 days ago • 28
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published 2 days ago • 44
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 2 days ago • 161
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 2 days ago • 51
The Geometry of Tokens in Internal Representations of Large Language Models Paper • 2501.10573 • Published 7 days ago • 7
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper • 2501.12224 • Published 3 days ago • 41
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Paper • 2501.12368 • Published 3 days ago • 34
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published 3 days ago • 73
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 10 days ago • 58
view article Article Yay! Organizations can now publish blog Articles By huggingface • 4 days ago • 29
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 26 items • Updated 4 days ago • 57
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 8 days ago • 37
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper • 2501.10020 • Published 8 days ago • 21
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 26