SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 5 days ago • 115
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 9 days ago • 134
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 18 days ago • 117
The Big Benchmarks Collection Collection Gathering benchmark spaces on the Hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 201
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 21 days ago • 56
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 27 days ago • 55
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published 26 days ago • 9
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published Jan 7 • 42
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published Jan 7 • 50
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 99
Unsloth 4-bit Dynamic Quants Collection Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit (see the sketch after this list) • 22 items • Updated 10 days ago • 50
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs Paper • 2410.05295 • Published Oct 3, 2024 • 12
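The selective-quantization idea behind the Unsloth Dynamic 4-bit Quants collection above, keeping a few sensitive modules in higher precision while quantizing the rest to 4-bit, can be sketched with the Transformers `BitsAndBytesConfig` API. This is a minimal illustration under stated assumptions, not Unsloth's actual layer-selection logic; the model ID and the skipped-module list are placeholder examples.

```python
# Minimal sketch of selectively skipping quantization for certain modules
# when loading a model in 4-bit with bitsandbytes via transformers.
# NOTE: the model id and the skipped-module list are hypothetical examples,
# not Unsloth's actual selection heuristic.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NF4 4-bit weight format
    bnb_4bit_compute_dtype=torch.bfloat16,  # run compute in bf16
    # Keep selected modules un-quantized (higher precision); this
    # parameter is also honored for 4-bit loading.
    llm_int8_skip_modules=["lm_head"],
)

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-0.5B-Instruct",  # example model id; swap for any HF repo
    quantization_config=bnb_config,
    device_map="auto",
)
```

In practice the collection ships pre-quantized checkpoints that can be loaded directly; the config above only illustrates the general "skip some modules" mechanism rather than reproducing Unsloth's quants.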