Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.18600

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29
Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 50
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 60
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

about 8 hours ago

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 26
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 13
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 43
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 22

about 9 hours ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 147
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 13
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 55
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 47

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 104
Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 9 days ago • 38

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 3 days ago • 58
Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 110
Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 3 days ago • 55
When an LLM is apprehensive about its answers -- and when its uncertainty is justified

Paper • 2503.01688 • Published 3 days ago • 18

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 9 days ago • 38

about 10 hours ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 8 days ago • 24
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published 10 days ago • 24
Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 17 days ago • 28
Language Models can Self-Improve at State-Value Estimation for Better Search

Paper • 2503.02878 • Published 2 days ago • 7

GRADIO examples

Running

2

2

Whisper Realtime Transcription (Gradio UI)

👂

Transcribe audio in realtime - Gradio UI version
Running

6

6

DeepSeek R1 Distill Qwen 1.5B Demo Q8

🔥

DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 9 days ago • 38

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106
Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published Jan 20 • 32
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Paper • 2501.09775 • Published Jan 16 • 29
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published Jan 16 • 47
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 339
s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 108

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs