Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages.

Running on T4

280

280

Aya Models

🌍

Interact with the Aya family of models.
CohereForAI/aya-vision-8b

Image-Text-to-Text • Updated 5 days ago • 144k • 202
CohereForAI/aya-vision-32b

Image-Text-to-Text • Updated 5 days ago • 577 • 143
CohereForAI/AyaVisionBench

Viewer • Updated 5 days ago • 3.11k • 1.81k • 14

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org

allenai/olmOCR-7B-0225-preview

Image-Text-to-Text • Updated 12 days ago • 142k • 494
allenai/olmOCR-mix-0225

Viewer • Updated 12 days ago • 259k • 2.8k • 75
allenai/olmOCR-7B-0225-preview-GGUF

Updated 11 days ago • 5.1k • 20

Phi-4 family of small language and multi-modal models.

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 231k • 1.03k
microsoft/Phi-4-mini-instruct

Text Generation • Updated 4 days ago • 102k • 318
microsoft/phi-4

Text Generation • Updated 13 days ago • 563k • • 1.88k
microsoft/Phi-4-multimodal-instruct-onnx

Automatic Speech Recognition • Updated 6 days ago • 679 • 53

deepseek-ai/DeepSeek-R1

Text Generation • Updated 13 days ago • 3.64M • • 11k
deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 13 days ago • 11.6k • 860
deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • Updated 13 days ago • 430k • • 620
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 13 days ago • 1.49M • • 1.24k

Qwen with Questions

Running

257

257

QwQ 32B Demo

🌖

Generate text responses based on user input
Qwen/QwQ-32B

Text Generation • Updated 2 days ago • 103k • • 1.65k
Qwen/QwQ-32B-AWQ

Text Generation • Updated 2 days ago • 37.3k • 67
Qwen/QwQ-32B-GGUF

Text Generation • Updated 2 days ago • 57.2k • 99

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.

Running

650

650

Qwen2.5

🚀

Chat with Qwen, a helpful assistant
Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349
Qwen/Qwen2.5-0.5B

Text Generation • Updated Sep 25, 2024 • 506k • • 226
Qwen/Qwen2.5-0.5B-Instruct

Text Generation • Updated Sep 25, 2024 • 1.09M • • 255

Hallucination detection

Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications.

KRLabsOrg/lettucedect-base-modernbert-en-v1

Token Classification • Updated 10 days ago • 3.41k • 14
KRLabsOrg/lettucedect-large-modernbert-en-v1

Token Classification • Updated 10 days ago • 299 • 21
LettuceDetect: A Hallucination Detection Framework for RAG Applications

Paper • 2502.17125 • Published 13 days ago • 7
Running

4

4

LettuceDetect

🥬

Let Us Detect your hallucinations! Demo for our framework.

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 17
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 1

SmolVLM2 📺 Smallest video LM ever 🤏🏻

Running on Zero

51

51

SmolVLM

📊

Generate text by analyzing images and videos
Running on A100

51

51

SmolVLM2 HighlightGenerator

🐨

Generate video highlights from uploaded video
Running

17

17

SmolVLM2 IPhone Waitlist

⏰

sign in to receive news on the iPhone app
Running on A100

20

20

SmolVLM2 XSPFGenerator (VLC prototype)

🎞

Generate video highlights and playlist

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks

CohereForAI/Global-MMLU

Viewer • Updated 6 days ago • 487k • 14.2k • 109
CohereForAI/Global-MMLU-Lite

Viewer • Updated Dec 19, 2024 • 9.23k • 4.75k • 16
CohereForAI/m-ArenaHard

Viewer • Updated 6 days ago • 10.5k • 688 • 18
CohereForAI/include-base-44

Viewer • Updated 4 days ago • 23.5k • 3.79k • 31

Previous
1
2
3
...
10,088
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs