Miscellaneous - a GayatriValley Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

GayatriValley 's Collections

Miscellaneous

updated Dec 13, 2024

Running on Zero

695

695

Unique3D

⚡

Create a 1M faces 3D colored model from an image!
Running on Zero

50

50

Paligemma Doc

📚

Try PaliGemma on document understanding tasks
wangfuyun/PCM_Weights

Text-to-Image • Updated Oct 30, 2024 • 26 • 85
Running on Zero

397

397

Stable Audio Open Zero

🔥

Generate audio from text prompts
Running on T4

311

311

PaliGemma Demo

🤲

Annotate and describe images with text prompts
Running on Zero

41

41

T2V Turbo

🖼

Fastest high-quality video diffusion model.
atcsecure/dolphin-2.9.2-qwen72b-8.0bpw-h8-exl2

Text Generation • Updated Jun 9, 2024 • 11 • 2
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 390k • 2.85k
DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • Updated Aug 13, 2024 • 1.7k • 40
SakanaAI/DiscoPOP-zephyr-7b-gemma

Text Generation • Updated Jun 13, 2024 • 5.01k • 36
madebyollin/taesd3

Updated Jun 14, 2024 • 104 • 34
hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17, 2024 • 45k • 56
Running on Zero

83

83

NaRCan

💊

Generate an edited video from a prompt
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF

Text Generation • Updated Aug 2, 2024 • 536 • 13
Running on Zero

86

86

DiffIR2VR

👌

Video upscaler/restorer
CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 212 • 448
cognitivecomputations/dolphin-vision-72b

Text Generation • Updated Jul 16, 2024 • 892 • 122
Running on Zero

72

72

Florence-2 for Videos

🎬

Generate annotated video with object detection
Running on Zero

129

129

FLUX.1-dev + Captioner

🐨

Generate images from text or captions
Running on Zero

308

308

Video Transcription Smart Summary

⚡

Generate summaries from YouTube videos or uploaded videos
qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • Updated Sep 21, 2024 • 2.18k • 107
Running on Zero

124

124

nanoLLaVA-1.5

🚀

Chat about images with AI
THUDM/codegeex4-all-9b

Text Generation • Updated Jul 18, 2024 • 572 • 244
Running

10

10

Langflow Crewai

💻

Launch a language processing workflow
Running on Zero

655

655

Tile Upscaler

🚀

Enhance and upscale images with controlnet guidance
Running

193

193

Whisper Timestamped

🕒

In-browser speech recognition w/ word-level timestamps
Running on Zero

1.8k

1.8k

IDM VTON

👕

High-fidelity Virtual Try-on
deepseek-ai/DeepSeek-V2-Chat-0628

Text Generation • Updated Jul 18, 2024 • 395 • 175
TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF

Updated Jul 14, 2024 • 4.31k • 63
fal/AuraFlow

Text-to-Image • Updated Jul 18, 2024 • 2.81k • 640
xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 85.9k • 1.28k
TheBloke/MythoMax-L2-13B-GPTQ

Text Generation • Updated Sep 27, 2023 • 3.35k • 193
Gryphe/MythoMax-L2-13b

Text Generation • Updated Apr 21, 2024 • 13.2k • 282
Gryphe/Pantheon-RP-1.0-8b-Llama-3

Text Generation • Updated May 13, 2024 • 106 • 46
Gryphe/Tiamat-8b-1.2-Llama-3-DPO

Text Generation • Updated May 3, 2024 • 11 • 6
BeaverLegacy/Smegmma-9B-v1

Text Generation • Updated Jul 13, 2024 • 88 • 44
mradermacher/Nymph_8B-i1-GGUF

Updated Aug 2, 2024 • 85 • 1
Sleeping

29

29

MusiConGen

🪩
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Text Generation • Updated Sep 14, 2024 • 248k • 151
Runtime error

58

58

CosyVoice 300M

📉
FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 1.56k • 213
Sleeping

21

21

Video-to-Audio Ldm

🎧

Video-to-Audio Generation with Hidden Alignment
CofeAI/Tele-FLM-1T

Text Generation • Updated Jul 29, 2024 • 43 • 80
maxin-cn/Cinemo

Image-to-Video • Updated Aug 14, 2024 • 17 • 32
Running on Zero

202

202

Cinemo

🎥

Multimodal Image-to-Video
Running

15

15

Mms Zeroshot

🌍

Generate transcript from audio input
Running on Zero

54

54

AccDiffusion

🏆

Generate images from text prompts
Running on Zero

183

183

Artist

🎨

Aesthetically Controllable Text-Driven Stylization w/o Train
Running on Zero

82

82

EchoMimic

🐨

Generate lifelike audio-driven portrait animations from images and audio
HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 48.5k • 264
parler-tts/parler-tts-mini-v1

Text-to-Speech • Updated Nov 25, 2024 • 24k • 133
parler-tts/parler-tts-large-v1

Text-to-Speech • Updated Nov 22, 2024 • 19.3k • 237
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • Updated Nov 20, 2024 • 22.2k • 93
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.58M • • 8.58k
Running on Zero

154

154

CatVTON

🐈

Try on clothes virtually with images
wanglab/ecg-fm-preprint

Updated Aug 8, 2024 • 7
XLabs-AI/flux-lora-collection

Text-to-Image • Updated Aug 14, 2024 • 504
Runtime error

57

57

Vgg Heads

🖼
migtissera/Tess-3-Mistral-Nemo-12B

Updated Sep 4, 2024 • 5.19k • 12
nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 124 • 106
Running

609

609

Qwen2-VL-72B

🌖

Engage in multi-modal conversations with images and videos
DAMO-NLP-SG/VideoLLaMA2-72B

Visual Question Answering • Updated Aug 14, 2024 • 35 • 10
answerdotai/answerai-colbert-small-v1

Updated Nov 18, 2024 • 3.04M • 141
mlabonne/Hermes-3-Llama-3.1-8B-lorablated-GGUF

Updated Aug 16, 2024 • 1.28k • 24
labotollama3/lobotollama-5.5b

Text Generation • Updated Apr 22, 2024 • 5 • 4
Mozilla/whisperfile

Updated Oct 2, 2024 • 1.79k • 240
Running

44

44

FAI Fuzer Medium v0.3

🎨

Generate images by combining a foreground with a custom background
ZhengPeng7/BiRefNet

Image Segmentation • Updated 4 days ago • 650k • 313
Running on CPU Upgrade

7.28k

7.28k

Kolors Virtual Try-On

👕

Virtual try-on for clothes on a person
fal/AuraFace-v1

Updated Aug 26, 2024 • 84
cognitivecomputations/dolphin-2.9.4-gemma2-2b

Updated Aug 27, 2024 • 54 • 36
pzc163/MiniCPMv2_6-prompt-generator

Updated Aug 24, 2024 • 436 • 41
Running on L40S

833

833

CogVideoX-5B

🎥

Text-to-Video
yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6, 2024 • 510 • 130
InstantX/FLUX.1-dev-Controlnet-Union

Updated Aug 26, 2024 • 39.5k • 392
Running on Zero

73

73

Qwen2-VL-2B

🔥

Generate text from images and videos
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • Updated 29 days ago • 1.02M • 392
Running

56

56

Groq Gradio Voice Assistant

👁

Process audio to text and generate AI response
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 14 • 30
facebook/sapiens

Updated Sep 20, 2024 • 2 • 230
Running on Zero

27

27

Tb Ocr

📈

Convert image text to markdown format
YuWangX/memoryllm-8b-chat

Updated Nov 17, 2024 • 44 • 19
Running

128

128

HivisionIDPhotos

🌖

Remove background from ID photos
virtuals-protocol/mario-videogamegen

Updated Sep 6, 2024 • 13
Running on Zero

247

247

Qwen2-VL-7B

🔥

Generate text by combining an image and a question
Runtime error

257

257

Latent Navigation

🪐
mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 585 • 1.71k
Running on Zero

101

101

ViewCrafter

🐨

Create a video from an image with camera motion
Running on Zero

18

18

Text Image Analyzer

💻

Analyse any image with Llama3.2
vidore/colqwen2-v0.1

Updated 4 days ago • 42k • 167
Runtime error

12

12

Llama 3.2 Vision Free

🐢
facebook/Self-taught-evaluator-llama3.1-70B

Updated Sep 30, 2024 • 41
openai/clip-vit-large-patch14-336

Zero-Shot Image Classification • Updated Oct 4, 2022 • 4.43M • 223
jasperai/Flux.1-dev-Controlnet-Upscaler

Image-to-Image • Updated Sep 30, 2024 • 10.7k • 576
Running on Zero

292

292

Diffusers Image Fill

🏃

Erase or change parts of images using masks
Runtime error

30

30

PDF to Page Images Dataset

📂

Convert PDFs to page images for dataset creation
Running on Zero

74

74

ColPali fine-tuning Query Generator

🔍

Generate retrieval queries from document images
Running on Zero

9

9

Vision Pipeline

🌍

Query an image index to get answers
nvidia/NVLM-D-72B

Image-Text-to-Text • Updated 26 days ago • 47.9k • 766
Running on Zero

770

770

Whisper Turbo

🤯

Transcribe or translate audio and YouTube videos
davanstrien/ufo-ColPali

Viewer • Updated Sep 23, 2024 • 2.24k • 325 • 22
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 95 • 62
Running on Zero

216

216

OpenMusic

🎶

Generate high-quality music from text descriptions
Running

394

394

Pdf2audio

📚

Generate detailed script for podcast or lecture from text input
Running on Zero

228

228

Ultrapixel-demo

😻

Ultra-high resolution image synthesis
Running

36

36

KoolCogVideoX

🎥

Text-to-Video
PleIAs/OCRonos-Vintage

Text Generation • Updated Aug 8, 2024 • 223 • 79
Running on Zero

261

261

EzAudio

🟣

Generate and edit audio from text prompts
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated 6 days ago • 411k • 1.37k
Running on CPU Upgrade

600

600

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection
Running

62

62

ArxivCopilot

🏢

Generate personalized research profiles and chat with Arxiv Copilot
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 1 • 416
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 383
ICTNLP/Llama-3.1-8B-Omni

Updated Nov 14, 2024 • 3.3k • 393
fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 1.06k • 448
bartowski/Reflection-Llama-3.1-70B-GGUF

Text Generation • Updated Sep 7, 2024 • 1.45k • 54
lelapa/InkubaLM-0.4B

Text Generation • Updated Sep 5, 2024 • 1.92k • 44
Running

139

139

Qwen 2.5 Code Interpreter

🐍

Interpret and execute code with responses
Running on Zero

269

269

Virtual Try On

👕

High-fidelity Virtual Try-on
Running on Zero

33

33

Ferret Demo

📚

Upload an image and ask questions about it
Running on T4

45

45

ColPali 🤝 Vespa - Visual Retrieval

👀

Visual Retrieval with ColPali and Vespa
oxyapi/oxy-1-small

Text Generation • Updated Dec 4, 2024 • 13.9k • 76
QuantFactory/MN-Chunky-Lotus-12B-GGUF

Updated Dec 4, 2024 • 190 • 3
Running

14

14

ScholarCopilot

📊

Using RAG LLM to assist your academic writing
Running on L40S

450

450

Leffa

👗

Generate images with virtual try-on or pose transfer
Lightricks/LTX-Video

Image-to-Video • Updated 5 days ago • 319k • 948

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs