Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 6 days ago • 36
view article Article Distilling from Dialogues: Finding Meaning in LLM Interactions By chansung • about 13 hours ago • 3
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published about 24 hours ago • 5
LoRAs / Models (SDXL1.0, Pony, SD1.5, Flux, ...) Collection 1105 items • Updated about 3 hours ago • 10
Resources for Tagging / Captioning / Prompting / LLM Collection 8439 items • Updated about 3 hours ago • 5
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 6 items • Updated about 15 hours ago • 10
Running on Zero 2 2 MIDI Templates Inpainter 🖌 Inpaint pitches in MIDI templates to create unique songs
view post Post 301 Wan2.1 🔥📹 new OPEN video model by Alibaba Wan team!Model: Wan-AI/Wan2.1-T2V-14BDemo: Wan-AI/Wan2.1✨Apache 2.0✨8.19GB VRAM, runs on most GPUs✨Multi-Tasking: T2V, I2V, Video Editing, T2I, V2A✨Text Generation: Supports Chinese & English✨Powerful Video VAE: Encode/decode 1080P w/ temporal precision See translation 🔥 4 4 🚀 3 3 👍 2 2 + Reply
Crimean Tatar Text-to-Speech Collection Use three voices: Abibullah, Sevil, or Arslan • 4 items • Updated about 19 hours ago • 2