📣 Looking for labeled, high-quality synthetic audio/TTS data 📣 Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.
If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you DM me the data under a permissive, effectively Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.
What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❤️
If you can record the problem and share it there, or on the forums in your own post, please don't be shy. I'm not sure, but I do think it helps 🤗🤗🤗
> Oasis: First Real-Time Video Game Without a Game Engine! 🎮
DecartAI & Etched just released Oasis - a fully AI-generated video game running at 20 FPS (frames per second). The model takes keyboard inputs and generates everything - physics, rules, graphics - on the fly, without any game engine.
⚡️ What makes this special? Current text-to-video models (Mochi-1, Sora, Kling) generate about 1 frame every 10-20 seconds (that's the kind of device I had to play LoL on back in the day, hence my low rankings). Oasis is roughly 200 times faster, making it the first playable AI-generated game.
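For anyone who wants to check the speedup claim, here is a quick back-of-the-envelope sketch. The function name `speedup` and the constant `OASIS_FPS` are made up for illustration; the only inputs are the figures quoted above (20 FPS for Oasis, 10-20 seconds per frame for the other models).

```python
def speedup(target_fps: float, seconds_per_frame: float) -> float:
    """How many times faster target_fps is than a model needing seconds_per_frame per frame.

    The baseline runs at 1 / seconds_per_frame FPS, so the ratio
    target_fps / (1 / seconds_per_frame) simplifies to target_fps * seconds_per_frame.
    """
    return target_fps * seconds_per_frame


OASIS_FPS = 20.0  # figure quoted in the post

# Both ends of the "1 frame every 10-20 seconds" range quoted in the post
for spf in (10.0, 20.0):
    print(f"{spf:.0f} s/frame baseline -> {speedup(OASIS_FPS, spf):.0f}x faster")
```

So the "200 times" figure matches the fast end of that range (10 seconds per frame); against a model that needs 20 seconds per frame the gap would be closer to 400x.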
⚙️ Under the hood, it uses a vision transformer to encode space and a diffusion model to generate frames. The secret sauce is "dynamic noising" - a technique that keeps the video stable between frames.
Key insights:
⚡️ Generates 20 FPS, vs 0.2 FPS for other DiT-based video models
‣ Sohu, the specialized hardware developed by Etched, can handle 10x more players than an H100
🎮 Features real game mechanics
‣ Movement, jumping, item management
‣ Physics and lighting
‣ Procedurally generated worlds
⚠️ Current limitations
‣ Blurry graphics at a distance
‣ Objects sometimes change appearance
‣ Memory issues in long sessions