WebML Community

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Xenova new activity about 14 hours ago

webml-community/kokoro-webgpu:output is bugged 100%

Xenova new activity 2 days ago

webml-community/kokoro-webgpu:Bump to v1.1.1 of kokoro-js

Xenova updated a Space 2 days ago

webml-community/kokoro-webgpu

View all activity

webml-community's activity

Xenova

in webml-community/kokoro-webgpu about 14 hours ago

output is bugged 100%

#2 opened about 22 hours ago by

froilo

Xenova

posted an update 2 days ago

Post

3181

We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️

Generate 10 seconds of speech in ~1 second for $0.

What will you build? 🔥
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
✂️ Implement sentence splitting, allowing for streamed responses
🌍 Multilingual support (only phonemization left)

Who wants to help?

6 replies

Xenova

in webml-community/kokoro-webgpu 2 days ago

Bump to v1.1.1 of kokoro-js

#1 opened 2 days ago by

Xenova

updated a Space 2 days ago

Kokoro Text-to-Speech (WebGPU)

🗣

High-quality speech synthesis powered by Kokoro TTS

Xenova

published a Space 2 days ago

Kokoro Text-to-Speech (WebGPU)

🗣

High-quality speech synthesis powered by Kokoro TTS

Xenova

authored a paper 3 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 5 days ago • 140

Xenova

updated a Space 4 days ago

Next.js + Transformers.js Client Template

🗄

Xenova

published a Space 4 days ago

Next.js + Transformers.js Client Template

🗄

Xenova

updated a Space 4 days ago

Next.js + Transformers.js Server Template

🗄

Xenova

in webml-community/TinySwallow-1.5B-Instruct-WebGPU 4 days ago

Update demo

#1 opened 4 days ago by

Xenova

updated a Space 4 days ago

TinySwallow-1.5B-Instruct-WebGPU

🐦

A compact Japanese LLM that runs locally in your browser.

Xenova

published a Space 6 days ago

TinySwallow-1.5B-Instruct-WebGPU

🐦

A compact Japanese LLM that runs locally in your browser.

Xenova

updated a Space 13 days ago

191

Janus Pro WebGPU

🏛

In-browser unified multimodal understanding and generation.

Xenova

published a Space 13 days ago

191

Janus Pro WebGPU

🏛

In-browser unified multimodal understanding and generation.

Xenova

updated a Space 18 days ago

510

DeepSeek-R1 WebGPU

🧠

Next-generation reasoning model that runs locally in-browser

Xenova

published a Space 20 days ago

510

DeepSeek-R1 WebGPU

🧠

Next-generation reasoning model that runs locally in-browser

Xenova

posted an update 24 days ago

Post

5115

Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82 million parameter text-to-speech model, 100% locally in the browser w/ WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon!
👉 npm i kokoro-js 👈

Try it out yourself: webml-community/kokoro-web
Link to models/samples: onnx-community/Kokoro-82M-ONNX

You can get started in just a few lines of code!

import { KokoroTTS } from "kokoro-js";

const tts = await KokoroTTS.from_pretrained(
  "onnx-community/Kokoro-82M-ONNX",
  { dtype: "q8" }, // fp32, fp16, q8, q4, q4f16
);

const text = "Life is like a box of chocolates. You never know what you're gonna get.";
const audio = await tts.generate(text,
  { voice: "af_sky" }, // See `tts.list_voices()`
);
audio.save("audio.wav");

Huge kudos to the Kokoro TTS community, especially taylorchu for the ONNX exports and Hexgrad for the amazing project! None of this would be possible without you all! 🤗

The model is also extremely resilient to quantization. The smallest variant is only 86 MB in size (down from the original 326 MB), with no noticeable difference in audio quality! 🤯