text-to-speech is not a valid pipeline

by boddles2 - opened Mar 7, 2023

Discussion

boddles2

Mar 7, 2023

When I run the "Hosted inference API" widget, I get the error "text-to-speech is not a valid pipeline".

nielsr

Mar 8, 2023

cc @Matthijs is SpeechT5 supported by a pipeline?

boddles2

Mar 8, 2023

Matthijs

Mar 9, 2023

Indeed, there currently is no pipeline for the TTS task in Transformers.

RageAgainstThePixel

Jun 5, 2023

Bump

joshdebelec

Jun 21, 2023

:disappointed_face:

nielsr

Jun 21, 2023

Update: a TTS pipeline will be added once at least two different TTS models are present in the Transformers library. This is to ensure the design is robust enough to handle different models.

RageAgainstThePixel

Jun 27, 2023

> Update, a TTS pipeline will be added once at least 2 different TTS models are present in the Transformers library. This is to ensure the design is robust enough to handle different models

What is required to accomplish this?

Rkosasih

Jul 8, 2023

Bump, problem still persists.

nielsr

Aug 17, 2023

A text-to-audio pipeline is now available: https://github.com/huggingface/transformers/pull/24952, supporting SpeechT5 and Bark. Usage is as follows:

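from transformers import pipeline

# Build the pipeline once; with only model= given, the task is inferred from the model.
pipe = pipeline(model="suno/bark")
# Call the pipeline object itself, not the pipeline() factory function.
output = pipe("Hey it's HuggingFace on the phone!")
audio = output["audio"]  # generated waveform
sampling_rate = output["sampling_rate"]  # sample rate in Hz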
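
To listen to the result, the returned samples can be written to a WAV file. Below is a minimal sketch using only the Python standard library. Note that `save_wav` is a hypothetical helper written for this thread, not part of Transformers, and it assumes the audio is mono floating-point data in the range [-1.0, 1.0], either flat or nested one level (for example shape `(n,)` or `(1, n)`).

```python
import struct
import wave


def _flatten(samples):
    """Yield floats from a flat or nested sequence of samples."""
    for item in samples:
        if hasattr(item, "__iter__"):
            yield from _flatten(item)
        else:
            yield float(item)


def save_wav(path, audio, sampling_rate):
    """Write float samples to a mono 16-bit PCM WAV file; return the sample count."""
    # Clip to [-1.0, 1.0] (assumed input range), then scale to signed 16-bit integers.
    pcm = [int(max(-1.0, min(1.0, s)) * 32767) for s in _flatten(audio)]
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)  # assumption: mono output
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav_file.setframerate(int(sampling_rate))
        wav_file.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))
    return len(pcm)
```

With the pipeline output above, this would be called as `save_wav("bark_out.wav", audio, sampling_rate)`, where the file name is arbitrary.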

The next step is to create a corresponding inference widget for it, cc @mishig.
