MBZUAI
/

ArTST

Model card Files Files and versions Community

Request: Add generation_config.json and preprocessor_config.json to Model Weights Release"

#1

by muhtasham - opened 5 days ago

5 days ago

First, thank you for making the weights publicly available!

Request

Would it be possible to include the following configuration files with the model weights:

generation_config.json - for consistent generation settings
preprocessor_config.json - for proper text preprocessing
config.json - for complete model configuration

It would reduce the chance of audio generation issues due to misconfigurations

Currently using:

CLARTTS_ArTST_TTS.pt
CLARTTS_ArTSTstar_TTS.pt

Request for Sample Inference Code

Would greatly appreciate:

A complete working inference script showing:
- Correct model loading process
- Proper speaker embedding handling
- Text-to-speech generation
- Audio saving
A few example outputs (wav files) generated using this script
Expected audio parameters (sample rate, duration, etc.)

Specific Questions

Which speaker embeddings work best with the model?
Are there any specific preprocessing steps needed for Arabic text?
What are the expected audio characteristics of a successful generation?

Having this sample code would help ensure users are implementing the model as intended and getting the expected quality of speech synthesis.

CC @herwoww @amupd

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment