Whisper for audio captioning
Collection
Whisper models finetuned on audio captioning instead of speech recognition. These model aim to briefly describe what happens in the audio scene.
•
3 items
•
Updated
•
2