metadata
language: en
license: mit
tags:
- audio
- captioning
- text
- audio-captioning
- automated-audio-captioning
task_categories:
- audio-captioning
CoNeTTE (ConvNext-Transformer with Task Embedding) for Automated Audio Captioning
This model generate a short textual description of any audio file.
pip install conette
from conette import CoNeTTEConfig, CoNeTTEModel
config = CoNeTTEConfig.from_pretrained("Labbeti/conette")
model = CoNeTTEModel.from_pretrained("Labbeti/conette", config=config)
It was created by @Labbeti.