BigVGAN Collection BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. β’ 11 items β’ Updated about 1 month ago β’ 11
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Paper β’ 2407.04051 β’ Published Jul 4, 2024 β’ 36
Running on CPU Upgrade 109 109 Open Chinese LLM Leaderboard π Display and filter LLM benchmark results
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. β’ 14 items β’ Updated May 8, 2024 β’ 24
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 Image-Text-to-Text β’ Updated 13 days ago β’ 1.08k β’ 186