language: | |
- en | |
- zh | |
- de | |
- es | |
- ru | |
- ko | |
- fr | |
- ja | |
- pt | |
- tr | |
- pl | |
- ca | |
- nl | |
- ar | |
- sv | |
- it | |
- id | |
- hi | |
- fi | |
- vi | |
- he | |
- uk | |
- el | |
- ms | |
- cs | |
- ro | |
- da | |
- hu | |
- ta | |
- 'no' | |
- th | |
- ur | |
- hr | |
- bg | |
- lt | |
- la | |
- mi | |
- ml | |
- cy | |
- sk | |
- te | |
- fa | |
- lv | |
- bn | |
- sr | |
- az | |
- sl | |
- kn | |
- et | |
- mk | |
- br | |
- eu | |
- is | |
- hy | |
- ne | |
- mn | |
- bs | |
- kk | |
- sq | |
- sw | |
- gl | |
- mr | |
- pa | |
- si | |
- km | |
- sn | |
- yo | |
- so | |
- af | |
- oc | |
- ka | |
- be | |
- tg | |
- sd | |
- gu | |
- am | |
- yi | |
- lo | |
- uz | |
- fo | |
- ht | |
- ps | |
- tk | |
- nn | |
- mt | |
- sa | |
- lb | |
- my | |
- bo | |
- tl | |
- mg | |
- as | |
- tt | |
- haw | |
- ln | |
- ha | |
- ba | |
- jw | |
- su | |
- yue | |
tags: | |
- audio | |
- automatic-speech-recognition | |
license: mit | |
library_name: ctranslate2 | |
# Whisper large-v3 model for CTranslate2 | |
This repository contains the conversion of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) to the [CTranslate2](https://github.com/OpenNMT/CTranslate2) model format. | |
This model can be used in CTranslate2 or projects based on CTranslate2 such as [faster-whisper](https://github.com/systran/faster-whisper). | |
## Example | |
```python | |
from faster_whisper import WhisperModel | |
model = WhisperModel("large-v3") | |
segments, info = model.transcribe("audio.mp3") | |
for segment in segments: | |
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text)) | |
``` | |
## Conversion details | |
The original model was converted with the following command: | |
``` | |
ct2-transformers-converter --model openai/whisper-large-v3 --output_dir faster-whisper-large-v3 \ | |
--copy_files tokenizer.json preprocessor_config.json --quantization float16 | |
``` | |
Note that the model weights are saved in FP16. This type can be changed when the model is loaded using the [`compute_type` option in CTranslate2](https://opennmt.net/CTranslate2/quantization.html). | |
## More information | |
**For more information about the original model, see its [model card](https://huggingface.co/openai/whisper-large-v3).** | |