chuuhtetnaing/whisper-small-myanmar

chuuhtetnaing committed 27 days ago

Commit 904c06a • 1 Parent(s): 679fa9b

Update README.md

Files changed (1)

README.md +21 -11

README.md CHANGED

@@ -8,6 +8,12 @@ metrics:
 model-index:
 - name: whisper-small-myanmar
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,24 +21,28 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-small-myanmar
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.1904
 - Wer: 49.0650
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -88,4 +98,4 @@ The following hyperparameters were used during training:
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
-- Tokenizers 0.15.1

 model-index:
 - name: whisper-small-myanmar
   results: []
+datasets:
+- chuuhtetnaing/myanmar-speech-dataset-openslr-80
+language:
+- my
+pipeline_tag: automatic-speech-recognition
+library_name: transformers
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # whisper-small-myanmar
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the [chuuhtetnaing/myanmar-speech-dataset-openslr-80](https://huggingface.co/datasets/chuuhtetnaing/myanmar-speech-dataset-openslr-80) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.1904
 - Wer: 49.0650
+## Usage
+```python
+from datasets import Audio, load_dataset
+from transformers import pipeline
+# Load a sample audio
+dataset = load_dataset("chuuhtetnaing/myanmar-speech-dataset-openslr-80")
+dataset = dataset.cast_column("audio", Audio(sampling_rate=16000))
+test_dataset = dataset['test']
+input_speech = test_dataset[42]['audio']
+pipe = pipeline(model='chuuhtetnaing/whisper-small-myanmar')
+output = pipe(input_speech, generate_kwargs={"language": "myanmar", "task": "transcribe"})
+print(output['text']) # ကျမ ပြည်ပ မှာ ပညာသင် တော့ စာမေးပွဲ ကို တပတ်တခါ စစ်တယ်
+```
 ### Training hyperparameters
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
+- Tokenizers 0.15.1
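
The model card in this diff reports `Wer: 49.0650` on the evaluation set. As a rough illustration of what that metric measures, here is a minimal, dependency-free sketch of word error rate: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the sketch makes two assumptions: that the reported value is a percentage, and that words are separated by whitespace, as in the sample transcript shown in the Usage section. It is not necessarily the exact computation the Trainer used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: 100 * (S + D + I) / N.

    Hypothetical helper for illustration. Words are obtained by
    whitespace splitting (an assumption about the transcripts).
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i] + [0] * len(hyp_words)
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return 100.0 * prev[-1] / len(ref_words)


# One substitution (b -> x) and one deletion (d) over four reference words.
print(word_error_rate("a b c d", "a x c"))  # 50.0
```

Under these assumptions, a score near 49 would mean roughly one word-level edit for every two reference words, so the segmentation of the reference text has a direct effect on the number.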