fawzanaramam's picture
Update README.md
56c4444 verified
metadata
language:
  - ar
license: apache-2.0
base_model: openai/whisper-small
tags:
  - fine-tuned
  - Quran
  - automatic-speech-recognition
  - arabic
  - whisper
datasets:
  - fawzanaramam/the-truth-1st-chapter
metrics:
  - wer
model-index:
  - name: Whisper Small Finetuned on Surah Fatiha
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: The Truth 2.0 - Surah Fatiha
          type: fawzanaramam/the-truth-1st-chapter
          args: 'config: ar, split: train'
        metrics:
          - name: Word Error Rate (WER)
            type: wer
            value: 0

Whisper Small Finetuned on Surah Fatiha

This model is a fine-tuned version of openai/whisper-small, transcribing Surah Fatiha, the first chapter of the Quran. It has been trained using The Truth 2.0 - Surah Fatiha dataset and achieves excellent results with a Word Error Rate (WER) of 0.0, indicating perfect transcription on the evaluation set.

Model Description

Whisper Small is a transformer-based automatic speech recognition (ASR) model developed by OpenAI. By fine-tuning it on the Surah Fatiha dataset, this model becomes highly accurate in transcribing Quranic recitation. It is designed to assist in religious, educational, and research-oriented tasks that require precise Quranic transcription.

Performance Metrics

On the evaluation set, the model achieved:

  • Loss: 0.0088
  • Word Error Rate (WER): 0.0

These metrics showcase the model's exceptional performance and reliability in transcribing Surah Fatiha audio.

Training Results

The following table summarizes the training process and results:

Training Loss Epoch Step Validation Loss WER
No log 0.5556 10 1.1057 96.2766
No log 1.1111 20 0.3582 29.7872
0.6771 1.6667 30 0.1882 23.4043
0.6771 2.2222 40 0.0928 25.0
0.0289 2.7778 50 0.0660 34.0426
0.0289 3.3333 60 0.0484 32.9787
0.0289 3.8889 70 0.0241 25.5319
0.0056 4.4444 80 0.0184 28.7234
0.0056 5.0 90 0.0111 0.0
0.0019 5.5556 100 0.0088 0.0

Intended Uses & Limitations

Intended Uses

  • Speech-to-text transcription of Quranic recitation for Surah Fatiha.
  • Educational tools to assist in learning and practicing Quranic recitation.
  • Research and analysis of Quranic audio transcription methods.

Limitations

  • This model is fine-tuned specifically for Surah Fatiha and may not generalize well to other chapters or non-Quranic Arabic audio.
  • Variability in audio quality, accents, or recitation styles might affect performance.
  • Optimal performance is achieved with high-quality audio inputs.

Training and Evaluation Data

The model was trained on The Truth 2.0 - Surah Fatiha dataset, which comprises high-quality audio recordings of Surah Fatiha and their corresponding transcripts. The dataset was meticulously curated to ensure the accuracy and authenticity of Quranic content.

Training Procedure

Training Hyperparameters

The following hyperparameters were used during training:

  • Learning Rate: 1e-05
  • Training Batch Size: 16
  • Evaluation Batch Size: 8
  • Seed: 42
  • Optimizer: Adam (betas=(0.9, 0.999), epsilon=1e-08)
  • Learning Rate Scheduler: Linear
  • Warmup Steps: 10
  • Training Steps: 100
  • Mixed Precision Training: Native AMP

Framework Versions

  • Transformers: 4.41.1
  • PyTorch: 2.2.1+cu121
  • Datasets: 2.19.1
  • Tokenizers: 0.19.1