README.md · neuralspace/wer_calculator at e8ec2723556bf7ee34f1700df9811a40382344b2

metadata

title: WER
emoji: 🖩
colorFrom: blue
colorTo: red
sdk: gradio
sdk_version: 3.19.1
app_file: app.py
pinned: false
tags:
  - evaluate
  - wer
  - neuralspace
  - STT

About this Demo

This demo was built as a part of NeuralSpace's VoiceAI blog on Word Error Rate 101: Your Guide to STT Vendor Evaluation.

What is WER?

WER or Word Error Rate is a metric used primarily in the field of speech recognition to measure the performance of an automatic speech recognition (ASR) system. WER calculates the minimum number of operations (substitutions, deletions, and insertions) required to change the system's transcription (prediction) into the reference transcription (truth), divided by the number of words in the reference.

WER = (Substitutions + Insertions + Deletions)/Total number of words in truth

WER can range from 0 to infinite. The closer the WER is to 0, the better. WER is often also represented as a percentage. It is usually calculated by just multiplying 100 to it. For example, a WER of 0.15 might also be represented as 15%.

WER is important because it provides:

Performance Measure: It gives an objective measure of how well an ASR system is transcribing speech into text.
Comparison: It allows for comparison between different ASR systems or versions of a system.

References

Python package to calculate WER: jiwer

FAQ

Have any questions or comments? Reach out to NeuralSpace at [email protected]. Interested to try out NeuralSpace VoiceAI for your enterprise? Book time with our expert here.

Space authored by : Aditya Dalmia