Spaces:
Running
title: WER
emoji: 🖩
colorFrom: blue
colorTo: red
sdk: gradio
sdk_version: 3.19.1
app_file: app.py
pinned: false
tags:
- evaluate
- wer
- neuralspace
- STT
About this Demo
This demo was built as a part of NeuralSpace's VoiceAI blog on Word Error Rate 101: Your Guide to STT Vendor Evaluation.
What is WER?
WER or Word Error Rate is a metric used primarily in the field of speech recognition to measure the performance of an automatic speech recognition (ASR) system. WER calculates the minimum number of operations (substitutions, deletions, and insertions) required to change the system's transcription (prediction) into the reference transcription (truth), divided by the number of words in the reference.
WER = (Substitutions + Insertions + Deletions)/Total number of words in truth
WER can range from 0 to infinite. The closer the WER is to 0, the better. WER is often also represented as a percentage. It is usually calculated by just multiplying 100 to it. For example, a WER of 0.15 might also be represented as 15%.
WER is important because it provides:
- Performance Measure: It gives an objective measure of how well an ASR system is transcribing speech into text.
- Comparison: It allows for comparison between different ASR systems or versions of a system.
References
- Python package to calculate WER: jiwer
FAQ
Have any questions or comments? Reach out to NeuralSpace at [email protected]. Interested to try out NeuralSpace VoiceAI for your enterprise? Book time with our expert here.
Space authored by : Aditya Dalmia