April 2026 data · Public methodology

Spanish transcription benchmark

Why does AudioMap use AssemblyAI and not Whisper? Because every percentage point of Word Error Rate (WER) shows in the final result. Here is the public data by engine and regional variant.

ASR Engine	es-ES	es-MX	es-AR	es-CO	es-CL
AssemblyAI Universal-2AudioMap	7.4%	8.2%	9.6%	8.9%	10.7%
Deepgram Nova-2	8.1%	8.9%	11.2%	—	—
Google Cloud STT-v2	10.3%	—	—	—	—
OpenAI Whisper-large-v3	14.7%	16.2%	18.3%	15.4%	19.1%

WER (Word Error Rate): percentage of incorrect words out of total. Lower = better. 0% would be perfect. For professional use (legal, medical, journalism) WER <10% is considered acceptable; ideal <7%.

AssemblyAI wins in all variants

Universal-2 maintains WER <11% across all 5 tested Hispanic variants. Whisper-large-v3 ranges from 14.7% to 19.1%. A 7-10 point difference in regions with yeísmo or high speech rate (Rioplatense, Chilean).

Whisper is reasonable only in neutral Spanish

Whisper performs acceptably in standard es-ES but loses quality in regional variants. TurboScribe, Fireflies (non-English) and many low-cost alternatives use it — you are paying for quality that is not the best available.

Coming: AudioMap own benchmark

Q3 2026 we will publish a benchmark with our own curated audio (10h per regional variant, professional domain, human evaluation). We promise transparency: open dataset and methodology.

Try AudioMap free

120 minutes per month included. No credit card.