Question 1

Does transcription happen on my device?

Accepted Answer

Yes. Everything runs in your browser. Your audio files are never sent to any server.

Question 2

How accurate is the transcription?

Accepted Answer

Using the Whisper small model, accuracy is typically 90–95% for clear speech in supported languages. Accuracy decreases with heavy accents or background noise.

Question 3

How long does the first load take?

Accepted Answer

The Whisper small model is approximately 150 MB. Download time depends on your connection speed. It is cached after the first download.

Question 4

Which languages are supported?

Accepted Answer

All 99 languages supported by the Whisper model, including English, Japanese, Spanish, French, German, Chinese, Arabic, and many more.

Question 5

What is the maximum file size?

Accepted Answer

Files up to 500 MB are supported. For very large files, processing may take longer depending on your device.

Transcripción de Audio

What is Transcripción de Audio?

Key Features

Powered by OpenAI Whisper (tiny/base/small models in WebAssembly)

Supports 99 languages with automatic language detection

Word-level timestamps for precise navigation

Export to SRT, VTT, and plain TXT formats

Speaker diarization (beta)

Real-time transcription for microphone input

Edit and correct transcript in-browser

Upload MP3, WAV, M4A, FLAC, OGG, WebM up to 500 MB

How It Works

Load Model

Upload Audio

Transcription

Review & Export

Who Is This For?

Why Use Transcripción de Audio?

Technical Details

Frequently Asked Questions

Related Tools