Question 1

Is my audio uploaded to a server?

Accepted Answer

No. The Whisper model downloads once to your browser and processes everything locally. Your files never leave your device at any point.

Question 2

Is it actually free? What's the catch?

Accepted Answer

It's free because your own device does the computing, not our servers. We have zero compute costs to pass on to you. No minute limits, no file limits.

Question 3

Does it work with any YouTube video?

Accepted Answer

It works with videos that have captions available, which is most of them. If a video has no captions, download the audio and drag it here. Whisper will transcribe it.

Question 4

How long does transcription take?

Accepted Answer

Depends on your hardware. With a compatible GPU (WebGPU in Chrome or Edge), a 5-minute audio takes around 15–30 seconds. Without GPU, expect 1–3 minutes. The first run takes longer because it downloads the model.

Question 5

What audio formats are supported?

Accepted Answer

mp3, wav, m4a, ogg, and webm. Video formats like mp4 also work in most modern browsers.

Question 6

What languages can it transcribe?

Accepted Answer

Whisper is multilingual: English, Spanish, French, German, Italian, Portuguese, Japanese, Chinese, Arabic, and many more. You can force a language or let it auto-detect.

Question 7

Which browser do I need?

Accepted Answer

Any modern browser works. For top speed with WebGPU you need Chrome 113+ or Edge 113+. Firefox and Safari run it on CPU, a bit slower but just as accurate.

Question 8

Why is the first run slower?

Accepted Answer

The first time, it downloads the Whisper model (between 75 MB and 480 MB depending on the tier). It gets cached in your browser after that, so subsequent runs start instantly.

Question 9

How accurate is the transcription?

Accepted Answer

It depends on the model. whisper-small (480 MB) delivers very high accuracy for major languages. whisper-tiny is faster but makes more mistakes with accents or background noise. For meetings with decent audio quality, all three models produce very usable results.

Question 10

Does it work on mobile?

Accepted Answer

Yes, but it's slower. Mobile devices don't have WebGPU, so Whisper runs on the CPU. A 5-minute audio can take 3–5 minutes on a phone. On a laptop or desktop the experience is much better.

Question 11

Is there an audio length limit?

Accepted Answer

There's no imposed limit. The only constraint is your device's RAM. Audio files up to 2–3 hours work without issues on devices with 8 GB of RAM or more.

Question 12

Is my data safe? Is it GDPR compliant?

Accepted Answer

Your audio never leaves your device, so there's no personal data for us to protect on our end. We don't use tracking cookies or collect personal information. It's about as GDPR-friendly as a tool can get.

	OpenTranscript	Typical services
Cost	Free, always	$0.006 – $0.05 / minute
Privacy	Audio never leaves your device	Your audio is uploaded to their servers
Sign-up	Not required	Mandatory
Minute limit	No limit	Limited on the free plan
Speed	Depends on your hardware	Dedicated GPU servers
Maximum accuracy	whisper-small (very good)	whisper-large (excellent)

Transcribe YouTube and audio,
free and without uploading anything

Why OpenTranscript

Your audio goes nowhere

YouTube: paste the link and you're done

Actually free, no tricks

Adapts to your hardware

What you can use it for

Transcribe podcasts

Transcribe meetings

Transcribe lectures and talks

Get text from YouTube videos

Transcribe interviews

Accessibility

How it works

Paste the link or drop your audio

We process the text

Copy or download

OpenTranscript vs. other services

Compare Whisper models

whisper-tiny

whisper-base

whisper-small

Languages Whisper can transcribe

Frequently asked questions

Transcribe now

Transcribe YouTube and audio,free and without uploading anything