Convert MP4 to Text

Zoom calls, screen recordings, phone videos — drop the MP4 and get a timestamped transcript or ready-to-use subtitles.

No sign-up · No watermark · TXT, SRT, and VTT exports · Files auto-delete in 24h

Bigger files or more uploads? Free account: 5 files/day, 1-hour files · Pro: 10-hour files + speaker labels

Zoom/Teams · Screen recordings · iPhone video · Lectures

The meeting-recording workhorse

MP4 is what Zoom, Teams, Meet, OBS, and every phone camera hand you. Only the audio track inside the MP4 is read — the video stream is ignored entirely, so a 4K hour-long recording transcribes exactly as fast as a 480p one of the same length. Resolution is irrelevant; duration is what counts against your limit.

That also means the fastest workflow for very large MP4s is extracting the audio first: the transcript is identical and the upload is a fraction of the size.
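If you have ffmpeg installed, extracting the audio can be sketched roughly as below. This is a minimal example, not a description of how this page processes files: the helper names (`build_extract_command`, `extract_audio`) and the `.m4a` output are assumptions for illustration, and copying the stream without re-encoding assumes the MP4 carries AAC audio.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_extract_command(video_path: str, audio_path: Optional[str] = None) -> List[str]:
    """Build an ffmpeg command that copies the first audio track out of a video.

    Hypothetical helper: the output defaults to the input name with an
    .m4a extension, which assumes the audio track is AAC.
    """
    src = Path(video_path)
    dst = Path(audio_path) if audio_path else src.with_suffix(".m4a")
    return [
        "ffmpeg",
        "-i", str(src),
        "-vn",            # leave the video stream out of the output
        "-map", "0:a:0",  # take the first audio track of the first input
        "-c:a", "copy",   # copy the audio as-is, no re-encoding
        str(dst),
    ]


def extract_audio(video_path: str, audio_path: Optional[str] = None) -> None:
    """Run the command; requires ffmpeg to be available on PATH."""
    subprocess.run(build_extract_command(video_path, audio_path), check=True)
```

Calling `extract_audio("meeting.mp4")` would write `meeting.m4a` next to the original. Because the audio is copied rather than re-encoded, the speech content is unchanged and the step usually takes seconds.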

Subtitles or minutes from the same file

For published video, export SRT (YouTube, Premiere, Resolve, CapCut) or VTT (HTML5 players, Vimeo). For meetings and lectures, export TXT: you get searchable minutes, and you can strip out the filler with your own edits. Speaker labels for multi-person calls are available on Pro.

Frequently asked questions

Do Zoom, Teams, and screen recordings work?
Yes — meeting-app MP4s are the most common upload on this page. Local and cloud Zoom recordings, Teams downloads, and OBS/QuickTime screen captures all carry standard AAC audio that transcribes well. For multi-speaker calls, Pro's speaker labels tag who said what.
My MP4 has multiple audio tracks — which is used?
The first audio track in the container. Some screen recorders (OBS in particular) can save microphone and system audio as separate tracks. If you need both sides transcribed, re-record or re-export with the tracks merged, or mix the tracks into one audio file and upload that.
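One way to mix separate tracks into a single audio file is ffmpeg's `amix` filter. The sketch below only builds the command; the helper name `build_merge_command`, the default of two tracks, and the AAC `.m4a` output are assumptions for illustration, so check how many audio tracks your recording actually has before using it.

```python
from typing import List


def build_merge_command(video_path: str, audio_path: str, track_count: int = 2) -> List[str]:
    """Build an ffmpeg command that mixes several audio tracks into one.

    Hypothetical helper: track_count is how many audio tracks the input
    holds (for example microphone plus system audio).
    """
    if track_count < 2:
        raise ValueError("mixing needs at least two audio tracks")
    # Reference each audio track of input 0: [0:a:0][0:a:1]...
    inputs = "".join(f"[0:a:{i}]" for i in range(track_count))
    # amix sums the tracks; duration=longest keeps the full recording length.
    graph = f"{inputs}amix=inputs={track_count}:duration=longest[mixed]"
    return [
        "ffmpeg",
        "-i", video_path,
        "-filter_complex", graph,
        "-map", "[mixed]",  # only the mixed audio goes to the output
        "-c:a", "aac",      # filtered audio must be re-encoded
        audio_path,
    ]
```

Passing the result to `subprocess.run(..., check=True)` would produce one audio file with both sides of the conversation, ready to upload.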
Are iPhone HEVC videos (.mp4/.mov) supported?
Yes. iPhone footage — whether H.264 or HEVC video, in .mp4 or .mov — carries AAC audio, and that's all we read. No need to convert the video codec; the audio track is identical either way.
Should I export SRT or TXT from my MP4?
Depends where the text goes. SRT/VTT keep the timing and drop straight onto the video as subtitles or captions. TXT drops the timing and gives you clean prose for notes, documentation, or quoting. Both exports come from the same transcript, so you can grab both.
Why does duration matter more than file size?
Because only the audio is processed. A 3 GB 4K video with 10 minutes of speech is a 10-minute job; a 40 MB voice-only MP4 that runs 2 hours is a 2-hour job. The per-file caps (30 min anonymous, 1 h free account, 10 h paid) are all about duration, with generous size ceilings alongside.
Can I upload 4K files? Is there a size limit?
Yes — 4K is fine and changes nothing about accuracy or speed. Size ceilings: 100 MB anonymous, 500 MB free account, 5 GB Pro/packs. If your 4K file exceeds the ceiling for your tier, extract the audio (tiny by comparison) and upload that instead.
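The duration and size limits above can be checked together with a few lines of code. This is an illustrative sketch only: the tier keys, the helper names, treating 5 GB as 5000 MB, and counting a file exactly at a limit as allowed are all assumptions, and daily upload counts are not modeled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TierLimit:
    max_minutes: int
    max_megabytes: int


# Per-file caps as quoted in the FAQ; keys are hypothetical labels.
LIMITS = {
    "anonymous": TierLimit(max_minutes=30, max_megabytes=100),
    "free": TierLimit(max_minutes=60, max_megabytes=500),
    "pro": TierLimit(max_minutes=600, max_megabytes=5000),  # 5 GB taken as 5000 MB
}


def fits(tier: str, duration_minutes: float, size_megabytes: float) -> bool:
    """True if a file is within both the duration and size caps of a tier."""
    limit = LIMITS[tier]
    return duration_minutes <= limit.max_minutes and size_megabytes <= limit.max_megabytes


def smallest_tier(duration_minutes: float, size_megabytes: float) -> Optional[str]:
    """Return the first tier (anonymous, free, pro) that accepts the file, or None."""
    for tier in LIMITS:
        if fits(tier, duration_minutes, size_megabytes):
            return tier
    return None
```

For example, `smallest_tier(10, 3000)` points a 10-minute, 3 GB 4K file at the Pro ceiling purely because of its size, while the extracted audio of the same recording would be far smaller and may fit a lower tier.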