How to Transcribe Audio Files on Mac
You have an audio file — a recorded interview, a voice memo, a lecture recording, a podcast episode — and you need it in text. Traditional transcription services charge $1-3 per minute and take hours to deliver. Cloud-based tools require you to upload sensitive recordings to external servers. Here is how to transcribe audio files on your Mac entirely locally, in real time.
Step-by-Step Guide
Prepare your audio file
Make sure your audio file is accessible on your Mac. Common formats like MP3, WAV, M4A, AAC, and AIFF all work fine. If your file is in an unusual format, convert it first using a free tool like FFmpeg or the built-in macOS audio conversion in Finder (right-click > Encode Selected Audio Files).
Open the audio file in a media player
Open the file in any media player — QuickTime Player, VLC, IINA, or even just double-clicking to open in Music. The audio needs to play through your Mac's audio output for system audio capture to work.
Set up system audio capture
Open Glasscribe from the menu bar and select "System Audio" as the input source. This captures whatever audio is playing on your Mac. Set the transcription language to match the language spoken in the recording.
Play the audio and transcribe
Start the transcription in Glasscribe, then press play in your media player. The transcript appears in real time as the audio plays. For long recordings, you can pause both the player and transcription, then resume together.
Export the transcript
When the audio file finishes, stop the transcription. Export as .txt for a clean text document, or as .srt if you need timestamps (useful for syncing with video or referencing specific moments in the recording).
Pro Tips
Frequently Asked Questions
Can I transcribe audio files faster than real time?
The transcription happens at the speed the audio plays. You can increase playback speed in your media player (1.5x or 2x) to transcribe faster. Most speech engines handle moderately faster playback well, though very high speeds may reduce accuracy.
Do I need to upload my audio file anywhere?
No. The entire process happens on your Mac. The audio plays locally, and the speech-to-text processing runs on-device. Your recording never leaves your computer, which is important for confidential recordings like legal depositions, medical dictations, or private interviews.
What audio formats are supported?
Any audio format that your media player can play will work, since the transcription captures the system audio output. This includes MP3, WAV, M4A, AAC, AIFF, FLAC, OGG, and more. If your player can play it, you can transcribe it.