May 04, 2016 If you’re like me, you really want the original text more than a synopsis. It take s a bit of extra effort, but I have a nice solution that uses only a Mac and open source software. Read below for instructions on converting an MP3 audio file to a text document. The Basics of Configuring Your Mac to Transcribe.MP3 Audio. Here’s what you need. Apr 11, 2020 This simple and free speech to text converter can help you perfectly convert audio file to the text as long as you use the below steps. You can upload.mp3.mpeg.wav.flac, or.opus files on IBM Watson Speech to Text converter.
I listen to podcasts. I watch videos. I watch podcasts of different languages. But more than anything I read and write. I practice languages. That’s just how I roll. And sometimes, my ramblings bring me as far as understanding English meaning of some specific kikuyu translation texts.
Frequently I want to save an audio snippet or video clip for future reference. Sure I could save the source media file, if I had unlimited disk space. But what I usually do is keep a link to the original source and text synopsis of the snippet. That both saves on storage and makes future searches for that particular item simpler.
If you’re like me, you really want the original text more than a synopsis. It take s a bit of extra effort, but I have a nice solution that uses only a Mac and open source software. Read below for instructions on converting an MP3 audio file to a text document.
Here’s what you need:
Follow the instructions on the developer websites to get all of the software installed and working on your system. Once you have the software installed, the next step is to configure your Mac to use Soundflower for dictation.
Your Mac is ready for dictation. When dictation is turned on in TextEdit (or a another word processing app), your Mac will transcribe sound from the Soundflower input source.
Next, you need to queue up the audio file in Audacity and direct output to Soundflower. For those who are new to Audacity, this will be the trickiest step. But relax, you don’t need to learn much about Audacity beyond deciding what section of sound to play and how to select the audio output from the default speakers to Soundflower.
With Audacity and your sound file queued up, its time to turn your attention to TextEdit.
It’s time to start audio playback and dictation transcription. Here both sequence and timing are important:
This is the fun part: watch as transcription happens in real time right in the document window. Look Ma, no hands!
And now you have the original text (and most likely a few errors) as text to save. In the future you can easily search and retrieve the information.
While the solution above works great for offline work, one alternative with a lot of promise is Google Docs. The Voice Typing feature work much like the dictation service in Mac OS. It has the crowdsourcing advantages and privacy disadvantages of other Google products. If you’re OK with that, I found Voice Typing to do an very good job with accuracy and it can go longer that Mac OS dictation.
To use Google Voice Typing, follow all of the steps above with Soundflower, Dictation preferences and configuring Audacity. Instead of using TextEdit, you’ll want to start the Chrome browser and create a Google Doc. Once you are in document, Select Tools –> Voice typing
The user interface and process of starting and stopping transcription is the same as with TextEdit.
This process sets you well on you way to the goal of a high fidelity audio transcription. But it will be short of perfect. Here’s what you can do to go from good to perfect: