fasiha/audio2text.md

## audio2text.md

      
Install Homebrew

Via https://brew.sh/

Install CMU Sphinx

Per https://github.com/watsonbox/homebrew-cmu-sphinx,
$ brew tap watsonbox/cmu-sphinx
$ brew install --HEAD watsonbox/cmu-sphinx/cmu-sphinxbase
$ brew install --HEAD watsonbox/cmu-sphinx/cmu-pocketsphinx

Install ffmpeg

$ brew install ffmpeg
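
Before moving on, it can help to confirm that the commands used in the remaining steps are actually on your PATH. The following is a minimal sketch; `missing_tools` is a hypothetical helper name chosen for illustration, not something provided by Homebrew, ffmpeg, or CMU Sphinx.

```shell
# missing_tools: print the name of each given command that is not on PATH.
# (Hypothetical helper, for illustration only.)
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Prints nothing if both tools used below are installed.
missing_tools ffmpeg pocketsphinx_continuous
```

If either name is printed, revisit the corresponding install step above.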

Prepare the audio using ffmpeg:

$ ffmpeg -i INPUT_FILE -ac 1 -ar 16000 output.wav


-ac 1 means mono (one audio channel)
-ar 16000 means a 16 kHz sample rate
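
If you have several recordings, the same ffmpeg command can be wrapped in a small function that derives the output name from the input name. This is only a sketch: `wav_name` and `to_wav` are hypothetical helper names, and it assumes the input is not already named `.wav` (otherwise input and output would be the same file).

```shell
# wav_name: map an input path to the name of its converted WAV file,
# e.g. "talks/intro.mp3" -> "talks/intro.wav". (Hypothetical helper.)
wav_name() {
  case "${1##*/}" in
    *.*) printf '%s.wav\n' "${1%.*}" ;;  # strip the existing extension
    *)   printf '%s.wav\n' "$1" ;;       # no extension: just append .wav
  esac
}

# to_wav: convert one file to 16 kHz mono WAV, using the same flags as above.
to_wav() {
  ffmpeg -i "$1" -ac 1 -ar 16000 "$(wav_name "$1")"
}

# Example usage (uncomment to convert every MP3 in the current directory):
# for f in *.mp3; do to_wav "$f"; done
```

Each converted file is written next to its original, so the originals are left untouched.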

Run

$ pocketsphinx_continuous -infile output.wav > output.txt

pocketsphinx_continuous prints a lot of log output to the terminal, but the recognized text goes to output.txt. You can monitor its progress from another shell:
$ tail -f output.txt
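
If the log output is too noisy, it can be sent to a file of its own with ordinary shell redirection. The sketch below assumes the log is written to standard error, which is consistent with the behavior described above but is still an assumption; `transcribe` and the file name `pocketsphinx.log` are hypothetical names chosen for illustration.

```shell
# transcribe WAV TXT LOG: run the recognizer on WAV, writing the recognized
# text to TXT and the diagnostic log to LOG instead of the terminal.
# Assumes pocketsphinx_continuous logs to stderr. (Hypothetical helper.)
transcribe() {
  pocketsphinx_continuous -infile "$1" > "$2" 2> "$3"
}

# Example usage:
# transcribe output.wav output.txt pocketsphinx.log
```

You can still follow the transcript with `tail -f output.txt` while this runs.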