Use OpenAudible.
-
Install Python 3
-
Clone the repo
git clone https://github.com/tomchang25/whisper-auto-transcribe.git cd whisper-auto-transcribe
-
Install dependency
pip install git+https://github.com/openai/whisper.git pip install gradio
-
Install ffmpeg
# on Ubuntu or Debian sudo apt update && sudo apt install ffmpeg # on MacOS using Homebrew (https://brew.sh/) brew install ffmpeg # on Windows using Chocolatey (https://chocolatey.org/) choco install ffmpeg # on Windows using Scoop (https://scoop.sh/) scoop install ffmpeg
-
Open application
python gui.py
Or, use the command line:
python ./cli.py "./7 powers.mp3" --output "./tmp/7 Powers.srt" --task transcribe --model large
3. iina, a player based upon MPV
- Installation:
brew install iina
or, download from iina Official Website.
- Create mpv configuration file:
mkdir ~/.config/mpv
cat <<EOT >> ~/.config/mpv/mpv.conf
[extension.mp3]
profile-desc="profile for audiobooks"
audio-display=external-first
sub-align-x=left
sub-margin-x=100
sub-pos=85
sub-font='SauceCodePro Nerd Font'
sub-font-size=50
save-position-on-quit=yes
sub-auto=all
cover-art-auto=all
stop-screensaver=always
# autofit=640
ontop=yes
# Adjust subtitle timing is to remove minor gaps or overlaps
# between subtitles (if the difference is smaller than 210 ms,
# the gap or overlap is removed).
sub-fix-timing=yes
EOT
-
iina → Settings → Advanced:
-
Enable advanced settings
-
Check
Use config directory
-
Navigate to next subtitle: Shift + Right
Navigate to previous subtitle: Shift + Left
Toggle Music Mode: Option + Command + m
Enlarge Window: Command + =
Shrink Window: Command + -