Skip to content

Instantly share code, notes, and snippets.

@hmen97
Last active February 8, 2021 12:18
Show Gist options
  • Save hmen97/4f9ac5823dd323071f685e25d1794ec9 to your computer and use it in GitHub Desktop.
Save hmen97/4f9ac5823dd323071f685e25d1794ec9 to your computer and use it in GitHub Desktop.
0.6.0
./bin/lmplz --text ../../tts/vocabulary.txt --arpa words.arpa --o 3 --discount_fallback
./bin/build_binary -T -s -v words.arpa lm.binary
python3 generate_package.py --alphabet ../alphabet.txt --lm lm.binary --vocab librispeech-vocab-500k.txt --default_alpha 0.75 --default_beta 1.85 --package kenlm.scorer
python3 lm_optimizer.py --test_files bin/librispeech/librivox-test-clean.csv --checkpoint_dir deepspeech-0.7.0-checkpoint --n_hidden 2048
0.9.0
~/DeepSpeech/tensorflow/bazel-bin/native_client/generate_scorer_package --alphabet ~/DeepSpeech/data/alphabet.txt --lm ~/path/to/binaries/lm.binary --vocab ~/path/to/sentences.txt --default_alpha 0.75 --default_beta 1.85 --package ~/path/to/binaries/kenlm.scorer
python3.6 ~/DeepSpeech/data/lm/generate_lm.py --input_txt /path/to/sentences.txt --output_dir ~/destination/path/binaries/ --top_k 50000 --kenlm_bins /path/to/kenlm/build/bin/ --arpa_order 5 --max_arpa_memory "85%" --arpa_prune "0|0|1" --binary_a_bits 255 --binary_q_bits 8 --binary_type trie --discount_fallback
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment