10/2/2018
Use Kaldi on the WSJ data to train and decode with a traditional HMM-GMM monophone model, then a triphone HMM-GMM model, and then a simple HMM-DNN model.
- GMM & EM algorithm (Gaussian mixture models and the EM algorithm): https://people.csail.mit.edu/rameshvs/content/gmm-em.pdf
- Jurafsky & Martin (Chapters 6, 7, and 9): http://stp.lingfil.uu.se/~santinim/ml/2014/JurafskyMartinSpeechAndLanguageProcessing2ed_draft%202007.pdf
- HMM-GMM Acoustic Models for Speech Recognition: http://www1.icsi.berkeley.edu/~arlo/publications/faria_cs281a_proj.pdf
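
The order of the three stages (monophone, then triphone, then DNN) matters because each stage is bootstrapped from alignments produced by the previous one. A rough sketch of that ordering in Python follows. It only assembles command lines and does not run them. The script names are the standard ones from Kaldi's `steps/` and `utils/` directories, but every directory name (`data/train`, `data/test`, `data/lang`, `data/lang_test`, `exp`) and the leaf and Gaussian counts are placeholders assumed for illustration, not values from these notes. The DNN training command is left out on purpose, since the notes do not say which Kaldi neural network setup is meant.

```python
from typing import List, NamedTuple


class Stage(NamedTuple):
    name: str
    commands: List[List[str]]  # each command is an argv-style list


def build_pipeline(train: str, test: str, lang: str,
                   lang_test: str, exp: str) -> List[Stage]:
    """Assemble (but do not run) the commands for each training stage.

    All directory arguments are hypothetical placeholders.
    """
    mono, mono_ali = f"{exp}/mono", f"{exp}/mono_ali"
    tri, tri_ali = f"{exp}/tri1", f"{exp}/tri1_ali"

    def decode(model_dir: str) -> List[List[str]]:
        # Build the decoding graph for a model, then decode the test set.
        graph = f"{model_dir}/graph"
        return [
            ["utils/mkgraph.sh", lang_test, model_dir, graph],
            ["steps/decode.sh", graph, test, f"{model_dir}/decode"],
        ]

    return [
        Stage("monophone HMM-GMM",
              [["steps/train_mono.sh", train, lang, mono]] + decode(mono)),
        Stage("triphone HMM-GMM",
              [
                  # Align the training data with the monophone model first.
                  ["steps/align_si.sh", train, lang, mono, mono_ali],
                  # 2000 leaves / 10000 Gaussians are illustrative values only.
                  ["steps/train_deltas.sh", "2000", "10000",
                   train, lang, mono_ali, tri],
              ] + decode(tri)),
        Stage("HMM-DNN",
              # The DNN is trained on triphone alignments; the training
              # script itself is not specified in the notes, so it is omitted.
              [["steps/align_si.sh", train, lang, tri, tri_ali]]),
    ]


if __name__ == "__main__":
    for stage in build_pipeline("data/train", "data/test",
                                "data/lang", "data/lang_test", "exp"):
        print(f"# {stage.name}")
        for cmd in stage.commands:
            print(" ".join(cmd))
```

Printing the commands instead of executing them keeps the sketch safe to run outside a Kaldi checkout; in a real recipe these calls would normally live in a shell `run.sh` with options such as the number of parallel jobs set per site.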