Skip to content

Instantly share code, notes, and snippets.

@jeff1evesque
Created July 11, 2014 15:22
Show Gist options
  • Save jeff1evesque/723d130235e5d68921b9 to your computer and use it in GitHub Desktop.
Save jeff1evesque/723d130235e5d68921b9 to your computer and use it in GitHub Desktop.
Audio Analyzer: `sudo reboot`, audio scripts (_rate/_text) use inotifywait `-m -e close_write`
+ shopt -s extglob
+ haystack=(./@(audio|git|build)/*)
+ true ./audio/converter_wav_rate ./audio/converter_wav_text ./build/grunt ./git/local_ignore_rules
+ for file in '"${haystack[@]}"'
+ for file in '"${haystack[@]}"'
+ ././audio/converter_wav_rate
+ for file in '"${haystack[@]}"'
+ for file in '"${haystack[@]}"'
+ ././git/local_ignore_rules
+ ././audio/converter_wav_text
+ ././build/grunt
Setting up watches.
Watches established.
Setting up watches.
Watches established.
#[4mRunning "sass:dist" (sass) task#[24m
File ../assets/css/main.min.css created.
#[4mRunning "watch" task#[24m
Waiting...
FFmpeg version 0.6.6-4:0.6.6-0ubuntu0.11.04.1, Copyright (c) 2000-2010 the Libav developers
built on Jun 12 2012 16:35:16 with gcc 4.5.2
configuration: --extra-version=4:0.6.6-0ubuntu0.11.04.1 --prefix=/usr --enable-avfilter --enable-avfilter-lavf --enable-vdpau --enable-bzlib --enable-libgsm --enable-libschroedinger --enable-libspeex --enable-libtheora --enable-libvorbis --enable-pthreads --enable-zlib --enable-libvpx --disable-stripping --enable-runtime-cpudetect --enable-vaapi --enable-gpl --enable-postproc --enable-swscale --enable-x11grab --enable-libdc1394 --enable-shared --disable-static
libavutil 50.15. 1 / 50.15. 1
libavcodec 52.72. 2 / 52.72. 2
libavformat 52.64. 2 / 52.64. 2
libavdevice 52. 2. 0 / 52. 2. 0
libavfilter 1.19. 0 / 1.19. 0
libswscale 0.11. 0 / 0.11. 0
libpostproc 51. 2. 0 / 51. 2. 0
[wav @ 0x25fb420]max_analyze_duration reached
[wav @ 0x25fb420]Estimating duration from bitrate, this may be inaccurate
Input #0, wav, from '../audio/recording/sample.wav':
Duration: 00:00:11.49, bitrate: 255 kb/s
Stream #0.0: Audio: pcm_s16le, 16000 Hz, 1 channels, s16, 256 kb/s
Output #0, wav, to '../audio/recording_converted/sample.wav':
Metadata:
encoder : Lavf52.64.2
Stream #0.0: Audio: pcm_s16le, 16000 Hz, 1 channels, s16, 256 kb/s
Stream mapping:
Stream #0.0 -> #0.0
Press [q] to stop encoding
size= 359kB time=11.50 bitrate= 256.0kbits/s
video:0kB audio:359kB global headers:0kB muxing overhead 0.011960%
INFO: cmd_ln.c(696): Parsing command line:
pocketsphinx_continuous \
-infile ../audio/recording_converted/sample.wav \
-hmm en-us \
-lm cmusphinx-5.0-en-us.lm.dmp
Current configuration:
[NAME] [DEFLT] [VALUE]
-adcdev
-agc none none
-agcthresh 2.0 2.000000e+00
-allphone
-allphone_ci no no
-alpha 0.97 9.700000e-01
-argfile
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm en-us
-infile ../audio/recording_converted/sample.wav
-inmic no no
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-keyphrase
-kws
-kws_plp 1e-1 1.000000e-01
-kws_threshold 1 1.000000e+00
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm cmusphinx-5.0-en-us.lm.dmp
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf 10000 10000
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-remove_noise yes yes
-remove_silence yes yes
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-time no no
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-vad_postspeech 50 50
-vad_prespeech 10 10
-vad_threshold 2.0 2.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
INFO: cmd_ln.c(696): Parsing command line:
\
-nfilt 25 \
-lowerf 130 \
-upperf 6800 \
-transform dct \
-lifter 22 \
-feat 1s_c_d_dd \
-agc none \
-cmn current \
-varnorm no \
-cmninit 40
Current configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 40
-dither no no
-doublebw no no
-feat 1s_c_d_dd 1s_c_d_dd
-frate 100 100
-input_endian little little
-lda /usr/local/share/pocketsphinx/model/hmm/en-us/feature_transform
-ldadim 0 0
-lifter 0 22
-logspec no no
-lowerf 133.33334 1.300000e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 25
-remove_dc no no
-remove_noise yes yes
-remove_silence yes yes
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy dct
-unit_area yes yes
-upperf 6855.4976 6.800000e+03
-vad_postspeech 50 50
-vad_prespeech 10 10
-vad_threshold 2.0 2.000000e+00
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02
INFO: acmod.c(251): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/hmm/en-us/feat.params
INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
INFO: acmod.c(160): Reading linear feature transformation from /usr/local/share/pocketsphinx/model/hmm/en-us/feature_transform
INFO: mdef.c(517): Reading model definition: /usr/local/share/pocketsphinx/model/hmm/en-us/mdef
INFO: bin_mdef.c(181): Allocating 173954 * 8 bytes (1359 KiB) for CD tree
INFO: tmat.c(206): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/hmm/en-us/transition_matrices
INFO: acmod.c(123): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/means
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/variances
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(354): 768 variance values floored
INFO: acmod.c(125): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/means
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/variances
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(354): 768 variance values floored
INFO: ptm_mgau.c(792): Number of codebooks exceeds 256: 6138
INFO: acmod.c(127): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/means
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/hmm/en-us/variances
INFO: ms_gauden.c(292): 6138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x32
INFO: ms_gauden.c(354): 768 variance values floored
INFO: ms_senone.c(149): Reading senone mixture weights: /usr/local/share/pocketsphinx/model/hmm/en-us/mixture_weights
INFO: ms_senone.c(200): Truncating senone logs3(pdf) values by 10 bits
INFO: ms_senone.c(207): Not transposing mixture weights in memory
INFO: ms_senone.c(268): Read mixture weights for 6138 senones: 1 features x 32 codewords
INFO: ms_senone.c(320): Mapping senones to individual codebooks
INFO: ms_mgau.c(141): The value of topn: 4
INFO: dict.c(320): Allocating 137541 * 32 bytes (4298 KiB) for word entries
INFO: dict.c(333): Reading main dictionary: /usr/local/share/pocketsphinx/model/lm/en_US/cmu07a.dic
INFO: dict.c(213): Allocated 1010 KiB for strings, 1664 KiB for phones
INFO: dict.c(336): 133436 words read
INFO: dict.c(342): Reading filler dictionary: /usr/local/share/pocketsphinx/model/hmm/en-us/noisedict
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(345): 9 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(406): Allocating 46^3 * 2 bytes (190 KiB) for word-initial triphones
INFO: dict2pid.c(132): Allocated 51152 bytes (49 KiB) for word-final triphones
INFO: dict2pid.c(196): Allocated 51152 bytes (49 KiB) for single-phone word triphones
INFO: ngram_model_arpa.c(77): No \data\ mark in LM file
INFO: ngram_model_dmp.c(142): Will use memory-mapped I/O for LM file
INFO: ngram_model_dmp.c(196): ngrams 1=19794, 2=1377200, 3=3178194
INFO: ngram_model_dmp.c(242): 19794 = LM.unigrams(+trailer) read
INFO: ngram_model_dmp.c(288): 1377200 = LM.bigrams(+trailer) read
INFO: ngram_model_dmp.c(314): 3178194 = LM.trigrams read
INFO: ngram_model_dmp.c(339): 57155 = LM.prob2 entries read
INFO: ngram_model_dmp.c(359): 10935 = LM.bo_wt2 entries read
INFO: ngram_model_dmp.c(379): 34843 = LM.prob3 entries read
INFO: ngram_model_dmp.c(407): 2690 = LM.tseg_base entries read
INFO: ngram_model_dmp.c(463): 19794 = ascii word strings read
INFO: ngram_search_fwdtree.c(99): 788 unique initial diphones
INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 58 single-phone words
INFO: ngram_search_fwdtree.c(186): Creating search tree
INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 58 single-phone words
INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 45060
INFO: ngram_search_fwdtree.c(339): after: 572 root, 44932 non-root channels, 50 single-phone words
INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
INFO: continuous.c(280): pocketsphinx_continuous COMPILED ON: Jun 25 2014, AT: 10:27:43
INFO: ngram_search.c(466): Resized score stack to 200000 entries
INFO: ngram_search.c(458): Resized backpointer table to 10000 entries
INFO: ngram_search.c(466): Resized score stack to 400000 entries
INFO: ngram_search.c(458): Resized backpointer table to 20000 entries
INFO: ngram_search.c(466): Resized score stack to 800000 entries
INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 43.65 -3.85 2.18 11.03 7.06 0.67 7.95 8.83 -2.52 4.97 -7.04 -6.91 1.38 >
INFO: ngram_search_fwdtree.c(1550): 16346 words recognized (37/fr)
INFO: ngram_search_fwdtree.c(1552): 1488022 senones evaluated (3329/fr)
INFO: ngram_search_fwdtree.c(1556): 3448051 channels searched (7713/fr), 213631 1st, 675250 last
INFO: ngram_search_fwdtree.c(1559): 33640 words for which last channels evaluated (75/fr)
INFO: ngram_search_fwdtree.c(1561): 314582 candidate words for entering last phone (703/fr)
INFO: ngram_search_fwdtree.c(1564): fwdtree 11.11 CPU 2.485 xRT
INFO: ngram_search_fwdtree.c(1567):
fwdtree 17.01 wall 3.806 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 909 words
INFO: ngram_search_fwdflat.c(938): 8390 words recognized (19/fr)
INFO: ngram_search_fwdflat.c(940): 578766 senones evaluated (1295/fr)
INFO: ngram_search_fwdflat.c(942): 1219292 channels searched (2727/fr)
INFO: ngram_search_fwdflat.c(944): 74002 words searched (165/fr)
INFO: ngram_search_fwdflat.c(947): 55424 word transitions (123/fr)
INFO: ngram_search_fwdflat.c(950): fwdflat 4.41 CPU 0.987 xRT
INFO: ngram_search_fwdflat.c(953): fwdflat 7.20 wall 1.612 xRT
INFO: ngram_search.c(1252): lattice start node <s>.0 end node </s>.444
INFO: ngram_search.c(1278): Eliminated 2 nodes before end node
INFO: ngram_search.c(1383): Lattice has 1345 nodes, 11615 links
INFO: ps_lattice.c(1368): Normalizer P(O) = alpha(</s>:444:445) = -1275739
INFO: ps_lattice.c(1403): Joint P(O,S) = -1332077 P(S|O) = -56338
INFO: ngram_search.c(874): bestpath 0.26 CPU 0.058 xRT
INFO: ngram_search.c(877): bestpath 0.41 wall 0.093 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 43.65 -3.85 2.18 11.03 7.06 0.67 7.95 8.83 -2.52 4.97 -7.04 -6.91 1.38 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 43.28 -1.41 0.48 11.08 3.92 -0.41 10.37 6.40 -1.88 6.94 -4.56 -5.46 2.86 >
INFO: cmn_prior.c(99): cmn_prior_update: from < 43.28 -1.41 0.48 11.08 3.92 -0.41 10.37 6.40 -1.88 6.94 -4.56 -5.46 2.86 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 42.95 -2.65 -0.63 14.10 5.23 -0.42 10.45 5.86 0.25 6.33 -5.96 -5.84 1.81 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 42.95 -2.65 -0.63 14.10 5.23 -0.42 10.45 5.86 0.25 6.33 -5.96 -5.84 1.81 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 42.78 -3.29 -0.36 12.69 5.05 -0.33 10.46 5.91 0.15 5.99 -5.86 -5.57 1.81 >
INFO: ngram_search_fwdtree.c(1550): 17142 words recognized (25/fr)
INFO: ngram_search_fwdtree.c(1552): 2253052 senones evaluated (3270/fr)
INFO: ngram_search_fwdtree.c(1556): 4863288 channels searched (7058/fr), 333578 1st, 695122 last
INFO: ngram_search_fwdtree.c(1559): 40884 words for which last channels evaluated (59/fr)
INFO: ngram_search_fwdtree.c(1561): 415408 candidate words for entering last phone (602/fr)
INFO: ngram_search_fwdtree.c(1564): fwdtree 17.46 CPU 2.534 xRT
INFO: ngram_search_fwdtree.c(1567): fwdtree 25.52 wall 3.704 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 675 words
INFO: ngram_search_fwdflat.c(938): 8566 words recognized (12/fr)
INFO: ngram_search_fwdflat.c(940): 654135 senones evaluated (949/fr)
INFO: ngram_search_fwdflat.c(942): 1026525 channels searched (1489/fr)
INFO: ngram_search_fwdflat.c(944): 61449 words searched (89/fr)
INFO: ngram_search_fwdflat.c(947): 47734 word transitions (69/fr)
INFO: ngram_search_fwdflat.c(950): fwdflat 5.06 CPU 0.734 xRT
INFO: ngram_search_fwdflat.c(953): fwdflat 6.69 wall 0.971 xRT
INFO: ngram_search.c(1199): </s> not found in last frame, using tax.687 instead
INFO: ngram_search.c(1252): lattice start node <s>.0 end node tax.637
INFO: ngram_search.c(1278): Eliminated 123 nodes before end node
INFO: ngram_search.c(1383): Lattice has 1838 nodes, 4858 links
INFO: ps_lattice.c(1368): Normalizer P(O) = alpha(tax:637:687) = -1726098
INFO: ps_lattice.c(1403): Joint P(O,S) = -1765851 P(S|O) = -39753
INFO: ngram_search.c(874): bestpath 0.04 CPU 0.006 xRT
INFO: ngram_search.c(877): bestpath 0.06 wall 0.009 xRT
INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 28.57 CPU 2.519 xRT
INFO: ngram_search_fwdtree.c(435): TOTAL fwdtree 42.54 wall 3.751 xRT
INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 9.47 CPU 0.835 xRT
INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 13.89 wall 1.225 xRT
INFO: ngram_search.c(303): TOTAL bestpath 0.30 CPU 0.026 xRT
INFO: ngram_search.c(306): TOTAL bestpath 0.47 wall 0.042 xRT
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment