Skip to content

Instantly share code, notes, and snippets.

Disfluencies

This document contains list of conversations pause fillers which have been removed from the different stages of processing. For the text obtained from speech transcripts (via ASR) only disfluencies from ASR lexicon are filtered. The text retrieved from human transcripts disfluencies relevant transcripts are filtered.

Disfluency in human transcript ASR Lexicon
huh yes yes

Keybase proof

I hereby claim:

  • I am balakkvj on github.
  • I am balakkvj (https://keybase.io/balakkvj) on keybase.
  • I have a public key ASBMtaB-XnKwAxdTlxjN0w-ZEfUma91YTJ6-3L46sk1Mzwo

To claim this, I am signing this object: