This document contains list of conversations pause fillers which have been removed from the different stages of processing. For the text obtained from speech transcripts (via ASR) only disfluencies from ASR lexicon are filtered. The text retrieved from human transcripts disfluencies relevant transcripts are filtered.
Disfluency | in human transcript | ASR Lexicon |
---|---|---|
huh | yes | yes |