hmmscribe.tcl - Select pronunciation variants using Viterbi forced alignment.
hmmscribe.tcl modelname [options]
set config(scribe,dictionary) dict.db set config(scribe,input) transcript set config(scribe,output) alignment
These values will override the preset default values. All subsequent
command line options will override the values specified by the
configuration script. Command line parameters are specified using the
param variable.
set param(scribe,modelname) foo
Since command line parameters are typically not optional the user needs to specify the command line parameters as a single "-" character for the settings defined in the configuration script to take effect.
The configuration file is also used to specify the feature post-processing script.
set config(feature,script) user.tcl
If this variable is not defined in the configuration file, then only the base features as saved in the feature cache are used during training.
hmmscribe.tcl selects the word pronunciation variants as they appear in the word pronunciation dictionary.
The input word transcriptions are used to create a finite state grammar where each node or state in the grammar contains a word and its pronunciation variants. The standard Viterbi algorithm is then used to find the best possible path through the grammar, resulting in the selection of the pronunciation variants to fit the sentence.
The resulting output word transcriptions may then be used to generate the associated model transcriptions for HMM embedded training (see genmodel.tcl, hmmembed.tcl).
gendict.tcl, worddb.tcl, Master Label File
Johan Schalkwyk
Center for Spoken Language
Understanding
Oregon Graduate Institute of Science &
Technology