Parameter
Name
|
Type
|
Description
|
allowWordCutoff
|
integer
|
If
this value is 1, then the waveform may begin or end with a partial word
(a word cutoff) anywhere in the grammar. If this value is 0, then
the waveform must begin and end as specified in the grammar. The
default value is 0.
|
beamRatio
|
float
|
If, at
any point in time, the number of states that survive the beam divided
by the
total number of states in the network is less than beamRatio,
then
only those states that survive the beam are evaluated at the next time
frame.
If the ratio is greater than or equal to beamRatio, then
all
states are evaluated at the next time frame (the beam is effectively
set
to all possible states). This provides a simple method of making
the
beam width maximum (by setting beamRatio to 0.0), and in other
cases
the value of beamRatio affects processing speed. The
default value is 0.0, although a value of 0.5 should yield reasonable
results.
|
beamThreshold
|
float
|
If the
accumulated probability score of a state at a point in time is less
than the
maximum accumulated probability score at that time minus beamThreshold,
then the state is removed from the beam search. If the state
score minus
the maximum score is greater than or equal to beamThreshold,
then
the state remains active at the next time frame. The default
value is
-500.0.
|
beamThresholdAdjust
|
float
|
If the
number of states active at any point in time is greater than targetBeamSize
+ targetBeamVariance and targetBeamSize is greater than
zero, then beamThreshold is decreased by beamThresholdAdjust.
Or, if the number of
states active at any point in time is less than targetBeamSize
- targetBeamVariance and targetBeamSize is greater than
zero, then beamThreshold is increased by beamThresholdAdjust.
The default
value of beamThresholdAdjust is -5.0.
|
blockSize
|
integer
|
Memory
is allocated in chunks of blockSize units (where a single unit
may be an integer or an entire structure). A smaller value of blockSize
(with a lower limit of 1) reduces total memory consumption by
only allocating
space that will probably be used, but causes an increase in processing
time
by repeated calls to malloc() and realloc(). The default value is
500.
|
durationLongPenalty
|
float
|
If the
"minmax" duration model is employed, then a state's score is
penalized by durationLongPenalty for every frame that the state
is longer than its maximum allowed duration. If the "minmax"
duration model is not employed, this parameter has no effect. The
default value is -1.0; this value is in the log domain, and corresponds
to a probability value of 0.368.
|
durationModel
|
string
|
This
string specifies the duration model used in the Viterbi search.
Allowable values are "exponential", "gamma", and "minmax",
although only the "minmax" duration model is currently
implemented. An exponential model corresponds to the model
implicit in HMMs through the use of state transition probabilities.
The gamma model models state duration using a gamma distribution
function. The minmax model applies penalties to a state if its
duration is longer or shorter than specified values.
|
durationShortPenalty
|
float
|
If the
"minmax" duration model is employed, then a state's score is
penalized by durationShortPenalty for every frame that the
state is shorter than
its minimum allowed duration. If the "minmax" duration
model is not employed, this parameter has no effect. The default
value is -6.5; this value is in the log domain, and corresponds to a
probability value of 0.0015.
|
maxBeamThreshold
|
float
|
The
beam threshold value beamThreshold is guaranteed to never be
greater (closer to zero) than maxBeamThreshold. The
default value is -50.0. This
prevents the beam from becoming too restrictive.
|
observationProbFloor
|
float
|
If a
state observation probability is less than observationProbFloor,
the
probability value will be set to observationProbFloor.
The
default value is 1.0E-5.
|
probabilityMatrix
|
2D float vector object
|
This
two-dimensional array of floating-point values contains the observation
probabilities
for the utterance. The number of frames is the first dimension,
and
the number of categories is the second dimension. The exact
structure
is specified in the AfloatT type definition. This object is typically
generated by the "nnet x" or "garbage median" commands.
|
returnName
|
integer
|
If
this value is 1, then the viterbi answer command will return
the name of the grammar as the last item in a list. If this value
is 0, then the name will not be returned by the viterbi answer command.
The default value is 1.
|
returnPhonemes
|
integer
|
If
this value is 1, then the viterbi answer command will return
the list of best phonemes (with time information) as determined by the
search. If this value is 0, then the phoneme information will not
be returned by the viterbi answer command. The default
value is 0.
|
returnScore
|
integer
|
If
this value is 1, then the viterbi answer command will return
the highest log-probability score of the utterance as the first item in
a list. If this value is 0, then the score will not be returned
by the viterbi answer command. The default value is 1.
|
returnStates
|
integer
|
If
this value is 1, then the viterbi answer command will return
the list of best states (with time information) as determined by the
search. If this value is 0, then the state information will not
be returned by the viterbi answer command. The default
value is 1.
|
returnWords
|
integer
|
If
this value is 1, then the viterbi answer command will return
the list of best words (with time information) as determined by the
search. If
this value is 0, then the word information will not be returned by the viterbi
answer command. The default value is 1.
|
statenetObject
|
statenet
object
|
This
is the object returned by statenet create and used or modified
by other Statenet commands. It is used by the Viterbi package to
define the state network of the HMM.
|
targetBeamSize
|
integer
|
This
is the "ideal" number of states that survive the beam at each time
frame. If targetBeamSize is -1, then all states will
survive the beam. If targetBeamSize is 0, then the
number of states in the beam is not adjusted to reach any particular
target value.
|
targetBeamVariance
|
integer
|
If the
number of states active at any point in time is greater than targetBeamSize
+ targetBeamVariance and targetBeamSize is greater than
zero, then beamThreshold is decreased by beamThresholdAdjust.
Or, if the number of
states active at any point in time is less than targetBeamSize
- targetBeamVariance and targetBeamSize is greater than
zero, then beamThreshold is increased by beamThresholdAdjust.
The default
value of targetBeamVariance is 10. NOTE: The
use of the word "variance" in the variable name indicates that the
target beam size is allowed to vary before applying adjustments; it is not
indicative of the statistical notion of variance.
|
viterbiObject
|
viterbi
object
|
This
is the object returned by viterbi init and used or modified by
other Viterbi commands. This object contains information relevant
to the search
process other than the state network structure.
|
wordBoundaryPenalty
|
float
|
At
every word boundary, wordBoundaryPenalty is added to the
accumulated log probability score. This allows some control over
the relative numbers of insertion and deletion errors, although it's
not a very elegant technique. The default value is -3.0; this value is in the log domain,
and corresponds to a probability value of approximately 0.05.
|