Parameter
Name
|
Type
|
Description
|
cdRulesFile
|
string
|
A file
that specifies context-dependent rules for a grammar. The format,
which is a modification of ABNF,
is described in the grammar format section.
|
className
|
string
|
The
name of the grammar class. The grammar class name is typically
the type of expansion performed by the grammar. For example, a
top-level class name would usually be "grammar", the lexicon
file would have the "word" grammar class, and the expansion of
phonemes into states would have the "phoneme" grammar class.
The "word" grammar class is particularly important, as
this class name will determine where in the state network word
boundaries are specified. The use of "phoneme" as a
grammar class name will determine where in the state network phoneme
boundaries are specified. Other class names may be chosen
according to the user's preference.
|
collapseValue
|
integer
|
If collapseValue
is 1, then if a context-independent node in
the state network expands to context-dependent nodes that have the same
name, these context-dependent nodes will be collapsed into a single
node
with that name. This may greatly reduce the size of the state
network,
and for most applications will work fine. However, the collapsed
state
network is not guaranteed to be equivalent to the non-collapsed network
in
all cases. If non-equivalence is suspected, then set collapseValue
to 0 to not perform this collapsing of nodes. The default value
of collapseValue is 1.
|
grammar
|
string
|
The
value of grammar may be either a filename or a Tcl list.
The Statenet package automatically determines if grammar is
a filename or Tcl list. The format for a grammar file or list is
described in the grammar format section.
|
parts
|
string
|
The
value of parts may be either a filename or a Tcl list.
The Statenet package automatically determines if parts is
a filename or a Tcl list. The format for a parts file or a parts
list is described in the parts format
section.
|
| recognizerSpecification |
string
|
The
value of recognizerSpecification may be either a filename or a
Tcl list. The Statenet package automatically determines if recognizerSpecification is a filename or Tcl
list. The format for a specification file or list is described in
the recognizer specification format
section.
|
returnInfo
|
integer
|
This
value should be 1 if information about a parameter should be returned,
or 0 if information about a parameter should not be returned. The
parameters are:
-name
|
The
name of the top-level grammar
|
-numStates
|
The
current number of states in the state network
|
-enter
|
The
state ID of the enter node (usually 0)
|
-exit
|
The
state ID of the exit node (usually 1)
|
-collapse
|
Whether
identical context-dependent nodes are collapsed. See collapseValue
for a description.
|
-selfLoops
|
Whether
self loops will be added to each state during creation of new states.
This is specified in the statenet create, statenet add,
and statenet addspec commands.
|
-durModel
|
The
type of duration model. Valid types are "exponential", "gamma",
and "minmax", although currently only "minmax" is
supported. This parameter is specified in recognizerSpecification.
|
-sampFreq
|
The
sampling frequency used by the
recognizer. This
parameter
is specified in recognizerSpecification. There is no
default
value.
|
-frameSize
|
The
frame size used by the recognizer. This parameter is
specified in recognizerSpecification. There is no default value.
|
-featuresURI
|
The
URI that specifies the location and filename of Tcl code used by the
recognizer to compute features for
recognition. This
parameter is specified in recognizerSpecification.
Currently,
the URI specification is restricted to local files only.
|
-contextURI
|
The
URI that specifies the location and filename of Tcl code used by the
recognizer to compute a context window of features for recognition.
This parameter
is specified in recognizerSpecification. Currently, the
URI specification is restricted to local files only.
|
|
selfLoopsValue
|
integer
|
If selfLoopsValue
is 1, then any state created by the statenet
create, statenet add, or statenet addSpec commands
will
automatically receive a self-loop. If selfLoopsValue is
0,
then states do not recieve self loops. In typical usage, selfLoopsValue
is 0 when specifying the grammar and lexicon, and then set to 1 only at
the
very last expansion when recognizer-specific HMM states are created.
The default value of selfLoopsValue
is 0.
|
startTokenName
|
string
|
startTokenName
specifies the name of the root token that is the "top" of the
grammar. This name must exist in the grammar specification
as
a token for a context-independent rule. The default value is "$grammar".
|
stateFormat
|
string
|
This
parameter specifies the format for specifying states that is used in
the recognizerSpecification description. Currently, only
one format is supported, "multistatebiphone". This format
allows for context-independent single-state monophones as well as
context-dependent biphones specified in two or three states. This
format is described in the state
specification format.
In the future, other formats, such as context-dependent triphones, are
expected. Everything done by the statenet addSpec command can
also
be done by a statenet add command using context-dependent
grammar
rules; the statenet addSpec command simply processes known
formats
(e.g. multi-state biphones) faster that statenet add.
|
oneLevelValue
|
integer
|
If oneLevelValue
is
0 (the default), then rules are continuously expanded until no more
rules can be applied. If oneLevelValue
is
1, then only one "level" of rules is applied to a token within a single
call to statenet create or statenet add. For
example, consider the case in which a lexicon contains the words "I"
and "did", the pronunciation of "I" is (Worldbet) aI,
and the
pronunciation of "did" is dc
d I dc [d]. When expanding a
grammar, all occurrences of "I" will be expanded to aI,
and all
occurrences of "did" will be expanded to dc d I dc [d]. If oneLevelValue is 0 (default), then
the I
in dc d I dc [d] will be further
expanded with the rule I
-> aI,
yielding dc
d aI dc [d]. If oneLevelValue is 1, then this
second "level" of applying rules is not performed, and the
pronunciation of "did" remains dc
d I dc [d]. Because lexicons
in general only require one level of applying rules, the default for a
lexicon grammar that is specified using the lexicon keyword within a
higher-level grammar is that oneLevelValue set to 1.
Any value of oneLevelValue
specified as part of a statenet create
or statenet add command will
override the default value.
|
statenetObject
|
statenet
object
|
This
is the object returned by statenet create and used or modified
by other Statenet commands as well as Viterbi search commands.
|