Next: Dual Estimation
Up: Introduction
Previous: State-estimation
Parameter Estimation
The classic machine learning problem involves determining
a nonlinear mapping
 |
|
(3) |
where
is the input,
is the output, and the
nonlinear map
is parameterized by the vector
.
The nonlinear map, for example, may be a
feedforward or recurrent neural network (
are the weights),
with numerous applications in regression, classification, and dynamic modeling.
Learning corresponds to estimating the parameters
.
Typically, a training set is provided with sample pairs consisting of
known input and desired outputs,
. The
error of the machine is defined as
, and
the goal of learning involves solving for
the parameters
in order to minimize the expected squared
error.
While a number of optimization
approaches exist ( e.g., gradient descent using backpropagation), the
EKF may be used to estimate the parameters by writing a new
state-space representation
where the parameters
correspond to a stationary process
with identity state transition matrix, driven by process noise
(the choice of variance determines tracking performance).
The output
corresponds to a nonlinear observation on
.
The EKF can then be applied directly as an efficient ``second-order'' technique for
learning the parameters. In the linear case, the relationship between the
Kalman Filter (KF) and Recursive Least Squares (RLS) is given in
[3]. The use of the EKF for training neural networks
has been developed by Singhal and Wu [4] and Puskorious and Feldkamp [5].
Next: Dual Estimation
Up: Introduction
Previous: State-estimation