Baum E. B. & Wilczek F. (1988): "Supervised lerning of probabilty distributions by neural networks," Neural Information Processing Systems, Ed. D. Z Anderson, American Institute of Physics.
Bertenstam, J. Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Högberg, J., Lindell, R., Neovius, L., de Serpa-Leitao, A. and Ström, N. (1995a): "The Waxholm Application Database," Proc. EUROSPEECH '95, Madrid. pp. 833-836.
Bertenstam, J. Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Högberg, J., Lindell, R., Neovius, L., de Serpa-Leitao, A., Nord, L. and Ström, N. (1995b): "Spoken dialogue data collection in the Waxholm project," STL-QPSR 1/1995, pp. 50-73.
Bishop C. M. (1995): Neural Networks for Pattern Recognition, Oxford University Press, Oxford.
Blomberg M., Carlson R., Elenius K., Granström B., Gustafson J., Hunnicut S., Lindell R., & Neovius L. (1993): "An Experimental Dialogue System: Waxholm," Proc EUROSPEECH '93, pp. 1867- 1870.
Bourlard & Wellekens (1988): "Links between Markov Models and Multilayer Perceptrons," IEEE Trans on PAMI, 12(12), pp. 1167-1178.
Bourlard H. & Morgan N. (1993): "Continuous Speech Recognition by Connectionist Statistical Methods," IEEE trans. on Neural Networks, 4(6), pp. 893-909.
Bridle J. S. (1989): "Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition," in Neuro-computing: Algorithms, Architectures and Applications, Eds: Fougelman-Soulie and Hérault, pp. 227-236, Springer Verlag.
Bundine W. L. & Wiegend A. S. (1994): "Computing Second Derivatives in Feed-Forward Networks: A Review," IEEE Trans on Neural Networks, 5(3), pp. 1-9.
Cohen M., Franco H., Morgan N., Rumelhart D. & Abrash V. (1992): "Hybrid neural network/Hidden Markov Model continuous-speech recognition," Proc ICSLP '92, pp. 915-918.
Digalakis V. V., Ostendorf M. & Rohlicek J. R. (1992): "Fast algorithms for phone classification and recognition using segment-based models," IEEE Trans. on Signal Processing, Vol 40, pp. 2885- 2896.
Duda R. O. & Hart P. E. (1973): Pattern Classification and Scene Analysis, John Wiley and Sons, New York.
English, T. M. & Boggess, L. C. (1992): "Back-propagation training of a neural network for word spotting," Proc. IEEE ICASP '92, Vol 2, pp. 357-360.
Fahlman S. E. (1988): "An empirical study of learning speed in back-propagation networks," Technical Report CMU-CS-88-162, Carnegie-Mellon University, Computer Science Dept., Pittsburgh, PA.
Fant G. (1969): Acoustic Theory of Speech Perception, Mouton, The Hague, The Netherlands.
Ghosh G. & Tumer K. (1994): "Structural adaptation and generalization in supervised feed-forward networks," Journal of Artificial Neural Networks, 1(4), pp. 430-458.
Gish, H. (1990): "A Probabilistic Approach to the Understanding and Training of Neural Network Classifiers," Proc IEEE ICASSP '90, pp1361-1364.
Glass J., Chang J., & McCandless M. (1996): "A Probabilistic Framework for Feature-Based Speech Recognition," Proc ICSLP '96, pp. 2277-2280.
Goldenthal W. (1994): "Statistical trajectory models for phonetic recognition," Technical Report MIT/LCS/TR-642, MIT Lab. for Computer Science.
Hampshire J. B. & Pearlmutter B. A. (1990): "Equivalence Proofs for Multi-Layer Perceptron Classifiers and the Bayesian Discriminant Function," Proc. of the 1990 Connectionist Models Summer School, Eds: Touretsky, Sejnowski and Hinton, Morgan Kaufmann, San Mateo CA.
Högberg J. & Sjölander K. (1996): "Cross Phone State Clustering Using Lexical Stress and Context," Proc ICSLP '96, pp. 474-477.
Hornik K., Stinchcombe M. & White H. (1989): "Multilayer feed-forward networks are universal approximators," Neural Networks, Vol 2, pp. 359-366.
Juang B.-H. & Katagiri S. (1992): "Discriminative Learning for Minimum Error Classification," IEEE trans. On Signal Processing, 40(12), pp. 3043-3054.
Kershaw D. J., Hochberg M. M. & Robinson A. J. (1996): "Context-dependent classes in a hybrid recurrent network-HMM speech recognition system," In Advances in Neural Information Processing Systems 8, eds: Touretsky D. S., Mozer M. C, and Hasselmo M. E., Morgan Kaufmann.
Lamel L. & Gauvain J. L. (1993): "High performance speaker-independent phone recognition using CDHMM," Proc. EUROSPEECH, pp. 121- 124.
Le Cun Y., Boser B., Denker J. S., Henderson J. S., Howard R. E., Hubbard W. & Jackel L. D. (1990b): "Handwritten Digit Recognition with a Back-propagation Network," In Advances in Neural Information Processing Systems vol. II, ed: Touretsky D. S., pp. 396-404, San Mateo, California IEEE, Morgan Kaufmann.
Le Cun Y., Denker J. S. & Solla S. A. (1990a): "Optimal brain damage," In Advances in Neural Information Processing Systems vol. II, ed: Touretsky D. S., pp. 589-605, San Mateo, California IEEE, Morgan Kaufmann.
Lee, K. F. (1989): Automatic Speech Recognition; The Development of the SPHINX System, Kluwer Academic Publishers, Dordrecht.
Lee K-F & Hon H-W (1989): "Speaker-independent Phone Recognition using Hidden Markov Models," IEEE Trans. On Acoustics, Speech, and Signal Processing, 37(11), pp. 1641-1648.
Levenberg K. (1944): "A method for the solution of certain problems in least squares," Quart. Appl. Math., Vol 2, pp. 164-168.
Levin, E. (1990): "Word recognition using hidden control neural architecture," Proc IEEE ICASSP '90, Vol 1, pp. 433-436.
Li, K. P., Naylor, J. A. & Rossen, M. L. (1992): "A whole word recurrent neural network for keyword spotting," Proc. IEEE ICASP '92, Vol 2, pp. 81-84.
Luenberger G. L. (1984): Linear and Nonlinear Programming, Addison-Wesley Publishing Company, Inc.
Mari J. F., Fohr D. & Junqua J.C. (1996) "A second-order HMM for high per-formance word and phoneme-based continuous speech recognition," Proc. ICASSP '96, pp. 435--438.
Marquardt D. (1963): "An algorithm for least-squares estimation of nonlinear parameters," SIAM Jl. Appl. Math., Vol 11, pp. 431-441.
Mitchel C. D., Harper M. P. & Jamieson L. H. (1996): "Stochastic Observation Hidden Markov Models," Proc IEEE ICASSP '96, pp. 617-620.
Pearlmutter B. A. (1990): "Dynamic Recurrent Neural Networks," Technical Report CMU-CS-88-191, Carnegie-Mellon University, Computer Science Dept. Pittsburg, PA.
Rabiner L. and Juang B-H (1993): Fundamentals of Speech Recognition, Englewood Cliffs NJ, Prentice Hall.
Richard M. D. & Lippman R. P. (1991): "Neural network classifiers estimate Bayesian a posteriori probabilities," Neural Computation, Vol 3, pp. 461-483.
Ripley B. D. (1996): Pattern Recognition and Neural Networks, Cambridge University Press, Cambridge.
Robinson A.J. (1994): "An application of Recurrent Nets to Phone Probability Estimation," IEEE trans. on Neural Networks, 5(2), pp. 298-305.
Robinson T. & Fallside F. (1991): "A Recurrent Error Propagation Network Speech Recognition System," Computer Speech & Language, 5:3, pp. 259-274.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986): "Learning internal representations by error propagation," in Rumelhart, D. E., G. E. Hinton, (eds.), Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Vol 1 Foundations., chapter 8. Bradford Books/MIT Press, Cambridge, MA, ISBN 0-262-18120-7.
Schroeder M. R., Atal B. S. & Hall J. L. (1979): "Objective Measure of Certain Speech Signal Degradations Based on Masking Properties of the Human Auditory Perception," in Frontiers of Speech Communication Research, Eds: B. Lindblom and S. Öhman, Academic Press, pp. 217-229.
Sietsma J.& Dow R. J. F. (1991): "Creating artificial neural networks that generalize," Neural Networks, 4(1) pp. 67-69.
Sjölander & Högberg (1996): "Trying to improve phone and word recognition using finely tuned phone-like units," Proc. Swedish Phonetics Conference '96, PHONUM 4:1996, pp. 125-128, Umeå universitets tryckeri, Umeå, Sweden.
Steeneken H. J. M. & van Leeuwen D. A. (1995): "Multi-lingual Assessment of Speaker Independent Large Vocabulary Speech-recognition Systems: the SQALE-project," Proc. EUROSPEECH '95, pp. 1271-1274.
Solla S. A., Levin E. & Fleisher M. (1988): "Accelerated Learning in Layered Neural Networks," Complex Systems, Vol 2, pp. 625-640.
Ström N. (1992): "Development of a Recurrent Time-Delay Neural Net Speech Recognition System," STL-QPSR 2-3/1992, pp. 1-44, KTH (Royal Institute of Technology), Dept. of Speech, Music and Hearing, Sweden.
Ström N. (1996): "Continuous speech recognition in the WAXHOLM dialogue system," STL-QPSR 4/1996, pp., KTH (Royal Institute of Technology), Dept. of Speech, Music and Hearing, Sweden. (Abstract) (Paper, postscript 2391K)
Tebelskis, J. & Waibel, A. (1990): "Large vocabulary recognition using linked predictive neural networks," Proc. IEEE ICASSP '90, Vol 1, pp. 437-440.
Thimm G. & Fiesler E. (1995): "Evaluationg pruning methods," In 1995 International Symposium on Artificial Neural Networks," Proc. ISANN '95, pp. A2 20-25, National Chiao-Tung University, Hsinchu, Taiwan.
Waibel A., Hanazawa T., Hinton G., Shikano K. & Lang K. (1987) : "Phoneme Recognition Using Time-Delay Neural Networks," ATR Technical Report TR-006, ATR, Japan.
White H. (1989): "Learning in Artificial Neural Networks: A Statistical Perspective," Neural Computation 1(4), pp. 425-464.
Young S., Jansen J., Odell J., Ollason D. & Woodland P. (1995): HTK - Hidden Markov Toolkit, Entropic Cambridge Research Laboratory.
Zue V., Seneff S. & Glass J. (1991): "Speech Database Development: TIMIT and beyond," Speech Communication, 9(4), pp. 351-356.