next up previous contents
Next: Chapter 2: Written Language Input Up: Ch. 1 Title Page Previous: 1.8: Spoken Language Understanding

References

ADNS94
X. Aubert, C. Dugast, H. Ney, and V. Steinbiss. Large vocabulary continuous speech recognition of wall street journal data. In ICASSP [ICA94], pages 129--132.

AH89
T. H. Applebaum and B. A. Hanson. Regression features for recognition of speech in quiet and in noise. In ICASSP [ICA89], pages 985--988.

AHH93
F. Alleva, X. Huang, and M. Y. Hwang. An improved search algorithm using incremental knowledge for continuous speech recognition. In ICASSP [ICA93], pages 307--310.

AMS94
T. Anastasakos, J. Makhoul, and R. Schwartz. Adaptation to new microphones using tied-mixture normalization. In ICASSP [ICA94], pages 433--436.

ARP93
Advanced Research Projects Agency. Proceedings of the 1993 ARPA Human Language Technology Workshop, Princeton, New Jersey, March 1993. Morgan Kaufmann.

ARP94
Advanced Research Projects Agency. Proceedings of the 1994 ARPA Human Language Technology Workshop, Princeton, New Jersey, March 1994. Morgan Kaufmann.

ARP95a
Advanced Research Projects Agency. Proceedings of the 1995 ARPA Human Language Technology Workshop. Morgan Kaufmann, January 1995.

ARP95b
Advanced Research Projects Agency. Proceedings of the ARPA Spoken Language Systems Technology Workshop. Morgan Kaufmann, January 1995.

AS90a
A. Acero and R. M. Stern. Environmental robustness in automatic speech recognition. In ICASSP [ICA90], pages 849--852.

AS90b
V. M. Alvarado and H. F. Silverman. Experimental results showing the effects of optimal spacing between elements of a linear microphone array. In ICASSP [ICA90], pages 837--840.

Ata74
Bishnu S. Atal. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. Journal of the Acoustical Society of America, 55(6):1304--1312, 1974.

BBdS93a
L. R. Bahl, J. R. Bellegarda, P. V. de Souza, P. S. Gopalakrishnan, D. Nahamoo, and M. A. Picheny. Multonic Markov word models for large vocabulary continuous speech recognition. IEEE Transactions on Speech and Audio Processing, 1(3):334--344, July 1993.

BBdS93b
L. R. Bahl, P. F. Brown, P. V. de Souza, R. L. Mercer, and M. A. Picheny. A method for the construction of acoustic Markov models for words. IEEE Transactions on Speech and Audio Processing, 1(4):443--452, October 1993.

BDFK92
Y. Bengio, R. DeMori, G. Flammia, and R. Kompe. Global optimization of a neural network---hidden Markov model hybrid. IEEE Transactions on Neural Networks, 3(2):252--259, March 1992.

BDPDP94
A. Berger, S. Della Pietra, and V. Della Pietra. Maximum entropy methods in machine translation. Technical report, IBM Research Report, 1994.

BDPdS90
P. F. Brown, V. J. Della Pietra, P. V. de Souza, J. C. Lai, and R. L. Mercer. Class-based n-gram models of natural language. In Proceedings of the IBM Natural Language ITL, Paris, France, March 1990.

BdSG91
L. R. Bahl, P. V. de Souza, P. S. Gopalakrishnan, D. Nahamoo, and M. A. Picheny. Decision trees for phonological rules in continuous speech. In ICASSP [ICA91], pages 185--188.

BdSN92
J. R. Bellegarda, P. V. de Souza, A. J. Nadas, D. Nahamoo, M. A. Picheny, and L. Bahl. Robust speaker adaptation using a piecewise linear acoustic mapping. In ICASSP [ICA92], pages 445--448.

BJM83
Lalit R. Bahl, Fred Jelinek, and R. L. Mercer. A maximum likelihood approach to continuous speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(2):179--190, March 1983.

Boc93
E. L. Bocchieri. Vector quantization for the efficient computation of continuous density likelihoods. In ICASSP [ICA93], pages 692--694.

CGF94
J. Cohen, H. Gish, and J. Flanagan. Switchboard---the second year. Technical Report /pub/caipworks2 at ftp.rutgers.edu, CAIP Summer Workshop in Speech Recognition: Frontiers in Speech Processing II, July 1994.

CH92
R. A. Cole, L. Hirschman, et al. Workshop on spoken language understanding. Technical Report CSE 92-014, Oregon Graduate Institute of Science & Technology, P.O.Box 91000, Portland, OR 97291-1000 USA, September 1992.

CLP94
C. Che, J. Lin, J. Pearson, B. de Vries, and J. Flanagan. Microphones arrays and neural networks for robust speech recognition. In ARPA [ARP94].

Coh89
J. R. Cohen. Application of an auditory model to speech recognition. Journal of the Acoustical Society of America, 85(6):2623--2629, June 1989.

DAR90
Defense Advanced Research Projects Agency. Proceedings of the Third DARPA Speech and Natural Language Workshop, Hidden Valley, Pennsylvania, June 1990. Morgan Kaufmann.

DAR91
Defense Advanced Research Projects Agency. Proceedings of the Fourth DARPA Speech and Natural Language Workshop, Pacific Grove, California, February 1991. Morgan Kaufmann.

DAR92
Defense Advanced Research Projects Agency. Proceedings of the Fifth DARPA Speech and Natural Language Workshop. Morgan Kaufmann, February 1992.

Dau90
I. Daubechies. The wavelet transform, time-frequency localization and signal analysis. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-36(5):961--1005, September 1990.

DK90
R. DeMori and R. Kuhn. A cache-based natural language model for speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-12(6):570--583, 1990.

DLR77
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum-likelihood from incompete data via the EM algorithm. Journal of the Royal Statistical Society, Ser. B., 39:1--38, 1977.

DLS90
R. O. Duda, R. F. Lyon, and M. Slaney. Correlograms and the separation of sounds. In Proceedings of the 24th Asilomar Conference on Signals, Systems and Computers, volume 1, pages 7457--7461, November 1990.

DM80
S. B. Davis and P. Mermelstein. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-28:357--366, August 1980.

DM94
V. Digalakis and H. Murveit. Genones: Optimizing the degree of mixture tying in a large vocabulary hidden Markov model based speech recognizer. In ICASSP [ICA94], pages 537--540.

DR72
J. N. Darroch and D. Ratcliff. Generalized iterative scaling for log-linear models. The Annals of Mathematical Statistics, 43:1470--1480, 1972.

Eph92
Y. Ephraim. Gain-adapted hidden Markov models for recognition of clean and noisy speech. IEEE Transactions on Acoustics, Speech and Signal Processing, 40:1303--1316, June 1992.

ESC93
ESCA. Proceedings of the ESCA workshop on prosody. Technical Report Working Papers 41, Lund University Department of Linguistics, 1993.

Eur89
European Speech Communication Association. Eurospeech '89, Proceedings of the First European Conference on Speech Communication and Technology, Paris, 1989. European Speech Communication Association.

Eur91
Eurospeech '91, Proceedings of the Second European Conference on Speech Communication and Technology, Genova, Italy, September 1991. European Speech Communication Association.

Eur93
Eurospeech '93, Proceedings of the Third European Conference on Speech Communication and Technology, Berlin, September 1993. European Speech Communication Association.

EW90
A. Erell and M. Weintraub. Recognition of noisy speech: Using minimum-mean log-spectral distance estimation. In DARPA [DAR90], pages 341--345.

FBC95
M. Fanty, E. Barnard, and R. A. Cole. Alphabet recognition. In Handbook of Neural Computation. Publisher Unknown, 1995. In press.

Fur81
S. Furui. Cepstral analysis technique for automatic speaker verification. IEEE Transactions on Acoustics, Speech and Signal Processing, 29(2):254--272, 1981.

Fur86a
S. Furui. Research on individuality features in speech waves and automatic speaker recognition techniques. Speech Communication, 5(2):183--197, 1986.

Fur86b
S. Furui. Speaker-independent isolated word recognition using dynamic features of the speech spectrum. IEEE Transactions on Acoustics, Speech and Signal Processing, 29(1):59--59, 1986.

Fur89
Sadaoki Furui. Digital Speech Processing, Synthesis, and Recognition. Marcel Dekker, New York, 1989.

Fur91
S. Furui. Speaker-dependent-feature extraction, recognition and processing techniques. Speech Communication, 10(5-6):505--520, 1991.

Fur94
S. Furui. An overview of speaker recognition technology. In Proceedings of the ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, pages 1--9, 1994.

Ghi88
O. Ghitza. Temporal non-place information in the auditory-nerve firing patterns as a front end for speech recognition in a noisy environment. Journal of Phonetics, 16(1):109--124, 1988.

GL91
Jean-Luc Gauvain and C.-H. Lee. Bayesian learning for hidden markov model with gaussian mixture state observation densities. In Eurospeech [Eur91], pages 939--942.

GMF94
C. Griffin, T. Matsui, and S. Furui. Distance measures for text-independent speaker recognition based on MAR model. In ICASSP [ICA94], pages 309--312.

Gre88
S. Greenberg. Theme issue: Representation of speech in the auditory periphery. Journal of Phonetics, 16(1), January 1988.

GY92
M. J. F. Gales and S. J. Young. An improved approach to the hidden Markov model decomposition of speech and noise. In ICASSP [ICA92], pages 233--236.

GZ92
D. Goddeau and V. Zue. Integrating probabilistic LR parsing into speech understanding systems. In ICASSP [ICA92], pages 181--184.

HAJ90
X. D. Huang, Y. Ariki, and M. Jack. Hidden Markov Models for Speech Recognition. Edinburgh University Press, 1990.

HBP91
A. L. Higgins, L. Bahler, and J. Porter. Speaker verification using randomized phrase prompting. Digital Signal Processing, 1:89--106, 1991.

Her90
H. Hermansky. Perceptual linear predictive (PLP) analysis for speech. Journal of the Acoustical Society of America, 87(4):1738--1752, April 1990.

HH93
M. Y. Hwang and X. Huang. Shared-distribution hidden Markov models for speech recognition. IEEE Transactions on Speech and Audio Processing, 1(4):414--420, October 1993.

Hin83
D. Hindle. Deterministic parsing of syntactic nonfluencies. In Proceedings of the 21st Annual Meeting of the Association for Computational Linguistics, pages 123--128, Cambridge, Massachusetts, 1983. Association for Computational Linguistics.

HL89
M. J. Hunt and C. Lefèbvre. A comparison of several acoustic representations for speech recognition with degraded and undegraded speech. In ICASSP [ICA89], pages 262--265.

HL91
Hsiao-Wuen Hon and Kai-Fu Lee. CMU robust vocabulary-independent speech recognition system. In ICASSP [ICA91], pages 889--892.

HL93
X. D. Huang and K. F. Lee. On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition. IEEE Transactions on Speech and Audio Processing, 1(2):150--157, April 1993.

HMBK91
H. Hermansky, N. Morgan, A. Bayya, and P. Kohn. Compensation for the effects of the communication channel in auditory-like analysis of speech. In Eurospeech [Eur91], pages 1367--1370.

HMH93
H. Hermansky, N. Morgan, and H. G. Hirsch. Recognition of speech in additive and convolutional noise based on RASTA spectral processing. In ICASSP [ICA93], pages 83--86.

HMR91
H. G. Hirsch, P. Meyer, and H. W. Ruehl. Improved speech recognition using high-pass filtering of subband envelopes. In Eurospeech [Eur91], pages 413--416.

HUGN93
R. Haeb-Umbach, D. Geller, and H. Ney. Improvements in connected digit recognition using linear discriminant analysis and mixture densities. In ICASSP [ICA93], pages 239--242.

Hun93
M. J. Hunt. Signal processing for speech. In R. E. Asher, editor, The Encyclopedia of Language and Linguistics. Pergamon Press, 1993.

ICA87
Institute of Electrical and Electronic Engineers. Proceedings of the 1987 International Conference on Acoustics, Speech, and Signal Processing, Dallas, April 1987.

ICA89
Institute of Electrical and Electronic Engineers. Proceedings of the 1989 International Conference on Acoustics, Speech, and Signal Processing, Glasgow, Scotland, May 1989.

ICA90
Institute of Electrical and Electronic Engineers. Proceedings of the 1990 International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, New Mexico, April 1990.

ICA91
Institute of Electrical and Electronic Engineers. Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, May 1991.

ICA92
Institute of Electrical and Electronic Engineers. Proceedings of the 1992 International Conference on Acoustics, Speech, and Signal Processing, San Francisco, March 1992.

ICA93
Institute of Electrical and Electronic Engineers. Proceedings of the 1993 International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, Minnesota, April 1993.

ICA94
Institute of Electrical and Electronic Engineers. Proceedings of the 1994 International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 1994.

ICS90
Proceedings of the 1990 International Conference on Spoken Language Processing, Kobe, Japan, November 1990.

ICS92
Proceedings of the 1992 International Conference on Spoken Language Processing, Banff, Alberta, Canada, October 1992. University of Alberta.

ICS94
Proceedings of the 1994 International Conference on Spoken Language Processing, Yokohama, Japan, September 1994.

IOR94
R. Iyer, M. Ostendorf, and R. Rohlicek. An improved language model using a mixture of Markov components. In ARPA [ARP94].

Jel69
F. Jelinek. A fast sequential decoding algorithm using a stack. IBM journal of Research and Development, 13, November 1969.

JMRS91
F. Jelinek, B. Merialdo, S. Roukos, and M. Strauss. A dynamic language model for speech recognition. In DARPA [DAR91], pages 293--295.

JRW86
B. H. Juang, L. R. Rabiner, and J. G. Wilpon. On the use of bandpass liftering in speech recognition. In Proceedings of the 1986 International Conference on Acoustics, Speech, and Signal Processing, pages 765--768, Tokyo, April 1986. Institute of Electrical and Electronic Engineers.

Jua91
B. H. Juang. Speech recognition in adverse environments. Computer Speech and Language, pages 275--294, 1991.

KAM94
F. Kubala, A. Anastasakos, J. Makhoul, L. Nguyen, R. Schwartz, and G. Zavaliagkos. Comparative experiments on large vocabulary speech recognition. In ICASSP [ICA94], pages 561--564.

KDMM94
R. Kuhn, R. De Mori, and E. Millien. Learning consistent semantics from training data. In ICASSP [ICA94], pages 37--40.

KMH94
J. Koehler, N. Morgan, H. Hermansky, H. G. Hirsch, and G. Tong. Integrating RASTA-PLP into speech recognition. In ICASSP [ICA94], pages 421--424.

LBB92
P. Lockwood, J. Boudy, and M. Blanchet. Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments. In ICASSP [ICA92], pages 265--268.

Lic94
R. J. Lickley. Detecting Disfluency in Spontaneous Speech. PhD thesis, University of Edinburgh, Scotland, 1994.

LMP87
R. P. Lippmann, F. A. Martin, and D. B. Paul. Multi-style training for robust isolated-word speech recognition. In ICASSP [ICA87], pages 709--712.

LO79
J. Lim and A. Oppenheim. Enhancement and bandwidth compression of noisy speech. Proceedings of the IEEE, 67:1586--1604, 1979.

LRR93
R. Lau, R. Rosenfeld, and S. Roukos. Trigger-based language models: A maximum entropy approach. In ICASSP [ICA93], pages 45--48.

LSAM94
F.-H. Liu, R. M. Stern, A. Acero, and P. Moreno. Environment normalization for robust speech recognition using direct cepstral comparison. In ICASSP [ICA94], pages 61--64.

LST92
J. Lafferty, D. Sleator, and D. Temperley. Grammatical trigrams: A probabilistic model of link grammar. In Proceedings of the AAAI Fall Symposium on Probabilistic Approaches to Natural Language, 1992.

Lyo82
R. F. Lyon. A computational model of filtering, detection, and compression in the cochlea. In Proceedings of the 1982 International Conference on Acoustics, Speech, and Signal Processing, pages 1282--1285. Institute of Electrical and Electronic Engineers, May 1982.

MBDW93
H. Murveit, J. Butzberger, V. Digilakis, and M. Weintraub. Large-vocabulary dictation using SRI's DECIPHER speech recognition system: Progressive search techniques. In ICASSP [ICA93], pages 319--322.

ME91
N. Merhav and Y. Ephraim. Maximum likelihood hidden markov modeling using a dominant state sequence of states. IEEE Transactions on Signal Processing, 39(9):2111--2114, September 1991.

MF93a
T. Matsui and S. Furui. Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs. In ICASSP [ICA93], pages 157--160.

MF93b
T. Matsui and S. Furui. Concatenated phoneme models for text-variable speaker recognition. In ICASSP [ICA93], pages 391--394.

MF94a
T. Matsui and S. Furui. Similarity normalization method for speaker verification based on a posteriori probability. In Proceedings of the ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, pages 59--62, 1994.

MF94b
T. Matsui and S. Furui. Speaker adaptation of tied-mixture-based phoneme models for text-prompted speaker recognition. In ICASSP [ICA94], pages 125--128.

MG76
J. D. Markel and A. H. Gray, Jr. Linear Prediction of Speech. Springer-Verlag, Berlin, 1976.

MZ90
Helen M. Meng and Victor W. Zue. A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons. In ICSLP [ICS90], pages 1053--1056.

Nil71
N. J. Nilsson. Problem-Solving Methods in Artificial Intelligence. McGraw-Hill, New York, 1971.

NMNP92
H. Ney, D. Mergel, A. Noll, and A. Paesler. Data driven search organization for continuous speech recognition. IEEE Transactions on Signal Processing, 40(2):272--281, February 1992.

NND89
J. M. Naik, L. P. Netsch, and G. R. Doddington. Speaker verification over long distance telephone lines. In ICASSP [ICA89], pages 524--527.

NW94
L. Neumeyer and M. Weintraub. Probabilistic optimum filtering for robust speech recognition. In ICASSP [ICA94], pages 417--420.

Ohs93
Y. Ohshima. Robustness in Speech Recognition using Physiologically-Motivated Signal Processing. PhD thesis, CMU, 1993.

O'S86
D. O'Shaughnessy. Speaker recognition. IEEE Acoustics, Speech and Signal Processing Magazine, 3(4):4--17, 1986.

Pal91
D. Pallett. DARPA resource management and ATIS benchmark test poster session. In DARPA [DAR91], pages 49--58.

Pal92
D. Pallett. ATIS benchmarks. In DARPA [DAR92].

Pau94
D. B. Paul. The Lincoln large-vocabulary stack-decoder based HMM CSR. In ARPA [ARP94], pages 374--379.

PB84
J. E. Porter and S. F. Boll. Optimal estimators for spectral restoration of noisy speech. In Proceedings of the 1984 International Conference on Acoustics, Speech, and Signal Processing, pages 18.A.2.1--4. Institute of Electrical and Electronic Engineers, 1984.

PDF92
D. Pallett, N. Dahlgren, J. Fiscus, W. Fisher, J. Garofolo, and B. Tjaden. DARPA February 1992 ATIS benchmark test results. In DARPA [DAR92], pages 15--27.

Pet89
P. M. Peterson. Adaptive array processing for multiple microphone hearing aids. Technical Report 541, Research Laboratory of Electronics, MIT, Cambridge, Massachusetts, 1989.

PFF94
D. Pallett, J. Fiscus, W. Fisher, J. Garofolo, B. Lund, and M. Prysbocki. 1993 benchmark tests for the ARPA spoken language program. In ARPA [ARP94], pages 49--74.

PFF95
D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, B. A. Lund, A. Martin, and M. A. Przybocki. 1994 benchmark tests for the ARPA spoken language program. In ARPA [ARP95a], pages 5--36.

PFFG90
D. Pallett, W. Fisher, J. Fiscus, and J. Garofolo. DARPA ATIS test results. In DARPA [DAR90], pages 114--121.

PFFG93
D. Pallett, J. Fiscus, W. Fisher, and J. Garofolo. Benchmark tests for the DARPA spoken language program. In ARPA [ARP93], pages 7--18.

PO95
P. Price and M. Ostendorf. Combining linguistic with statistical methods in modeling prosody. In J. L. Morgan and K. Demuth, editors, Signal to syntax: Bootstrapping from speech to grammar in early acquisition. Lawrence Erlbaum Associates, Hillsdale, New Jersey, 1995.

PRH91
R. D. Patterson, K. Robinson, J. Holdsworth, D. McKeown, C. Zhang, and M. Allerhand. Complex sounds and auditory images. In Auditory Physiology and Perception, pages 429--446. Pergamon Press, 1991.

Rab89
L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257--286, February 1989.

RS78
Lawrence R. Rabiner and Ronald W. Schafer. Digital Processing of Speech Signals. Signal Processing. Prentice-Hall, Englewood Cliffs, New Jersey, 1978.

RS91
A. E. Rosenberg and F. K. Soong. Recent research in automatic speaker recognition. In S. Furui and M. M. Sondhi, editors, Advances in Speech Signal Processing, pages 701--737. Marcel Dekker, New York, 1991.

SCI75
Jr. Stockham, T. G., T. M. Connon, and R. B. Ingebretsen. Blind deconvolution through digital signal processing. Proceedings of the IEEE, 63(4):678--692, 1975.

SCK87
R. Schwartz, Y. Chow, and F. Kubala. Rapid speaker adaption using a probabalistic spectral mapping. In ICASSP [ICA87], pages 633--636.

Sen88
S. Seneff. A joint synchrony/mean-rate model of auditory speech processing. Journal of Phonetics, 16(1):55--76, 1988.

SF94
K. Shirai and S. Furui. Special issue on spoken dialogue. Speech Communication, 15(3-4), 1994.

Shr94
E. E. Shriberg. Preliminaries to a Theory of Speech Disfluencies. PhD thesis, U. Cal. Berkeley, 1994.

SS93
T. M. Sullivan and R. M. Stern. Multi-microphone correlation-based processing for robust speech recognition. In ICASSP [ICA93], pages 91--94.

SS94
J. Schroeter and M. M. Sondhi. Techniques for estimating vocal tract shapes from the speech signal. IEEE Transactions on Speech and Audio Processing, 2(1):133--150, January 1994.

STNE92
E. G. Shukat-Talamazzini, H. Niemann, W. Eckert, T. Kuhn, and S. Rieck. Acoustic modeling of sub-word units in the ISADORA speech recognizer. In ICASSP [ICA92], pages 577--580.

VC90
Dirk Van Compernolle. Switching adaptive filters for enhancing noisy and reverberant speech from microphone array recordings. In ICASSP [ICA90], pages 833--836.

VM90
A. P. Varga and R. K. Moore. Hidden Markov model decomposition of speech and noise. In ICASSP [ICA90], pages 845--848.

WL90
A. Waibel and K. F. Lee. Readings in Speech Recognition. Morgan Kaufmann, 1990.

ZGPS90
V. Zue, J. Glass, M. Phillips, and S. Seneff. The MIT SUMMIT speech recognition system: A progress report. In DARPA [DAR90].


next up previous contents
Next: Chapter 2: Written Language Input Up: Ch. 1 Title Page Previous: 1.8: Spoken Language Understanding