CIPIC, Center for Image Processing and Integrated Computing
University of California, Davis
Pierre L. Devinyi
Speech and Hearing Research Facility
V.A. Martinez Medical Center, Martinez, CA
Richard O. Duda
Department of Electrical Engineering
San Jose State University
Recognizing these problems, a number of researchers have investigated ways to model the HRTF. Prior work can be summarized by noting that two basically different approaches have been used. One approach is mathematically based, typically involving the use of series expansions to approximate the measured data. The other approach is physically based, typically employing some kind of approximate solution to the wave equation.
Our research is based on the belief that the HRTF can be accurately modeled by a physically-based model employing a small number of free parameters. We also anticipate that these parameters can be adapted or customized to individual listeners by correlation with a small number of properly chosen anthropometric measurements. Based on these premises, we propose to develop and validate HRTF models using a combination of the physical and mathematical approaches. We will also perform extensive psychological experiments to validate the adequacy of the resulting models and filters. This research should provide the basis for a simple and inexpensive way to determine a computationally efficient way to synthesize individually customized three-dimensional sound.
V. R. Algazi, P. L. Devinyi and R. O. Duda, "Subject dependent transfer functions in spatial hearing", Proc. 1997 IEEE MWSCAS Conference. (August, 1997)
R. O. Duda and W. L. Martens, "Range-dependence of the HRTF for a spherical head," Proc. 1997 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics (October, 1997).
For other references, see the related on-going project,`` A computational model for sound localization''directed by Richard O. Duda.
Unfortunately, the sizes and shapes of torsos, heads and particularly the pinnae vary substantially from person to person, and thus the character of the HRTF also varies from person to person. These inter-subject variations are often quite significant, and serious localization errors (primarily front/back reversals and large elevation errors) can occur when one person hears the source through another person's HRTF.
Studies of the neural pathways from the cochlea to the auditory cortex provide inspiration for both the structure of a localization model and the kinds of signal processing that are appropriate, helping to define parameters such as filter bandwidths, response times, and compressive nonlinearities to cope with dynamic range. Studies in psychoacoustics reveal the different kinds of cues that humans use to localize sources, and human abilities to deal with echoes and reverberation. Our effort involves synthesizing this information in computational models of sound localization.
J. Blauert, Spatial Hearing: The Psychophysics of Human Sound Localization, Revised Edition (MIT Press, Cambridge, MA , 1997). The standard reference on the psychophysics of three-dimensional hearing.
A. S. Bregman, Auditory Scene Analysis (MIT Press, Cambridge, MA, 1990). A massive description of experiments by the author and his students on the factors that influence the formation and segregation of sound streams.
S. Carlile, Virtual Auditory Space: Generation and Applications (R. G. Landes Co., Austin, TX, 1996). A lucid and valuable book of survey chapters that emphasize the physical factors that control spatial hearing.
H. L. Hawkins, T. A. McMullen, A. N. Popper and R. R. Fay, Eds., Auditory Computation, Springer-Verlag, New York, 1996. An important edited volume of papers about models of the hearing process.
J. C. Middlebrooks and D. M. Green, "Sound localization by human listeners," Annu. Rev. Psychol., Vol. 42, pp. 135-159 (1991). An excellent review of the abilities of people to localize sound. Highly recommended.
1. Sound localization in hearing aids. The customization of binaural hearing aids would improve spatial localization and the discrimination of sound sources by their location.
2. 3-D sound for teleconferencing. Spatial sound synthesis should allow a participant in a teleconference to place the sounds from other participants in different spatial locations, and should both improve intelligibility and reduce the cognitive load.
3. Acoustic displays for pilots and air-traffic controllers.
4. Virtual reality systems.
Last modified: Tuesday Jun 17 17:16:07 PDT 1997