Speech Enhancement and Assessment Resource (SpEAR)

Home | Demos | Research | Database and Tools | Publications | People | Links
SpEAR description | SpEAR beta release

SpEAR consists of a speech database and toolkit for assessing performance of speech enhancement algorithms.

This project is currently pending NSF funding. Companies and corporate members interested in providing input and/or financial support for this project should contact Professor Eric Wan.

The objective of this project is to develop standardized data, tools, and evaluation guidelines to enable the consistent benchmarking of speech enhancement (noise removal) algorithms. CSLU is developing a Speech Enhancement Assessment Resource (SpEAR), consisting of a speech database particularly suited for enhancement tasks, along with a toolkit specifically designed for assessing the performance of different speech enhancement algorithms.

SpEAR will be provided for educational purposes as a service to the speech enhancement research community.

SpEAR Database

Most speech databases today have been designed for speech recognition experiments. They contain many utterances spoken by a large number of subjects often in a single noise free environment. In contrast, the SpEAR Database will be created to examine and assess the performance of speech enhancement systems in adverse/noisy environments. The SpEAR Database will contain carefully selected samples of noise-corrupted speech that have been recorded by acoustically combining clean speech and noise (a clock-synchronous procedure will be used to provide a time-aligned reference to the clean speech). Three separate sub-corpora will be
designed:

Each sub-corpus will be recorded in the following environments:

SpEAR Toolkit

The SpEAR Toolkit will be a collection of routines and a convenient graphical user interface for examining and assessing the performance of speech enhancement algorithms. The toolkit will contain components for both objective and subjective evaluations, with a clear set of guidelines
on how the resources are to be used, what experiments are to be conducted, and how evaluation results are to be reported.

Objective Quality Assessment Measures.

The SpEAR toolkit will contain a baseline set of standard objective speech quality measures used within the speech enhancement community. These include:

The toolkit will contain scripts and interfaces specifically designed to evaluate performance on the SpEAR Human/quality sub-corpora. A GUI will allow the user to view clean, noisy and enhanced speech along with noise sources, phonetic labels and objective quality measures, in a time aligned representation. This will allow the user to investigate the performance of a speech enhancement algorithm in detail, especially as far as how it pertains to different components (phonemes, phones, syllables, etc.) of speech. Scripts will allow the user, for example, to automatically determine objective performance measures divided into classes (e.g., voiced and un-voiced, etc.) or the cumulative performance over the entire SpEAR corpus.

Subjective Evaluation

Five experimental procedures will be implemented: two for measuring intelligibility and three for measuring quality. These experiments
will be performed on the raw SpEAR sub-corpora as well as after processing with a number standard enhancement algorithms. The
results will provide a base reference for subjective performance evaluation. In addition, the protocols, scripts, and interfaces developed for performing our evaluations will be made available to the academic community to aid in standardizing outside evaluation.