![]() |
![]() |
|||||||||||||||||||||||||||||
Alphadigit v1.3General Description The Alphadigit Corpus is a collection of 78,044 examples from 3,025 speakers saying six digit strings of letters and digits over the telephone. A total of about 82 hours of speech are included in Release 1.3. Each file has an orthographic transcription and time align transcription as well. Recording Details Each subject called the CSLU data collection system by dialing a toll-free number. The data were recorded directly off of a digital phone line without digital-to-analog or analog-to-digital conversion at the recording end. The digital data were collected with the CSLU T1 digital data collection system. The sampling rate was 8khz and the files were stored in 8-bit mu-law format on a UNIX file system . The speech files have been converted to the standard 16-bit linearly encoded RIFF file format. Subject Population Subjects whose utterances are included in this corpus are respondents to USEnet postings. Respondants were required to fill out a form on the World Wide Web and register for the data collection. In response to their registration a list of letters and digits was emailed to them along with instructions on how to participate. Annotation All of the files included in this corpus have corresponding non-time-aligned word-level transcriptions that comply with the conventions in the CSLU Labeling Guide (could be found in CSLU's publication page). A total of 78,044 utterances were recorded. Every letter and digit, including zero and oh, is in the corpus (table of all letters and digits). Sometimes the callers said extra words or partial words. Instead of removing these utterances from the corpus we have left them in for researchers who may be interested in dealing with not-so-perfect alpha-digit strings. File Formats Each speech file filename in the Alphadigit Corpus encodes information about the call number, utterance type, and file type. Here is a typical filename: AD-2.p17.wav
The "wav" files contain speech data and use the RIFF standard wav file format. This file format is 16-bit linearly encoded. The transcriptions are contained in a standard text file. |
||||||||||||||||||||||||||||||
![]() |
||||||||||||||||||||||||||||||