FFMTIMIT
| Item Name: | FFMTIMIT |
| Author(s): | John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, Victor Zue |
| LDC Catalog No.: | LDC96S32 |
| ISBN: | 1-58563-090-X |
| ISLRN: | 671-675-113-200-7 |
| DOI: | https://doi.org/10.35111/ftsx-tk27 |
| Member Year(s): | 1996 |
| DCMI Type(s): | Sound |
| Sample Type: | 1-channel pcm |
| Sample Rate: | 16000 |
| Data Source(s): | microphone speech |
| Application(s): | speech recognition |
| Language(s): | English |
| Language ID(s): | eng |
| License(s): |
LDC User Agreement for Non-Members |
| Online Documentation: | LDC96S32 Documents |
| Licensing Instructions: | Subscription & Standard Members, and Non-Members |
| Citation: | Garofolo, John S., et al. FFMTIMIT LDC96S32. Web Download. Philadelphia: Linguistic Data Consortium, 1996. |
| Related Works: | View |
FFMTIMIT contains the previously unreleased secondary microphone waveforms for TIMIT Acoustic-Phonetic Continuous Speech. The primary microphone waveforms, which were recorded using a close-talking noise-cancelling head-mounted Sennheiser microphone (model HMD-414), are available from LDC on NIST Speech Disc 1-1.1 (LDC93S1). The secondary microphone used in the recording of the TIMIT corpus was a Breul & Kjaer (B&K) 1/2" free-field microphone (model 4165).
While the Sennheiser microphone recordings are relatively "clean" with respect to non-speech noise, the FFMTIMIT recordings include significant low frequency noise, which was due to the HVAC system and mechanical vibration transmitted through the floor of the double-walled sound booth used in recording. Because it is noiser than its TIMIT counterpart, the data of FFMTIMIT may be used in the development of more noise-robust speech recognition systems. In addition, this data may be of value to researchers involved in vocal tract modeling because the B&K microphone has extremely flat free-field frequency response and calibration tones are provided.
Note that the B&K TIMIT data contained with this release has not been processed through any highpass filter, (e.g., the 1,581-point filter described in the paper "The DARPA Speech Recognition Research Database" by Fisher, Doddington and Goudie-Marshall in "DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CD-ROM," NISTIR 4930 / NTIS Order No. PB93- 173938.)