Home › Language Resources › Data

FFMTIMIT

Item Name:	FFMTIMIT
Author(s):	John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, Victor Zue
LDC Catalog No.:	LDC96S32
ISBN:	1-58563-090-X
ISLRN:	671-675-113-200-7
DOI:	https://doi.org/10.35111/ftsx-tk27
Member Year(s):	1996
DCMI Type(s):	Sound
Sample Type:	1-channel pcm
Sample Rate:	16000
Data Source(s):	microphone speech
Application(s):	speech recognition
Language(s):	English
Language ID(s):	eng
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC96S32 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Garofolo, John S., et al. FFMTIMIT LDC96S32. Web Download. Philadelphia: Linguistic Data Consortium, 1996.
Related Works: Hide	View isVersionOf LDC93S1 TIMIT Acoustic-Phonetic Continuous Speech Corpus isSimilarWith LDC93S2 NTIMIT LDC96S30 CTIMIT LDC98S67 HTIMIT LDC2008S03 STC-TIMIT 1.0 LDC2010S02 WTIMIT 1.0 LDC2017S04 Noisy TIMIT Speech

FFMTIMIT contains the previously unreleased secondary microphone waveforms for TIMIT Acoustic-Phonetic Continuous Speech. The primary microphone waveforms, which were recorded using a close-talking noise-cancelling head-mounted Sennheiser microphone (model HMD-414), are available from LDC on NIST Speech Disc 1-1.1 (LDC93S1). The secondary microphone used in the recording of the TIMIT corpus was a Breul & Kjaer (B&K) 1/2" free-field microphone (model 4165).

While the Sennheiser microphone recordings are relatively "clean" with respect to non-speech noise, the FFMTIMIT recordings include significant low frequency noise, which was due to the HVAC system and mechanical vibration transmitted through the floor of the double-walled sound booth used in recording. Because it is noiser than its TIMIT counterpart, the data of FFMTIMIT may be used in the development of more noise-robust speech recognition systems. In addition, this data may be of value to researchers involved in vocal tract modeling because the B&K microphone has extremely flat free-field frequency response and calibration tones are provided.

Note that the B&K TIMIT data contained with this release has not been processed through any highpass filter, (e.g., the 1,581-point filter described in the paper "The DARPA Speech Recognition Research Database" by Fisher, Doddington and Goudie-Marshall in "DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CD-ROM," NISTIR 4930 / NTIS Order No. PB93- 173938.)

FFMTIMIT

Available Media

View Fees