Noisy TIMIT Speech

Item Name: Noisy TIMIT Speech
Author(s): Azhar Abdulaziz, Veton Kepuska
LDC Catalog No.: LDC2017S04
ISBN: 1-58563-793-9
ISLRN: 107-834-092-668-3
DOI: https://doi.org/10.35111/m440-jj35
Release Date: March 17, 2017
Member Year(s): 2017
DCMI Type(s): Sound
Sample Type: flac
Sample Rate: 16000
Data Source(s): microphone speech
Application(s): speech recognition
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2017S04 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Abdulaziz, Azhar, and Veton Kepuska. Noisy TIMIT Speech LDC2017S04. Web Download. Philadelphia: Linguistic Data Consortium, 2017.
Related Works: View

Introduction

Noisy TIMIT Speech was developed by the Florida Institute of Technology and contains approximately 322 hours of speech from the TIMIT Acoustic-Phonetic Continuous Speech Corpus (LDC93S1) modified with different additive noise levels. Only the audio has been modified; the original arrangement of the TIMIT corpus is still as described by the TIMIT documentation.

Data

The additive noise are white, pink, blue, red, violet and babble noise with noise levels varying in 5 dB (decibel) steps and ranges from 5 to 50 dB.

The color of noise refers to the power spectrum of a noise signal. Sound waves have two characteristics: frequency, which describes how fast the waveform vibrates per second; and amplitude, the size of the waveform. Colored noises are named in an analogy to the colors of light.  For instance, white noise contains all audible frequencies just as white light contains all frequencies in the visible range. Non-white colored noises have more energy concentrated at the high or low end of the sound spectrum. White, pink and blue noise are officially defined in the federal telecommunications standard.

The white, pink, blue, red and violet noise types added to the TIMIT data in this release were generated artificially using MATLAB. For the babble noise, a random segment of recorded babble speech was selected and scaled relative to the power of the original TIMIT audio signal.

All audio files are presented as single channel 16kHz 16-flac.

Samples

Please listen to the following samples:

Updates

None at this time.

Related Works incorporating TIMIT

TIMIT was designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. Since its release in 1993, several corpora have been developed using the TIMIT database:

NTIMIT (LDC93S2): transmitting TIMIT recordings through a telephone handset and over various channels in the NYNEX telephone network

CTIMIT (LDC96S30): passing TIMIT files through cellular telephone circuits

FFMTIMIT (LDC96S32): re-recording TIMIT files with a free-field microphone

HTIMIT (LDC98S67): re-recording a subset of TIMIT files throgh different telephone handsets

STC-TIMIT (LDC2008S03): passing TIMIT files through an actual telephone channel in a single call

WTIMIT 1.0 (LDC2010S02): wideband mobile telephony TIMIT version

Available Media

View Fees





Login for the applicable fee