Speech in Noisy Environments (SPINE) Training Audio

Item Name: Speech in Noisy Environments (SPINE) Training Audio
Author(s): Astrid Schmidt-Nielsen, Elaine Marsh, John Tardelli, Paul Gatewood, Elizabeth Kreamer, Thomas Tremain, Christopher Cieri, Jonathan Wright
LDC Catalog No.: LDC2000S87
ISBN: 1-58563-173-6
ISLRN: 884-501-625-360-2
DOI: https://doi.org/10.35111/ka1n-0c43
Member Year(s): 2000
DCMI Type(s): Sound
Data Source(s): microphone conversation
Project(s): SPINE
Application(s): speech recognition
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2000S87 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Schmidt-Nielsen, Astrid, et al. Speech in Noisy Environments (SPINE) Training Audio LDC2000S87. Web Download. Philadelphia: Linguistic Data Consortium, 2000.
Related Works: View

Introduction

Speech in Noisy Environments (SPINE) Training Audio Corpus was developed for the Department of Defense (DoD) Digital Voice Processing Consortium (DDVPC) by Arcon Corp. Corresonding transripts, Speech in Noisy Environments (SPINE) Training Transcripts, are available as LDC2000T49. These corpora supported the 2000 Speech in Noisy Environments (SPINE1) evaluation.

The 2000 Speech in Noisy Environments Evaluation (SPINE1) was a first attempt to assess the state of the art and practice in speech recognition technology in noisy military environments and to exchange information on innovative speech recognition technology in the context of fully implemented systems that perform realistic tasks. It was intended to be of interest to all university, industrial and commercial speech system developers working on the problem of robust speech recognition. The evaluation gave participants the opportunity to participate in a flexible evaluation, suited to development needs and abilities.

The SPINE1 evaluation focused on the task of transcribing speech produced in noisy environments with emphasis on noisy military environments. The evaluation was designed to promote research progress in this area, to provide the opportunity for participants to try out new ideas for developing robust speech recognition systems that were of both scientific and practical interest, and to measure the performance of this technology.

This work was sponsored in part by National Science Foundation Grant No. IIS-9982201.

Data

The evaluation task was to transcribe speech produced in noisy environments. The training and test speech data used for this evaluation were generated by ARCON Corp. for the DoD Digital Voice Processing Consortium (DDVPC) under controlled conditions.

The speech data consists of conversations between two communicators working on a collaborative battleship-like task in which they seek and shoot at targets (ARCON Communicability Exercise, ACE). Participants could talk freely, but the total vocabulary used was fairly limited. Each person was seated in a sound chamber in which a previously recorded military background noise environment was accurately reproduced. The participants used handsets and transmission channels that were resident to the particular environment.

The training data includes 10 of twenty available talker pairs with 14 five-minute conversations per talker pair (about 720 minutes total), which includes four noise scenarios.

Samples

For an example of the data contained in this corpus, please listen to this audio sample.

Updates

There are no updates at this time.

Available Media

View Fees





Login for the applicable fee