CSLU: Numbers Version 1.3

Item Name: CSLU: Numbers Version 1.3
Author(s): Ronald Cole, M Noel, T Lander, T Durham
LDC Catalog No.: LDC2009S01
ISBN: 1-58563-501-4
ISLRN: 144-817-035-468-1
Release Date: January 16, 2009
Member Year(s): 2009
DCMI Type(s): Sound
Sample Type: Signed 16 bit PCM,1 Channel
Sample Rate: 8000
Data Source(s): telephone speech
Application(s): speech recognition
Language(s): English
Language ID(s): eng
License(s): CSLU Agreement
Online Documentation: LDC2009S01 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Cole, Ronald, et al. CSLU: Numbers Version 1.3 LDC2009S01. Web Download. Philadelphia: Linguistic Data Consortium, 2009.

Introduction:

CSLU: Numbers Version 1.3, Linguistic Data Consortium (LDC) catalog number LDC2009S01 and isbn 1-58563-501-4, was created by the Center for Spoken Language Understanding (CSLU) at OGI School of Science and Engineering, Oregon Health and Science University, Beaverton, Oregon. It is a collection of naturally produced numbers taken from utterances in various CSLU telephone speech data collections. The corpus consists of approximately fifteen hours of speech and includes isolated digit strings, continuous digit strings, and ordinal/cardinal numbers.

The numbers have several sources, among them, phone numbers, numbers from street addresses and zip codes, uttered by 12618 speakers in a total of 23902 files. In most of CSLU's telephone data collections, callers were asked for their phone number, birthdate or zip code. Callers would also occasionally leave numbers in the midst of another utterance. The numbers in those situations were extracted from the host utterance and added to the corpus.

Additional information about this publication is available from the corpus web page at CSLU.

Data:

The speech data was collected over analog and digital telephone lines. The analog data was recorded using a Gradient Technologies analog-to-digital conversion box; those files were recorded as 16-bit, 8 khz and stored in a linear format. The digital data was recorded with the CSLU T1 digital data collection system; those files were sampled at 8khz, 8-bit and stored as ulaw files. All of the data in this release has been linearly encoded in 16-bit RIFF standard file format.

Each file includes an orthographic transcription following the CSLU Labeling guidelines which are included in the documentation for this publication. Also, many of the utterances have been phonetically labeled.

Statistics:

CSLU: Numbers Version 1.3 consists of approximately fifteen hours of speech. The following table gives a count of the number of files for each utterance type.
Type Number
phone 2970
street 7079
zipcode 7076
other 6771

Samples:

For an example of the data contained in this corpus, please examine the audio files and labels for the following spoken sequences

  • Street Address: one sixteen wav|label
  • Zipcode: one oh three one four wav|label

Available Media

View Fees





Login for the applicable fee