CSLU: Stories v 1.2

Item Name: CSLU: Stories v 1.2
Author(s): Yeshwant Muthusamy, Ronald Cole, Beatrice Oshika
LDC Catalog No.: LDC2006S14
ISBN: 1-58563-366-6
ISLRN: 024-464-884-415-7
Release Date: October 25, 2006
Member Year(s): 2006
DCMI Type(s): Sound
Sample Type: pcm
Sample Rate: 8000
Data Source(s): telephone speech
Language(s): English
Language ID(s): eng
License(s): CSLU Agreement
Online Documentation: LDC2006S14 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Muthusamy, Yeshwant, Ronald Cole, and Beatrice Oshika. CSLU: Stories v 1.2 LDC2006S14. Web Download. Philadelphia: Linguistic Data Consortium, 2006.


This file contains documentation on CSLU: Stories V1.2, Linguistic Data Consortium (LDC) catalog number LDC2006S14 and ISBN 1-58563-366-6.

CSLU: Stories contains extemporaneous speech collected from English speakers in the CSLU Multilanguage Telephone Speech data collection. Each speaker was asked to speak on a topic of his or her choice for one minute. Those utterances are collected in the Stories corpus.


The Stories corpus comprises:

  1. Speech files for the 702 calls
  2. Time-aligned word level transcriptions (and corresponding comment files) for approximately 322 stories
  3. Word transcriptions (not time aligned) for 702 stories
  4. Time-aligned phonetic labels for 702 stories


For an example of the data in this corpus, please listen to this audio sample.

Available Media

View Fees

Extra Copy
Login for the applicable fee