Stories Corpus Release 1.2 Center for Spoken Language Understanding UPDATED: 22 September 2002 Overview -------- The Stories Corpus is made up of extemporaneous speech collected from English speakers in the CSLU Multi-language Telephone Speech data collection. Each speaker was asked to speak on a topic of their choice for one minute. These utterances are make up the Stories Corpus. The Stories Corpus comprises: 1. Speech files for the 702 calls 2. Time-aligned word level transcriptions (and corresponding comment files) for approximately 322 stories 3. Word transcriptions (not time aligned) for 702 stories 4. Time-aligned phonetic labels for 702 stories Prompt ------ The callers were responding to the following prompt: "We now want you to talk for a longer period of time. We do not care what you say as long as you keep talking. You can tell us anything about yourself, your hobbies and interests, the city that you live in, and the sports that you like. Or you can make up a story, tell a fairy-tale or recite a poem. You will have 1 minute to speak. We will now give you 10 seconds to think about what to say. Please do not read anything, we would prefer you make something up."