Spelled and Spoken Words Corpus Release 1.2 Center for Spoken Language Understanding UPDATED: 23 August 2002 Use of this corpus is permitted only under the conditions of the signed license agreement. Use or redistribution of this corpus outside the agreement is prohibited by law. Overview -------- The Spelled and Spoken Words corpus consists of spelled and spoken words. 3647 callers were prompted to to say and spell their first and last names, to say what city they grew up in and what city they were calling from, and to answer two yes/no questions. In order to collect sufficient instances of each letter, 1371 callers also recited the English alphabet with pauses between the letters. Each call was transcribed by two people, and all differences were resolved. In addition, a subset of 2648 calls has been phonetically labeled. Distribution Directory Structure -------------------------------- This is the distribution for Release 1.1 of the Spelled and Spoken Words Corpus. This corpus is distributed by the Center for Spoken Language Understanding of the Oregon Graduate Institute. Following is a description of the directory structure in this release: readme.txt General information regarding the corpus. docs/ The documentation directory. This directory contains further documentation for the Spelled and Spoken Words Corpus. labels/ Phonetic labeling directory. This directory contains phonetic labels for the speech files. misc/ Miscellaneous directory, possibly containing software tools and scripts. speech/ The speech directory contains the actual .wav files. There are many subdirectories within the speech directory. trans/ The transcriptions directory. This directory contains non-time-aligned word level transcriptions for each of the speech files. This corpus requires approximately 2.0GB of disk space. Please see the /docs directory for further documentation. Contact Information ------------------- Further information about this corpus can be found our web site: . Refer specific questions to: - Alena Tkacova - Linguistic Data Services Manager - Center for Spoken Language Understanding - Oregon Health & Science University - email : alca@asp.ogi.edu - Phone : 503 748-1600 - FAX : 503 748-7038 - Address : 20000 NW Walker Road Beaverton, OR 97006 USA Constructive feedback about this corpus is appreciated.