ISOLET Corpus
Release 1.3
Center for Spoken Language Understanding

UPDATED: 19 August 2002

Use of this corpus is permitted only under the conditions of the signed
license agreement. Use or redistribution of this corpus outside the
agreement is prohibited by law.

Overview
--------

ISOLET is a database of letters of the English alphabet spoken in
isolation. The database consists of 7800 spoken letters: two productions
of each of the 26 letters by 150 speakers. It contains approximately
1.25 hours of speech. The recordings were made under quiet laboratory
conditions with a noise-canceling microphone.

Distribution Directory Structure
--------------------------------

This is the distribution for Release 1.2 of the ISOLET Corpus. This
corpus is distributed by the Center for Spoken Language Understanding of
the Oregon Health & Science University. Following is a description of
the directory structure in this release:

readme.txt   General information regarding the corpus.

docs/        The documentation directory. This directory contains
             further documentation for the ISOLET Corpus.

labels/      The phonetic labeling directory. This directory contains
             time-aligned phonetic labels for this corpus (automatic
             forced alignment).

misc/        The miscellaneous directory, possibly containing software
             tools and scripts.

speech/      The speech directory. This directory contains the actual
             .wav files. There are many subdirectories within the
             speech directory.

trans/       The transcriptions directory. This directory contains
             transcriptions for this corpus.

This corpus requires approximately 220MB of disk space.

Please see the docs/ directory for further documentation.

Contact Information
-------------------

Further information about this corpus can be found on our web site.

Refer specific questions to:

  - Center for Spoken Language Understanding
  - Oregon Health & Science University
  - email   : corpora@cslu.ogi.edu
  - Address : 20000 NW Walker Road
              Beaverton, OR 97006 USA

Constructive feedback about this corpus is appreciated.
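
Checking the Distribution
-------------------------

As a sanity check on the figures in the Overview, the following Python
sketch recomputes the expected number of utterances (26 letters x 2
productions x 150 speakers) and counts the .wav files found under a
speech directory. It is only an illustration and not part of the
distributed tools: the function names are hypothetical, and it assumes
that the audio files use a lowercase .wav extension and that every .wav
file under speech/ is exactly one utterance, neither of which this
readme confirms.

```python
from pathlib import Path


def expected_utterance_count(speakers=150, letters=26, productions=2):
    """Utterance count implied by the Overview: speakers x letters x productions."""
    return speakers * letters * productions


def mean_utterance_seconds(total_hours=1.25, utterances=7800):
    """Rough mean length of one recording, from the approximate total duration."""
    return total_hours * 3600.0 / utterances


def count_wav_files(speech_dir):
    """Count .wav files in speech_dir and all of its subdirectories.

    Assumption: files are named with a lowercase ".wav" suffix.
    """
    return sum(1 for path in Path(speech_dir).rglob("*.wav") if path.is_file())
```

For an unpacked copy, comparing count_wav_files("speech") against
expected_utterance_count() gives a quick indication of whether any
recordings are missing. The mean length from mean_utterance_seconds()
is only as accurate as the approximate 1.25 hour total.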