Yes/No Corpus Release 1.2 Center for Spoken Language Understanding UPDATED: 23 August 2002 Overview -------- The Yes/No corpus is a collection of 19528 examples of people saying "yes" or "no". This corpus will be extremely useful for the development of high accuracy yes/no speech recognizers. Distribution Directory Structure -------------------------------- This is the distribution for Release 1.2 of the Yes/No Corpus. This corpus is distributed by the Center for Spoken Language Understanding of the Oregon Health & Science University. Following is a description of the directory structure in this release: readme.txt This file. docs/ The documentation directory. This directory contains further documentation for the yesno corpus. labels/ Phonetic labeling directory. This directory contains time aligned phoneme-level transcriptions (automatic forced alignment). misc/ Miscellaneous directory, possibly containing software tools and scripts. speech/ The speech directory contains the actual .wav files. There are several hundred numbered subdirectories within the speech directory. trans/ The transcriptions directory. This directory contains the orthographic transcription of each of the files. This corpus requires approximately 578MB of disk space. Please see the /docs directory for further documentation. Contact Information ------------------- Further information about this corpus can be found our web site: . Refer specific questions to: - Alena Tkacova - Linguistic Data Services Manager - Center for Spoken Language Understanding - Oregon Health & Science University - email : alca@asp.ogi.edu - Phone : 503 748-1600 - FAX : 503 748-7038 - Address : 20000 NW Walker Road Beaverton, OR 97006 USA Constructive feedback about this corpus is appreciated.