                           Yes/No Corpus
                            Release 1.2

              Center for Spoken Language Understanding


UPDATED: 23 August 2002


Overview
--------
The Yes/No corpus is a collection of 19528 examples of
people saying "yes" or "no".  It is intended for the
development of high-accuracy yes/no speech recognizers.

Distribution Directory Structure
--------------------------------
This is the distribution for Release 1.2 of the Yes/No Corpus.  This
corpus is distributed by the Center for Spoken Language Understanding
of the Oregon Health & Science University.  The following is a description
of the directory structure in this release:

  readme.txt	This file.

  docs/		The documentation directory. This directory
		contains further documentation for the Yes/No
		corpus.

  labels/	Phonetic labeling directory. This directory
                contains time-aligned phoneme-level
                transcriptions (automatic forced alignment).

  misc/		Miscellaneous directory, possibly containing
		software tools and scripts.

  speech/	The speech directory contains the actual 
		.wav files. There are several hundred numbered
		subdirectories within the speech directory.

  trans/	The transcriptions directory. This directory
		contains the orthographic transcription of each
		of the files.

This corpus requires approximately 578 MB of disk space. Please see
the docs/ directory for further documentation.
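
As a quick sanity check after unpacking, the number of .wav files under
speech/ can be compared with the 19528 examples mentioned in the overview.
The following Python sketch is only an illustration and is not part of the
distribution.  It assumes one .wav file per example and that all .wav files
sit somewhere below speech/; the function name count_wav_files and the
constant EXPECTED_EXAMPLES are hypothetical.

```python
from pathlib import Path

# Count stated in the overview of this file. Assumption: one .wav per example.
EXPECTED_EXAMPLES = 19528


def count_wav_files(corpus_root):
    """Return the number of .wav files found anywhere below <corpus_root>/speech."""
    speech_dir = Path(corpus_root) / "speech"
    # Match the extension case-insensitively, because the case used on disk
    # is not stated in this file.
    return sum(
        1
        for path in speech_dir.rglob("*")
        if path.is_file() and path.suffix.lower() == ".wav"
    )
```

Calling count_wav_files(".") from the top-level distribution directory and
comparing the result with EXPECTED_EXAMPLES gives a rough indication of
whether the copy is complete.  A mismatch may also mean that the
one-file-per-example assumption does not hold, so consult the docs/
directory before drawing conclusions.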

Contact Information
-------------------
Further information about this corpus can be found on our web site:
<http://www.cslu.ogi.edu>.

Refer specific questions to:

- Alena Tkacova
- Linguistic Data Services Manager
- Center for Spoken Language Understanding
- Oregon Health & Science University
- email   : alca@asp.ogi.edu
- Phone   : 503 748-1600    
- FAX     : 503 748-7038
- Address : 20000 NW Walker Road
            Beaverton, OR 97006 USA

Constructive feedback about this corpus is appreciated.