Yes/No Corpus
                            Release 1.2

              Center for Spoken Language Understanding


UPDATED: 23 August 2002


Directory Structure
-------------------
This document describes the directory structure of this release. 

Following is a written description of the directory structure in 
this release:

  readme.txt	General information regarding the corpus.

  docs/		The documentation directory. This directory
		contains further documentation for the yesno
		corpus.

  labels/	Phonetic labeling directory. This directory
                contains time aligned phoneme-level
                transcriptions (automatic forced alignment).

  misc/		Miscellaneous directory, possibly containing
		software tools and scripts.

  speech/	The speech directory contains the actual 
		.wav files. There are several hundred numbered
		subdirectories within the speech directory.

  trans/	The transcriptions directory. This directory
		contains the orthographic transcription of each
		of the files.

This corpus requires approximately 578MB of disk space.

Visually, the directory structure looks something like this:

				yesno
				  |
	--------------------------------------------------------
	|          |          |          |          |          |
    README.txt   /docs     /labels     /misc     /speech    /trans

The /speech directory contains the speech data.  The files are
divided into sub-directories based on their call number.  Files with
call number 0-99 are in sub-directory "0", files with call number 100-199
are in sub-directory "1", etc.

The /trans directory contains the orthographic transcription of each
of the files.  As with the speech files, the transcription files are
divided into sub-directories based on their call number.  Files with
call number 0-99 are in sub-directory "0", files with call number 100-199
are in sub-directory "1", etc. (A file called trans.txt, containing all
of the transcriptions, is located in the /docs directory.)

Each transcription file looks like:

1,ever_married yes

The first field is a call number call type pair.  This pair uniquely
defines every transcription.  After a single space, the rest of the
characters on the line are the transcription for that file.

Set Description
---------------
The call number for each file is used to determine which set it is in.
We use the call number modulo 5 to determine which set the file
belongs to as indicated by the following table:

	Mod 5 result	set
	-------------------
	0,1,2		training
	3		development test
	4		final test

For example, "YN-1.ever_married.wav" would belong to the training set
since 1%5 = 1.