SR4X Corpus
                            Release 1.2

              Center for Spoken Language Understanding


UPDATED: 23 August 2002


Directory Structure
-------------------
This document describes the directory structure of this release. 

Following is a written description of the directory structure in 
this release:

  readme.txt	General information regarding the corpus.

  docs/		The documentation directory. This directory
		contains further documentation for the SR4X
		corpus.

  labels/	Phonetic labeling directory. This directory
                contains time aligned phoneme-level
                transcriptions (automatic forced alignment).

  misc/		Miscellaneous directory, possibly containing
		software tools and scripts.

  speech/	The speech directory contains the actual 
		.wav files. There are several numbered
		subdirectories within the speech directory.

  trans/	The transcriptions directory. This directory
		contains the automatically generated word-level
		transcription of each of the speech files.

This corpus requires approximately 483MB of disk space.

Visually, the directory structure looks something like this:

				sr4x
				  |
	--------------------------------------------------------
	|          |          |          |          |          |
    README.txt   /docs     /labels     /misc     /speech    /trans

The /speech directory contains the speech data.  The files are
divided into sub-directories based on the speaker's ID number.

The /trans directory contains the automatically generated word-level
transcriptions of each of the files.  As with the speech files, the
transcription files are divided into sub-directories based on the speaker's 
ID number. (A file called trans.txt, containing all of the transcriptions, is 
located in the /docs directory. The /docs directory also includes four
files containing groups of transcriptions divided into four categories.)

Each transcription file looks like:

SD-1234-supernova-t1-71

The first field is the prefix indicating the corpus to which this data
belongs. The second field represents a unique ID number for the speaker.
The third field contains the word spoken for this utterance. The fourth
field indicates the channel used for this utterance. The final field is
a serial number assigned during the course of the call.

The four channels used for the fourth field are:

 1 - office phone
 2 - home phone
 3 - carbon microphone telephone
 4 - speaker phone (through speaker)