                             SR4X Corpus
                            Release 1.2

              Center for Spoken Language Understanding


UPDATED: 23 August 2002


This document describes the file naming conventions used for this
distribution and gives a brief description of the various file formats
used.

File Naming Convention
----------------------
Each filename in the /speech, /trans, and /labels directories encodes
information about the call number, utterance type, and file type.  A
typical filename looks like:

	SD-1234-supernova-t1-71.wav

The first field is the prefix indicating the corpus to which this data
belongs. The second field represents a unique ID number for the speaker.
The third field contains the word spoken for this utterance. The fourth
field indicates the channel used for this utterance. The final field is
a serial number assigned during the course of the call.

The four channels used for the fourth field are:

 1 - office phone
 2 - home phone
 3 - carbon microphone telephone
 4 - speaker phone (through speaker)
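
As an illustration of the naming convention above, the following Python
sketch splits a filename into its five fields plus the extension.  It is
not part of the distribution: the names CorpusName, CHANNELS, and
parse_name are hypothetical, and it assumes that the channel number is
the trailing digit of the fourth field (the "1" in "t1").

```python
import os
from typing import NamedTuple

# Channel numbers as listed in this document.
CHANNELS = {
    1: "office phone",
    2: "home phone",
    3: "carbon microphone telephone",
    4: "speaker phone (through speaker)",
}


class CorpusName(NamedTuple):
    prefix: str      # corpus prefix, e.g. "SD"
    speaker_id: str  # unique speaker ID, kept as text
    word: str        # word spoken in this utterance
    channel: int     # 1-4, see CHANNELS
    serial: str      # serial number assigned during the call
    file_type: str   # extension without the dot, e.g. "wav"


def parse_name(filename):
    """Split a corpus filename into its fields (hypothetical helper)."""
    stem, ext = os.path.splitext(os.path.basename(filename))
    fields = stem.split("-")
    if len(fields) != 5:
        raise ValueError("expected 5 dash-separated fields: %r" % filename)
    prefix, speaker_id, word, channel_field, serial = fields
    # Assumption: the channel number is the last character of the field.
    channel = int(channel_field[-1])
    if channel not in CHANNELS:
        raise ValueError("unknown channel in %r" % filename)
    return CorpusName(prefix, speaker_id, word, channel, serial,
                      ext.lstrip("."))
```

For example, parse_name("SD-1234-supernova-t1-71.wav").channel is 1, which
CHANNELS maps to "office phone".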

The following utterance types are in this corpus:

  startrek
  supernova
  tektronix
  generation
  nebula
  processing
  singularity
  71523
  abracadabra
  sungeeta
  computer

The final three-letter extension indicates the file type.  The
following types are in this distribution:

  wav		The speech data
  txt		The text-based transcription of the speech data
  phn		The text-based time-aligned phonetic labels

File Formats
------------
The "wav" files contain speech data and use the RIFF standard file format.
The samples in these files are 16-bit linearly encoded.
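
A minimal sketch of reading one of these files with Python's standard
"wave" module is shown below.  The function name read_samples is
hypothetical, and the sketch assumes little-endian 16-bit samples, as is
usual for RIFF files; the sampling rate and channel count are read from
the file header, not assumed.

```python
import struct
import wave


def read_samples(path):
    """Return (sample_rate, n_channels, samples) for a 16-bit RIFF file."""
    with wave.open(path, "rb") as wav:
        # The corpus is described as 16-bit linear, i.e. 2 bytes per sample.
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples in %r" % path)
        rate = wav.getframerate()
        n_channels = wav.getnchannels()
        raw = wav.readframes(wav.getnframes())
    # Unpack the raw bytes as little-endian signed 16-bit integers.
    count = len(raw) // 2
    samples = list(struct.unpack("<%dh" % count, raw))
    return rate, n_channels, samples
```

Dividing the number of samples by the product of the sampling rate and
the channel count gives the duration of the utterance in seconds.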

The "trans" file in the /docs directory is a list of all of the text 
transcriptions.  Each file's transcription is on a separate line. The
transcriptions in this corpus are automatically generated word-level
transcriptions.

FOR THIS VERSION OF THE CORPUS, THE CONTENTS OF THE TRANS.TXT 
FILE HAVE BEEN EXTRACTED INDIVIDUALLY INTO THE /TRANS DIRECTORY.
The structure of the /trans directory exactly parallels that of the
/speech directory. Each file in the /trans directory is in .txt format
and contains a single line, as described in the previous paragraph, that
transcribes the corresponding sound file.
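
Because the two directory trees are parallel, the transcription for a
sound file can be located from the sound file's path.  The Python sketch
below is one way to do this; the function name transcription_path and
the example path are hypothetical, and the sketch assumes the path
contains a directory component named "speech".

```python
from pathlib import PurePosixPath


def transcription_path(speech_path):
    """Map a path under /speech to the parallel .txt path under /trans."""
    parts = list(PurePosixPath(speech_path).parts)
    if "speech" not in parts:
        raise ValueError("no 'speech' directory in %r" % speech_path)
    # Swap the first "speech" component for "trans"; keep everything else.
    parts[parts.index("speech")] = "trans"
    return str(PurePosixPath(*parts).with_suffix(".txt"))
```

For example, a path such as "/speech/SD-1234-supernova-t1-71.wav" maps to
"/trans/SD-1234-supernova-t1-71.txt".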