SR4X Corpus Release 1.2
Center for Spoken Language Understanding
UPDATED: 23 August 2002

This document describes the file naming conventions used for this distribution and gives a brief description of the various file formats used.

File Naming Convention
----------------------

Each filename in the /speech, /trans, and /labels directories encodes information about the call number, utterance type, and file type. A typical filename looks like:

    SD-1234-supernova-t1-71.wav

The first field is the prefix indicating the corpus to which this data belongs.
The second field represents a unique ID number for the speaker.
The third field contains the word spoken for this utterance.
The fourth field indicates the channel used for this utterance.
The final field is a serial number assigned during the course of the call.

The four channels used for the fourth field are:

    1 - office phone
    2 - home phone
    3 - carbon microphone telephone
    4 - speaker phone (through speaker)

The following utterance types are in this corpus:

    startrek
    supernova
    tektronix
    generation
    nebula
    processing
    singularity
    71523
    abracadabra
    sungeeta
    computer

The final, three-letter extension indicates the file type. The following types are in this distribution:

    wav    The speech data
    txt    The text-based transcription of the speech data
    phn    The text-based time-aligned phonetic labels

File Formats
------------

The "wav" files contain speech data and use the RIFF standard file format. The speech data is 16-bit linearly encoded.

The "trans" file in the /docs directory is a list of all of the text transcriptions. Each file's transcription is on a separate line. The transcriptions in this corpus are automatically generated word-level transcriptions.

FOR THIS VERSION OF THE CORPUS, THE CONTENTS OF THE TRANS.TXT FILE HAVE BEEN EXTRACTED INDIVIDUALLY INTO THE /TRANS DIRECTORY. The /trans directory file structure exactly parallels the structure of the /speech directory.
Each file in the /trans directory is in .txt format and contains one line, as described in the previous paragraph, that uniquely defines its corresponding sound file.
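
As an illustration of the naming convention described above, the following Python sketch splits a filename into its five hyphen-separated fields plus the extension. It is not part of the distribution. The names parse_filename, UtteranceFile, and CHANNELS are hypothetical, and the code assumes that the channel field is the letter "t" followed by the channel number, as in the example filename; this document does not state that rule explicitly.

```python
from pathlib import PurePosixPath
from typing import NamedTuple

# Channel numbers and descriptions as listed in this document.
CHANNELS = {
    1: "office phone",
    2: "home phone",
    3: "carbon microphone telephone",
    4: "speaker phone (through speaker)",
}


class UtteranceFile(NamedTuple):
    """Hypothetical container for the fields encoded in a filename."""
    corpus: str      # prefix indicating the corpus
    speaker_id: str  # unique ID number for the speaker
    word: str        # word spoken for this utterance
    channel: int     # channel number, 1 through 4
    serial: int      # serial number assigned during the call
    file_type: str   # extension: wav, txt, or phn


def parse_filename(name: str) -> UtteranceFile:
    """Split a name such as SD-1234-supernova-t1-71.wav into its fields."""
    path = PurePosixPath(name)
    fields = path.stem.split("-")
    if len(fields) != 5:
        raise ValueError(f"expected 5 hyphen-separated fields: {name!r}")
    corpus, speaker_id, word, channel_field, serial = fields

    # Assumption: the channel field is "t" followed by the channel number.
    if not (channel_field.startswith("t") and channel_field[1:].isdigit()):
        raise ValueError(f"unrecognized channel field: {channel_field!r}")
    channel = int(channel_field[1:])
    if channel not in CHANNELS:
        raise ValueError(f"unknown channel number: {channel}")

    return UtteranceFile(
        corpus=corpus,
        speaker_id=speaker_id,
        word=word,
        channel=channel,
        serial=int(serial),
        file_type=path.suffix.lstrip("."),
    )


info = parse_filename("SD-1234-supernova-t1-71.wav")
print(info.word, "recorded on", CHANNELS[info.channel])
```

The speaker ID is kept as a string so that any leading zeros are preserved, while the channel and serial fields are converted to integers for easier filtering and sorting.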