SR4X Corpus Release 1.2 Center for Spoken Language Understanding UPDATED: 23 August 2002 Directory Structure ------------------- This document describes the directory structure of this release. Following is a written description of the directory structure in this release: readme.txt General information regarding the corpus. docs/ The documentation directory. This directory contains further documentation for the SR4X corpus. labels/ Phonetic labeling directory. This directory contains time aligned phoneme-level transcriptions (automatic forced alignment). misc/ Miscellaneous directory, possibly containing software tools and scripts. speech/ The speech directory contains the actual .wav files. There are several numbered subdirectories within the speech directory. trans/ The transcriptions directory. This directory contains the automatically generated word-level transcription of each of the speech files. This corpus requires approximately 483MB of disk space. Visually, the directory structure looks something like this: sr4x | -------------------------------------------------------- | | | | | | README.txt /docs /labels /misc /speech /trans The /speech directory contains the speech data. The files are divided into sub-directories based on the speaker's ID number. The /trans directory contains the automatically generated word-level transcriptions of each of the files. As with the speech files, the transcription files are divided into sub-directories based on the speaker's ID number. (A file called trans.txt, containing all of the transcriptions, is located in the /docs directory. The /docs directory also includes four files containing groups of transcriptions divided into four categories.) Each transcription file looks like: SD-1234-supernova-t1-71 The first field is the prefix indicating the corpus to which this data belongs. The second field represents a unique ID number for the speaker. The third field contains the word spoken for this utterance. The fourth field indicates the channel used for this utterance. The final field is a serial number assigned during the course of the call. The four channels used for the fourth field are: 1 - office phone 2 - home phone 3 - carbon microphone telephone 4 - speaker phone (through speaker)