Portland Cellular Corpus Release 1.3 Center for Spoken Language Understanding UPDATED: 23 August 2002 Directory Structure ------------------- This document describes the directory structure of this release. Following is a written description of the directory structure in this release: readme.txt General information regarding the corpus. docs/ The documentation directory. This directory contains further documentation for the Portland Cellular corpus. labels/ Phonetic labeling directory. This directory contains phonetic labels and phonetic transcriptions for corresponding speech files. misc/ Miscellaneous directory, possibly containing software tools and scripts. speech/ The speech directory contains the actual .wav files. There are many labeled subdirectories within the speech directory. trans/ The transcriptions directory. This directory contains orthographic transcriptions for most of the speech files. This corpus requires approximately 569MB of disk space. Visually, the directory structure looks something like this: portcell | -------------------------------------------------------- | | | | | | readme.txt /docs /labels /misc /speech /trans The /speech directory contains the speech data. The files Are divided into sub-directories based on the speaker's ID number.