Kids' Speech Corpus Release Version 1.1 Center for Spoken Language Understanding UPDATED: 29 March 2002 Use of this corpus is permitted only under the conditions of the signed license agreement. Use or redistribution of this corpus outside the agreement is prohibited by law. Overview -------- The OGI Kids' Speech corpus is composed of both prompted and spontaneous speech from 1100 children from kindergarten through grade 10. This release version contains 1017 files of spontaneous speech. Corresponding word-level transcriptions are also included. Distribution Directory Structure -------------------------------- This is the release distribution of the Kids' Speech Corpus. This corpus is distributed by the Center for Spoken Language Understanding of the OGI shool at OHSU Following is a description of the directory structure in this release: readme.txt This file. docs/ The documentation directory. This directory contains a Postrscript file of the paper that describes the Kids' Speech corpus and pdf file of Labeling Guide. speech/ The speech directory contains the actual .wav files. There are many further subdirectories within the speech directory. trans/ The transcriptions directory. This directory contains the word-level transcription of each of the speech files. labels/ This directory is empty. Corpus does not contain labels verify/ The verify directory contains verification files for scripted speech with mark for quality of wave files. misc/ This directory is empty. This corpus requires approximately 12GB of disk space. Please see the /docs directory for further documentation. Contact Information ------------------- Further information about this corpus can be found our web site: . Refer specific questions to: - Center for Spoken Language Understanding - Oregon Health & Science University - email : corpora@cslu.ogi.edu - Address : 20000 NW Walker Road Beaverton, OR 97006 USA Constructive feedback about this corpus is appreciated.