VOICES Corpus Release Version 1.0 Center for Spoken Language Understanding UPDATED : 29 March 2002 Overview -------- The VOICES Corpus was created by Alexander Kain for his Ph.D. dissertation work on high resolution voice transformation. The corpus contains 12 speakers reading 50 phonetically rich sentences. The recording procedure involved a "mimicking" approach which resulted in a high degree of natural time-alignment between different speakers. The acoustic wave and the concurrent laryngograph signal were recorded for 1 "free" and 2 "mimicked" renditions of each sentence. Pitch marks, calculated from the laryngograph signal, and time marks, the output of a forced-alignment algorithm, have been added to the corpus. Distribution Directory Structure -------------------------------- This is the release distribution of the VOICES Corpus. This corpus is distributed by the Center for Spoken Language Understanding of the OGI shool at OHSU Following is a description of the directory structure in this release: readme.txt This file. docs/ The documentation directory. This directory contains a pdf file of the PhD thesis of Alex Kain that describes VOICES corpus and other text files for better orientation in corpus. misc/ This directory is empty. speech/ Wave files lar/ Concurrent laryngograph files pm/ Pitch marks derived from laryngograph signal pm2/ Manually verified and adjusted pitch marks (for selected voices/sentences/renditions) labels/ Phoneme labels as output by forced alignment trans/ The transcriptions directory. This directory contains non-time-aligned word-level transcriptions for each of the speech files. This corpus requires approximately 694MB of disk space. Please see the /docs directory for further documentation. Contact Information ------------------- Further information about this corpus can be found our web site: . Refer specific questions to: - Alena Tkacova - Linguistic Data Services Manager - Center for Spoken Language Understanding - Oregon Health & Science University - email : alca@asp.ogi.edu - Phone : 503 748-1600 - FAX : 503 748-7038 - Address : 20000 NW Walker Road Beaverton, OR 97006 USA Constructive feedback about this corpus is appreciated.