|Item Name:||CSLU: Voices|
|LDC Catalog No.:||LDC2006S01|
|Release Date:||January 19, 2006|
|Data Source(s):||microphone speech|
|Application(s):||speech recognition, speaker verification, speaker identification, speech synthesis|
|Online Documentation:||LDC2006S01 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Kain, Alexander. CSLU: Voices LDC2006S01. Web Download. Philadelphia: Linguistic Data Consortium, 2006.|
The Voices Corpus was created by Alexander Kain for his Ph.D. dissertation work on high-resolution voice transformation. The corpus contains 12 speakers reading 50 phonetically rich sentences. The recording procedure used a "mimicking" approach that resulted in a high degree of natural time-alignment between different speakers. The acoustic wave and the concurrent laryngograph signal were recorded for one "free" and two "mimicked" renditions of each sentence. Pitch marks, calculated from the laryngograph signal, and time marks, the output of a forced-alignment algorithm, have been added to the corpus.
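This description does not say how the pitch marks were computed from the laryngograph signal. As an illustration only, the following minimal Python sketch shows one common way such marks can be estimated: picking strong positive peaks in the first difference of the signal, which roughly correspond to glottal closure instants. The function name `pitch_marks`, its parameters, and the assumption that a rising signal means increasing vocal fold contact are all hypothetical and are not taken from the corpus documentation.

```python
def pitch_marks(lx, sample_rate, max_f0=500.0, rel_threshold=0.5):
    """Estimate pitch marks (as sample indices) from a laryngograph signal.

    Assumptions (illustrative, not from the corpus documentation):
    - lx is a sequence of samples where a rising value means increasing
      vocal fold contact (signal polarity varies between devices);
    - glottal closures show up as strong positive peaks in the first
      difference of the signal.
    """
    # The first difference approximates the time derivative of the signal.
    diff = [lx[i + 1] - lx[i] for i in range(len(lx) - 1)]
    if not diff:
        return []

    # Keep only peaks that reach a fraction of the largest positive slope.
    threshold = rel_threshold * max(diff)
    if threshold <= 0:
        return []

    # Two marks cannot be closer than one period of the highest allowed F0.
    min_gap = int(sample_rate / max_f0)

    marks = []
    for i in range(1, len(diff) - 1):
        is_peak = (
            diff[i] >= threshold
            and diff[i] > diff[i - 1]
            and diff[i] >= diff[i + 1]
        )
        if is_peak and (not marks or i - marks[-1] >= min_gap):
            marks.append(i)
    return marks
```

Dividing each returned index by `sample_rate` gives the mark times in seconds. The fixed relative threshold is a simplification; a real pipeline would likely filter the signal first and adapt the threshold over time.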
Samples
For an example of the data contained in this publication, please review the following samples.
- Concurrent laryngograph.
- Pitch marks derived from laryngograph signal.
- Wave file of speech.