Articulation Index LSCP, Linguistic Data Consortium (LDC) Catalog Number LDC2015S12 and ISBN 1-58563-735-1, was developed by researchers at Laboratoire de Sciences Cognitives et Psycholinguistique (LSCP), Ecole Normale Supérieure. It revises and enhances a subset of Articulation Index (AIC) (LDC2005S22), a corpus of persons speaking English syllables. Changes include the addition of forced alignment to sound files, time alignment of syllable utterances and format conversions.
AIC consists of 20 American English speakers (12 males, 8 females) pronouncing syllables, some of which form actual words, but most of which are nonsense syllables. All possible Consonant-Vowel (CV) and Vowel-Consonant (VC) combinations were recorded for each speaker twice, once in isolation and once within a carrier-sentence, for a total of 25768 recorded syllables.
Articulation Index LSCP alters AIC in the following ways.
This release does not include certain AIC triphone recordings (CVC, CCV or VCC).
Audio data is presented as 16kHz 16-bit flac compressed .wav files. The flac compression was added for distribution, and documentation may refer to the files as .wav files.
Please see file.tbl for a complete file list as well as checksums for this publication.
Additional information, updates, bug fixes may be available in the LDC catalog entry for this corpus at LDC2015S12.
Portions © 2015 Tomas Bergvelt, Anna Kolesnikov, Xuan-Nga Cao, Thomas Schatz, Emmanuel Dupoux, © 2015 Trustees of the University of Pennsylvania