Articulation Index LSCP

Item Name: Articulation Index LSCP
Author(s): Thomas Schatz, Xuan-Nga Cao, Anna Kolesnikova, Tomas Bergvelt, Jonathan Wright, Emmanuel Dupoux
LDC Catalog No.: LDC2015S12
ISBN: 1-58563-735-1
ISLRN: 607-221-014-735-8
Release Date: November 16, 2015
Member Year(s): 2015
DCMI Type(s): Sound
Sample Type: pcm
Sample Rate: 1600
Data Source(s): microphone speech
Application(s): language identification, language modeling, parsing, pronunciation modeling
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2015S12 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Schatz, Thomas, et al. Articulation Index LSCP LDC2015S12. Web Download. Philadelphia: Linguistic Data Consortium, 2015.

Introduction

Articulation Index LSCP was developed by researchers at Laboratoire de Sciences Cognitives et Psycholinguistique (LSCP), Ecole Normale Supérieure. It revises and enhances a subset of Articulation Index (AIC) (LDC2005S22), a corpus of persons speaking English syllables. Changes include the addition of forced alignment to sound files, time alignment of syllable utterances and format conversions.

AIC consists of 20 American English speakers (12 males, 8 females) pronouncing syllables, some of which form actual words, but most of which are nonsense syllables. All possible Consonant-Vowel (CV) and Vowel-Consonant (VC) combinations were recorded for each speaker twice, once in isolation and once within a carrier-sentence, for a total of 25768 recorded syllables.

Data

Articulation Index LSCP alters AIC in the following ways.

  1. Time-alignments for the onset and offset of each word and syllable were generated through forced-alignment with a standard HMM-GMM (Hidden Markov Model-Gaussian Mixture Model) ASR system.
  2. The time-alignments for the beginning and end of the syllables (whether in isolation or within a carrier sentence) were manually adjusted. The time-alignments for the other words in carrier sentences were not manually adjusted.
  3. The recordings of isolated syllables were cut according to the manual time-alignments to remove the silent portions at the beginning and end, and the time-alignments were altered to correspond to the cut recordings.
  4. The file naming scheme was slightly altered for compatibility with the Kaldi speech recognition toolkit.
  5. AIC contains a wide-band (16 KHz, 16-bit PCM) and a narrow-band (8 KHz, 8 bit u-law) version of the recordings distributed in sphere format. The LSCP version contains the wide-band version only distributed as wave files.

This release does not include certain AIC triphone recordings (CVC, CCV or VCC).

Audio data is presented as 16kHz 16-bit flac compressed .wav files. The flac compression was added for distribution, and documentation may refer to the files as .wav files.

Samples

Please listen to this audio sample.

Updates

None at this time.

Available Media

View Fees





Login for the applicable fee