National Cellular Corpus Release 2.3
Center for Spoken Language Understanding

Release Version 2.3 (22 September 2002)
---------------------------------------
We have added more orthographic transcriptions and created time-aligned phoneme transcriptions.

Release Version 2.2 (28 January 2002)
-------------------------------------
This release includes all speech files (2337 speakers), of which 1996 speakers are transcribed at the word level.

Release Version 2.1 (22 August 2000)
------------------------------------
Several changes distinguish this version of the National Cellular Corpus from version 2.0:

* Speech files in the /speech directory have been converted from NIST format to RIFF format.
* Individual transcription files have been extracted from the data.txt file and placed in the /trans directory, whose structure exactly parallels that of the /speech directory.
* The documentation has been updated so that it accurately reflects the corpus contents.

These and other changes have been made to make the corpus more useful to the end user.

Release Version 2.0
-------------------
Second release. Contains speech from 1350 speakers (60% male, 40% female) in nine cities.

Release Version 1.0
-------------------
First release. Contains speech from 676 speakers (53% male, 47% female) in four cities.