ATIS0 Pilot
Item Name: | ATIS0 Pilot |
Author(s): | Charles T. Hemphill, John J. Godfrey, George R. Doddington, John S. Garofolo, Jonathan G. Fiscus, Nancy Dahlgren, William Fisher, Brett Tjaden, David Pallett |
LDC Catalog No.: | LDC93S4B |
ISBN: | 1-58563-002-0 |
ISLRN: | 477-521-980-972-9 |
DOI: | https://doi.org/10.35111/4t8c-r397 |
Member Year(s): | 1993 |
DCMI Type(s): | Sound |
Sample Type: | 1-channel pcm |
Sample Rate: | 16000 |
Data Source(s): | microphone speech |
Project(s): | ATIS |
Application(s): | speech recognition, spoken dialogue systems |
Language(s): | English |
Language ID(s): | eng |
License(s): |
LDC User Agreement for Non-Members |
Online Documentation: | LDC93S4B Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Hemphill, Charles T., et al. ATIS0 Pilot LDC93S4B. Web Download. Philadelphia: Linguistic Data Consortium, 1993. |
Related Works: | View |
Introduction
ATIS0 Pilot is comprised of about four hours of spontaneous speech, read speech and other material from participants in the ATIS collection. Other corpora in the collection are: ATIS0 Read (LDC93S4B-2) and ATIS0 SD-Read (LDC93S4B-3). ATIS0 Complete (LDC93S4A) contains all three corpora.
The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute for Standards and Technology and SRI International.
ATIS0 Pilot contains spontaneous utterances elicited in a "Wizard-of-Oz" simulation, along with a relational database containing travel information (excluding connecting flights). In that data set, 36 speakers (24 male, 12 female) produced a total of 912 utterances.
Data
ATIS speech data was recorded at 16kHz sample rate, 16-bit quantization, from two microphones: a close-talking (Sennheiser HMD414) and a desk-top (Crown PCC-160) model. Utterances were transcribed.
Samples
Please view this audio sample (wav) and transcript sample (txt).
Updates
None at this time.