ATIS2

Item Name: ATIS2
Author(s): John S. Garofolo, Jonathan G. Fiscus, Kate Hunicke-Smith, Denise Danielson, Elizabeth Shriberg, Enrico Bocchieri, Bruce Buntschuh, Beverly Schwartz, Sandra Peters, Robert Ingria, Robert Weide, Yuzong Chang, Eric Thayer, Lynette Hirschman, Joe Polifroni, Bruce Lund, Goh Kawai, Tom Kuhn, Lew Norton, Deborah Dahl, Madeleine Bates, Michael Brown, Alexander Rudnicky, David Pallett
LDC Catalog No.: LDC93S5
ISBN: 1-58563-005-5
ISLRN: 387-394-427-128-0
DOI: https://doi.org/10.35111/2ktf-2q33
Member Year(s): 1993
DCMI Type(s): Sound
Sample Type: 1-channel pcm compressed
Sample Rate: 16000
Data Source(s): microphone speech
Project(s): ATIS
Application(s): speech recognition, spoken dialogue systems
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC93S5 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Garofolo, John S., et al. ATIS2 LDC93S5. Web Download. Philadelphia: Linguistic Data Consortium, 1993.
Related Works: View

Introduction

ATIS2 contains approximately 34 hours of audio, including 15,000 utterances from 450 participants in the ATIS (Air Travel Information Services) collection. About half of the participants were male and half female. The ATIS collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems.

Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute for Standards and Technology and SRI International.

Data

Audio files are presented as single channel, 16-bit, 16 kHz sphere files. All utterances were recorded and transcribed and approximately 10,000 were annotated with categorizations and canonical reference answers.

For ATIS2, the ATIS0 ten-city relational database was revised to accommodate connecting flights and fares and some table headings were renamed.

This release includes training data and the February 1992 and November 1992 ATIS benchmark tests. Each set contains approximately 1,000 utterances from the pool of data collected by the five sites.

Samples

Audio Sample

Updates

None at this time.

 

Available Media

View Fees





Login for the applicable fee