ATIS2

Item Name: ATIS2
Author(s): John S. Garofolo, Jonathan G. Fiscus, Kate Hunicke-Smith, Denise Danielson, Elizabeth Shriberg, Enrico Bocchieri, Bruce Buntschuh, Beverly Schwartz, Sandra Peters, Robert Ingria, Robert Weide, Yuzong Chang, Eric Thayer, Lynette Hirschman, Joe Polifroni, Bruce Lund, Goh Kawai, Tom Kuhn, Lew Norton, Deborah Dahl, Madeleine Bates, Michael Brown, Alexander Rudnicky, David Pallett
LDC Catalog No.: LDC93S5
ISBN: 1-58563-005-5
ISLRN: 387-394-427-128-0
Member Year(s): 1993
DCMI Type(s): Sound
Sample Type: 1-channel pcm compressed
Sample Rate: 16000
Data Source(s): microphone speech
Project(s): ATIS
Application(s): spoken dialogue systems, speech recognition
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC93S5 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Garofolo, John S., et al. ATIS2 LDC93S5. Web Download. Philadelphia: Linguistic Data Consortium, 1993.

Introduction

ATIS2 contains approximately 15,000 utterances from 450 participants in the ATIS (Air Travel Information Services) collection. The ATIS collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems.

Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute of Standards and Technology, and SRI International.

Data

All utterances were recorded and transcribed, and approximately 10,000 were annotated with categorizations and canonical reference answers.

For ATIS2, the ATIS0 ten-city relational database was revised to accommodate connecting flights and fares, and some table headings were renamed.

This release includes training data and the February 1992 and November 1992 ATIS benchmark tests. Each benchmark test set contains approximately 1,000 utterances from the pool of data collected by the five sites.

Update

This publication has been condensed from four CD-ROM discs to a single web download.
