ATIS0 Read

Item Name:	ATIS0 Read
Author(s):	Charles T. Hemphill, John J. Godfrey, George R. Doddington, John S. Garofolo, Jonathan G. Fiscus, Nancy Dahlgren, William Fisher, Brett Tjaden, David Pallett
LDC Catalog No.:	LDC93S4B-2
ISBN:	1-58563-003-9
ISLRN:	470-709-845-333-7
DOI:	https://doi.org/10.35111/n7da-xk25
Member Year(s):	1993
DCMI Type(s):	Sound
Sample Type:	1-channel pcm
Sample Rate:	16000
Data Source(s):	microphone speech
Project(s):	ATIS
Application(s):	speech recognition, spoken dialogue systems
Language(s):	English
Language ID(s):	eng
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC93S4B-2 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Hemphill, Charles T., et al. ATIS0 Read LDC93S4B-2. Web Download. Philadelphia: Linguistic Data Consortium, 1993.
Related Works:
	isPartOf:	LDC93S4A ATIS0 Complete
	isPartWith:	LDC93S4B ATIS0 Pilot; LDC93S4B-3 ATIS0 SD Read
	isContinuationOf:	LDC93S4B ATIS0 Pilot
	hasContinuation:	LDC93S4B-3 ATIS0 SD Read; LDC93S5 ATIS2; LDC94S19 ATIS3 Training Data; LDC95S26 ATIS3 Test Data
	relatesTo:	LDC2019T04 Multilingual ATIS

Introduction

ATIS0 Read comprises about five hours of spontaneous speech, read speech and other material from participants in the ATIS collection. The other corpora in the collection are ATIS0 Pilot (LDC93S4B) and ATIS0 SD-Read (LDC93S4B-3); ATIS0 Complete (LDC93S4A) contains all three corpora.

The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute of Standards and Technology and SRI International.

ATIS0 Read contains "read" versions of the spontaneous utterances for 20 of the 36 speakers who produced them (11 male, 9 female), for a total of 478 productions. These are supplemented by a set of 40 "adaptation" sentences read by each of the 20 speakers.

Data

ATIS speech data was recorded at a 16 kHz sample rate with 16-bit quantization from two microphones: a close-talking model (Sennheiser HMD414) and a desk-top model (Crown PCC-160). Utterances were transcribed.
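
As a rough illustration of what these recording parameters imply for storage, the sketch below converts between audio duration and payload size for 16 kHz, 16-bit, 1-channel PCM. It assumes uncompressed sample data and ignores any file headers or compression the distributed files may use; the function names are hypothetical and not part of any LDC tooling.

```python
# Recording parameters taken from the catalog entry above.
SAMPLE_RATE_HZ = 16_000   # 16 kHz sample rate
BYTES_PER_SAMPLE = 2      # 16-bit quantization
CHANNELS = 1              # 1-channel PCM

# Bytes of raw sample data produced per second of audio.
BYTES_PER_SECOND = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS


def pcm_duration_seconds(num_bytes: int) -> float:
    """Duration of a raw PCM payload (assumption: no header, no compression)."""
    return num_bytes / BYTES_PER_SECOND


def pcm_size_bytes(seconds: float) -> int:
    """Raw PCM payload size for a given duration, under the same assumption."""
    return int(seconds * BYTES_PER_SECOND)


if __name__ == "__main__":
    five_hours = 5 * 3600
    print(pcm_size_bytes(five_hours))       # raw bytes for five hours of one channel
    print(pcm_duration_seconds(32_000))     # seconds represented by 32,000 bytes
```

Under these assumptions one second of single-channel audio occupies 32,000 bytes, so five hours of one channel is on the order of 576 MB of raw samples; actual download sizes may differ.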
