CALLHOME Egyptian Arabic Speech Supplement


Item Name: CALLHOME Egyptian Arabic Speech Supplement
Authors: .
LDC Catalog No.: LDC2002S37
ISBN: 1-58563-243-0
Release Date: Aug 06, 2002
Data Type: speech
Sample Rate: 8000 Hz
Sampling Format: 2-channel ulaw
Data Source(s): telephone conversations
Project(s): EARS, GALE, Hub5-LVCSR
Application(s): speech recognition
Language(s): Egyptian Arabic
Language ID(s): arz
Distribution: 1 CD
Member fee: $0 for 2002 members
Non-member Fee: US $1000.00
Reduced-License Fee: US $500.00
Extra-Copy Fee: US $150.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: .
2002
CALLHOME Egyptian Arabic Speech Supplement
Linguistic Data Consortium, Philadelphia

Introduction

The CALLHOME Egyptian Arabic Speech Supplement was produced by the Linguistic Data Consortium (LDC) under catalog number LDC2002S37 and ISBN 1-58563-243-0.

This publication contains 20 CALLHOME Egyptian Arabic telephone conversations. The corresponding transcripts are published as CALLHOME Egyptian Arabic Transcripts Supplement, LDC catalog number LDC2002T38. These conversations were originally held in reserve for future NIST HUB5 Non-English evaluations, but are being "re-tasked" to provide additional material for general use.

Data

There are 20 data files in NIST SPHERE format. The files are 8 kHz, shorten-compressed, two-channel mu-law (ulaw). Twelve of the files were recorded from domestic phone calls (both parties living in the continental U.S.), while the other eight are overseas calls (a participant in the U.S. called a friend or relative in Egypt or some other overseas country).
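
To check that a file matches the format described above, the text header at the start of a SPHERE file can be inspected. The following is a minimal sketch, assuming the usual SPHERE layout: a "NIST_1A" line, a line giving the header size in bytes, then "name -type value" fields up to "end_head". The function name parse_sphere_header and the example header bytes are hypothetical and made up for illustration; they are not copied from this corpus. The sketch reads the header only and does not undo the shorten compression of the audio.

```python
def parse_sphere_header(raw: bytes) -> dict:
    """Parse the ASCII header found at the start of a NIST SPHERE file.

    `raw` must contain at least the full header (commonly the first 1024 bytes).
    """
    first, second, _rest = raw.split(b"\n", 2)
    if first.strip() != b"NIST_1A":
        raise ValueError("not a NIST SPHERE header")
    header_size = int(second.strip())  # size of the whole header in bytes

    text = raw[:header_size].decode("ascii", errors="replace")
    fields = {}
    for line in text.split("\n")[2:]:
        line = line.strip()
        if line == "end_head":
            break
        parts = line.split(None, 2)
        if len(parts) != 3 or line.startswith(";"):
            continue  # skip blank, comment, or malformed lines
        name, ftype, value = parts
        if ftype == "-i":        # integer field
            fields[name] = int(value)
        elif ftype == "-r":      # real-valued field
            fields[name] = float(value)
        else:                    # string field, e.g. -s27
            fields[name] = value
    return fields


# Illustrative header only (assumed values, not taken from the corpus files).
example = (
    b"NIST_1A\n   1024\n"
    b"channel_count -i 2\n"
    b"sample_rate -i 8000\n"
    b"sample_coding -s27 ulaw,embedded-shorten-v2.00\n"
    b"end_head\n"
).ljust(1024)

fields = parse_sphere_header(example)
print(fields)
```

For a real file, the same function could be applied to the first bytes read from disk in binary mode; a two-channel, 8000 Hz, ulaw result would agree with the description in this section.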

The audio data total 273,681,144 bytes (261 Mbytes), or eight hours of speech.
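
The size figures can be cross-checked with a little arithmetic. The sketch below assumes that the byte count refers to the shorten-compressed files as distributed (the text does not say so explicitly), that the eight-hour figure is exact, and that each mu-law sample occupies one byte per channel. All variable names are illustrative.

```python
# Figures taken from the description above.
TOTAL_BYTES = 273_681_144   # stated size of the audio data
HOURS = 8                   # stated duration
SAMPLE_RATE = 8000          # samples per second per channel
CHANNELS = 2
BYTES_PER_SAMPLE = 1        # mu-law stores one 8-bit byte per sample

# "261 Mbytes" corresponds to units of 2**20 bytes.
mbytes = TOTAL_BYTES / 2**20

# Size the same audio would have with no compression (assumption: exactly 8 hours).
uncompressed = HOURS * 3600 * SAMPLE_RATE * CHANNELS * BYTES_PER_SAMPLE

# Fraction of the uncompressed size, if TOTAL_BYTES is the compressed size.
ratio = TOTAL_BYTES / uncompressed

print(f"{mbytes:.1f} Mbytes, {uncompressed:,} bytes uncompressed, ratio {ratio:.2f}")
```

Under these assumptions the stated byte count is well below the uncompressed size of eight hours of two-channel audio, which is consistent with the figure describing the compressed files.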

Updates

There are no updates at this time.

Content Copyright

Portions © 1996-2002 Trustees of the University of Pennsylvania.