Arabic Broadcast News Speech
============================

This data set consists of eight audio files recorded by the Linguistic Data Consortium (LDC) from Voice of America (VOA) satellite radio news broadcasts in Arabic. The recordings were made at time of transmission between June 2000 and January 2001. This work was sponsored in part by National Science Foundation Grant No. IIS-9982201.

The recordings, captured from a dedicated satellite receiver, are stored as 16-bit PCM, 16-kHz, single-channel, in NIST SPHERE format. The duration of each recording is either 60 minutes or 120 minutes, depending on the VOA broadcast schedule; the date (YYYYMMDD), start-time and end-time (HHMM EST) for each recording are indicated in the file names. The sample data are not compressed.

Transcripts for these recordings are available as a separate corpus from the LDC: Arabic Broadcast News Transcripts, LDC2006T20.

ldc@ldc.upenn.edu
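
As a rough sanity check on the stated audio format (16-bit PCM, 16 kHz, single channel, uncompressed), the expected size of the sample data in a 60-minute or 120-minute recording can be estimated as below. This is only a sketch: the function name `expected_sample_bytes` is hypothetical, the figure excludes the SPHERE header, and it assumes a recording runs for exactly its nominal duration, which actual files may not.

```python
# Sketch: expected uncompressed sample-data size, derived from the stated format.
SAMPLE_RATE_HZ = 16_000   # 16-kHz sampling rate
BYTES_PER_SAMPLE = 2      # 16-bit PCM
CHANNELS = 1              # single-channel


def expected_sample_bytes(duration_minutes: int) -> int:
    """Return the sample-data size in bytes, excluding the SPHERE header.

    Assumes the recording lasts exactly `duration_minutes` (an idealization).
    """
    seconds = duration_minutes * 60
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS * seconds


if __name__ == "__main__":
    # The two nominal durations mentioned in the description.
    for minutes in (60, 120):
        size = expected_sample_bytes(minutes)
        print(f"{minutes:3d} min: {size:,} bytes (~{size / 1e6:.1f} MB)")
```

Under these assumptions a 60-minute recording holds about 115.2 MB of sample data and a 120-minute recording about 230.4 MB, so a file that is far smaller or larger than this may be truncated or in a different format.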