Voice of America (VOA) Czech Broadcast News Audio

Item Name: Voice of America (VOA) Czech Broadcast News Audio
Author(s): David Graff
LDC Catalog No.: LDC2000S89
ISBN: 1-58563-179-5
ISLRN: 748-783-667-076-9
Member Year(s): 2000
DCMI Type(s): Sound
Data Source(s): broadcast news
Application(s): speech recognition
Language(s): Czech
Language ID(s): ces
Online Documentation: LDC2000S89 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Graff, David. Voice of America (VOA) Czech Broadcast News Audio LDC2000S89. DVD. Philadelphia: Linguistic Data Consortium, 2000.

Introduction

Voice of America (VOA) Czech Broadcast News Audio was developed by the Linguistic Data Consortium (LDC). Corresponding transcripts are contained in Voice of America (VOA) Czech Broadcast News Transcripts (LDC2000T53), the documentation for which is included with this release.

Data

Between February 9 and May 28, 1999, LDC collected approximately 30 hours of Czech broadcast audio from the Voice of America news service. The 62 data files presented in this corpus represent the audio of the daily broadcasts of 30-minute news programs.

Due to technical limitations in the hardware at LDC that was used to receive the VOA broadcasts via a satellite downlink, a number of files contain brief portions where the audio signal was interrupted. These interruptions typically yielded regions of complete silence that lasted less than two seconds and were scattered sparsely throughout an affected audio file. Additional markup was provided in the transcription texts to isolate the regions where these interruptions occurred.

The 62 audio files in this corpus are single-channel, 16 KHz, 16-bit linear SPHERE files.

Samples

For an example of the data in this corpus, please review this audio sample.

Updates

There are no updates at this time.

Available Media

View Fees





Login for the applicable fee