1997 HUB4 Broadcast News Evaluation Non-English Test Material
Item Name: | 1997 HUB4 Broadcast News Evaluation Non-English Test Material |
Author(s): | Jonathan G. Fiscus, John S. Garofolo, Mark Przybocki, William Fisher, David Pallett |
LDC Catalog No.: | LDC2001S91 |
ISBN: | 1-58563-182-5 |
ISLRN: | 639-420-515-411-0 |
DOI: | https://doi.org/10.35111/4b0m-6182 |
Member Year(s): | 2001 |
DCMI Type(s): | Sound |
Data Source(s): | broadcast news |
Project(s): | EARS, GALE, Hub4 |
Application(s): | speech recognition |
Language(s): | Spanish, Mandarin Chinese |
Language ID(s): | spa, cmn |
Online Documentation: | LDC2001S91 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Fiscus, Jonathan G., et al. 1997 HUB4 Broadcast News Evaluation Non-English Test Material LDC2001S91. Web Download. Philadelphia: Linguistic Data Consortium, 2001. |
Related Works: | View |
Introduction
1997 HUB4 Broadcast News Evaulation Non-English Test Material was developled by the Linguistic Data Consortium. It contains the evaluation test material used in the 1997 DARPA/NIST Continuous Speech Recognition Broadcast News HUB4 Non-English Benchmark Test administered by the NIST Spoken Natural Language Processing Group.
Data
The test material is contained in two SPHERE-formatted waveform files. The file h4ne97sp.sph (set1) contains one hour of Spanish broadcast news excerpts from 1997. The file h4ne97ma.sph (set2) contains one hour of Mandarin broadcast news excerpts from 1997. Each file should be separately recognized per the HUB4 Non English Evaluation Specification.
Note: 1997 HUB4 English evaluation material is contained in 1997 HUB4 English Evaluation Speech and Transcripts LDC2002S11.
Updates
There are no updates at this time.
Additional Licensing Instructions
This 'members-only' corpora is available to current members who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.