1998 HUB4 Broadcast News Evaluation English Test Material
Item Name: | 1998 HUB4 Broadcast News Evaluation English Test Material |
Author(s): | Linguistic Data Consortium |
LDC Catalog No.: | LDC2000S86 |
ISBN: | 1-58563-172-8 |
ISLRN: | 786-335-176-662-7 |
DOI: | https://doi.org/10.35111/j4qt-7y88 |
Member Year(s): | 2000 |
DCMI Type(s): | Sound |
Data Source(s): | broadcast news |
Project(s): | Hub4 |
Application(s): | speech recognition |
Language(s): | English |
Language ID(s): | eng |
License(s): | USC Marketplace Agreement |
Online Documentation: | LDC2000S86 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Linguistic Data Consortium. 1998 HUB4 Broadcast News Evaluation English Test Material LDC2000S86. Web Download. Philadelphia: Linguistic Data Consortium, 2000. |
Introduction
This publication contains the evaluation test material used in the 1998 DARPA/NIST Continuous Speech Recognition Broadcast News HUB4 English Benchmark Test. The test was administered by the NIST Spoken Natural Language Processing Group, and the material was produced by the Linguistic Data Consortium (LDC). It is published as catalog number LDC2000S86, ISBN 1-58563-172-8.
Data
The test material is contained in two SPHERE-formatted waveform files. The file h4e_98_1.sph (set1) contains 1.5 hours of Broadcast News excerpts from 1996. The file h4e_98_2.sph (set2) contains 1.5 hours of Broadcast News excerpts from 1998. Each file should be recognized separately, per the HUB4 English Evaluation Specification.
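Because both files use the NIST SPHERE format, their ASCII headers can be inspected before any audio is decoded. The following Python sketch assumes the usual SPHERE layout: a "NIST_1A" magic line, a header-size line, then "name -type value" fields ending with "end_head". The function names are hypothetical, and the field values in the sample header are illustrative only; this entry does not state the sample rate or sample count of the actual files.

```python
def parse_sphere_header(raw: bytes) -> dict:
    """Parse the ASCII header found at the start of a NIST SPHERE file."""
    lines = raw.decode("ascii", errors="replace").split("\n")
    if lines[0].strip() != "NIST_1A":
        raise ValueError("not a NIST SPHERE header")
    # The second line holds the total header size in bytes.
    fields = {"header_size": int(lines[1].strip())}
    for line in lines[2:]:
        line = line.strip()
        if line == "end_head":
            break
        if not line or line.startswith(";"):
            continue  # skip blank lines and comments
        name, ftype, value = line.split(None, 2)
        if ftype == "-i":
            fields[name] = int(value)
        elif ftype == "-r":
            fields[name] = float(value)
        else:
            fields[name] = value  # "-sN": string of length N
    return fields


def read_sphere_header(path: str) -> dict:
    """Read only the header bytes of a SPHERE file and parse them."""
    with open(path, "rb") as f:
        prefix = f.read(16)  # magic line plus header-size line
        size = int(prefix.split(b"\n")[1].strip())
        f.seek(0)
        return parse_sphere_header(f.read(size))


def duration_seconds(fields: dict) -> float:
    """Duration implied by the sample_count and sample_rate fields."""
    return fields["sample_count"] / fields["sample_rate"]


# Illustrative header only; these values are assumptions, not taken
# from h4e_98_1.sph or h4e_98_2.sph.
SAMPLE_HEADER = (
    b"NIST_1A\n   1024\n"
    b"sample_rate -i 16000\n"
    b"channel_count -i 1\n"
    b"sample_count -i 86400000\n"
    b"sample_coding -s3 pcm\n"
    b"end_head\n"
)
```

With the files on disk, a call such as read_sphere_header("h4e_98_1.sph") would return the real header fields, which can then be checked against the stated 1.5-hour length of each set.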
Additional Licensing Instructions
This 'members-only' corpus is available to current members, who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.