1998 HUB4 Broadcast News Evaluation English Test Material


Item Name: 1998 HUB4 Broadcast News Evaluation English Test Material
Authors: .
LDC Catalog No.: LDC2000S86
ISBN: 1-58563-172-8
Data Type: speech
Data Source(s): broadcast news
Project(s): Hub4
Application(s): speech recognition
Language(s): English
Language ID(s): eng
Distribution: 1 CD
Member fee: $0 for 2000 members
Non-member Fee: N/A (Members Only)
Reduced-License Fee: N/A
Extra-Copy Fee: US $150.00
Member License: yes
Online documentation: yes
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: .
2000
1998 HUB4 Broadcast News Evaluation English Test Material
Linguistic Data Consortium, Philadelphia

Introduction

This publication contains the evaluation test material used in the 1998 DARPA/NIST Continuous Speech Recognition Broadcast News HUB4 English Benchmark Test administered by the NIST Spoken Natural Language Processing Group and produced by the Linguistic Data Consortium (LDC), catalog number LDC2000S86, ISBN 1-58563-172-8.

Data

The test material is contained in two SPHERE-formatted waveform files. The file h4e_98_1.sph (set1) contains 1.5 hours of Broadcast News excerpts from 1996. The file h4e_98_2.sph (set2) contains 1.5 hours of Broadcast News excerpts from 1998. Each file should be separately recognized per the HUB4 English Evaluation Specification.