2004 Spring NIST Rich Transcription (RT-04S) Evaluation Data

Item Name:	2004 Spring NIST Rich Transcription (RT-04S) Evaluation Data
Author(s):	Jonathan G. Fiscus, John S. Garofolo, Audrey Le, Alvin Martin, Greg Sanders, Mark Przybocki, David Pallett
LDC Catalog No.:	LDC2007S12
ISBN:	1-58563-448-4
ISLRN:	581-401-882-415-9
DOI:	https://doi.org/10.35111/xc4g-he80
Release Date:	October 17, 2007
Member Year(s):	2007
DCMI Type(s):	Sound
Data Source(s):	meeting speech, microphone conversation
Project(s):	RT
Application(s):	diarization, discourse analysis, metadata extraction, speaker identification, speaker verification, speech recognition
Language(s):	English
Language ID(s):	eng
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC2007S12 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Fiscus, Jonathan G., et al. 2004 Spring NIST Rich Transcription (RT-04S) Evaluation Data LDC2007S12. Web Download. Philadelphia: Linguistic Data Consortium, 2007.
Related Works:
  hasAnnotation:
    LDC2019S09 First DIHARD Challenge Development - Eight Sources
    LDC2021S10 Second DIHARD Challenge Development - Eleven Sources
    LDC2022S12 Third DIHARD Challenge Development
  isOutcomeOf:
    LDC2004S02 ICSI Meeting Speech
    LDC2004S05 ISL Meeting Speech Part 1
    LDC2004S09 NIST Meeting Pilot Corpus Speech
    LDC2004T04 ICSI Meeting Transcripts
    LDC2004T10 ISL Meeting Transcripts Part 1
    LDC2004T13 NIST Meeting Pilot Corpus Transcripts and Metadata
  isContinuationOf:
    LDC2007S11 2004 Spring NIST Rich Transcription (RT-04S) Development Data
  isSimilarWith:
    LDC2004S11 2002 Rich Transcription Broadcast News and Conversational Telephone Speech
    LDC2007S10 2003 NIST Rich Transcription Evaluation Data
    LDC2011S06 2005 Spring NIST Rich Transcription (RT-05S) Evaluation Set

Introduction

2004 Spring NIST Rich Transcription (RT-04S) Evaluation Data contains the test material (meeting speech and reference transcripts) used in the RT-04S evaluation administered by the NIST (National Institute of Standards and Technology) Speech Group. Rich Transcription (RT) is broadly defined as a fusion of speech-to-text and metadata extraction technologies, intended as the basis for generating more usable transcriptions of human-human meeting speech.

The data in this release consists of portions of meeting speech collected and/or transcribed by the International Computer Science Institute (ICSI) at Berkeley, the Interactive Systems Laboratories (ISL) at Carnegie Mellon University, NIST and LDC. The complete meeting speech and corresponding transcript data sets are available from LDC's catalog as follows: ICSI Meeting Speech (LDC2004S02), ICSI Meeting Transcripts (LDC2004T04), ISL Meeting Speech Part 1 (LDC2004S05), ISL Meeting Transcripts Part 1 (LDC2004T10), NIST Meeting Pilot Corpus Speech (LDC2004S09) and NIST Meeting Pilot Corpus Transcripts and Metadata (LDC2004T13).

RT-04S included the following tasks in the meeting domain:

Speech-to-Text Transcription (STT) tasks

Microphone conditions:

Multiple distant microphones
Single distant microphone
Individual head microphone

Processing time conditions:

Unlimited time STT
Less than or equal to twenty times realtime
Less than or equal to ten times realtime
Less than or equal to one times realtime

Diarization (SPKR) task ("who spoke when")

Microphone conditions:

Multiple distant microphones
Single distant microphone

Input conditions:

Speech input only
Speech plus reference transcript input

Processing time conditions:

Unlimited time
Less than or equal to twenty times realtime
Less than or equal to ten times realtime
Less than or equal to one times realtime
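
The processing time conditions listed for the STT and diarization tasks are expressed as multiples of realtime. As a rough illustration only, the sketch below assumes that the realtime factor is the total processing time divided by the duration of the audio processed; this page does not state how NIST measured it, and the function names and condition labels are hypothetical.

```python
def realtime_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Assumed definition: processing time divided by audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def processing_time_condition(rtf: float) -> str:
    """Return the tightest listed condition that a realtime factor satisfies.

    The labels are hypothetical shorthand for the conditions on this page.
    """
    limits = ((1.0, "<=1xRT"), (10.0, "<=10xRT"), (20.0, "<=20xRT"))
    for limit, label in limits:
        if rtf <= limit:
            return label
    # Anything slower than twenty times realtime only fits the unlimited condition.
    return "unlimited"


if __name__ == "__main__":
    # Hypothetical run: 20 minutes of processing for a 10-minute excerpt.
    rtf = realtime_factor(processing_seconds=1200.0, audio_seconds=600.0)
    print(rtf, processing_time_condition(rtf))
```

Under this assumed definition, a system that needs 20 minutes to process a 10-minute excerpt runs at two times realtime and so meets the ten times realtime condition but not the one times realtime condition.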

Further information about the evaluation is available on the RT-04 Spring Evaluation Website.

Samples

For an example of the data in this corpus, please review this audio sample.
