Resource Management RM1 2.0
Item Name: | Resource Management RM1 2.0 |
Author(s): | P Price, W M. Fisher, Jared Bernstein, D S. Pallett |
LDC Catalog No.: | LDC93S3B |
ISBN: | 1-58563-221-X |
ISLRN: | 925-711-891-661-2 |
DOI: | https://doi.org/10.35111/9c7j-rb08 |
Member Year(s): | 1993, 1996 |
DCMI Type(s): | Sound |
Sample Type: | 1-channel pcm |
Sample Rate: | 16000 |
Data Source(s): | microphone speech |
Language(s): | English |
Language ID(s): | eng |
License(s): |
LDC User Agreement for Non-Members |
Online Documentation: | LDC93S3B Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Price, P, et al. Resource Management RM1 2.0 LDC93S3B. Web Download. Philadelphia: Linguistic Data Consortium, 1993. |
Related Works: | View |
Introduction
Resource Management RM1 2.0 was developed by NIST and consists of approximately 20 hours of English speech along with transcriptions. All RM material consists of read sentences modeled after a naval resource management task. There are two main parts, often referred to as RM1 and RM2. RM1 contains three sections, Speaker-Dependent (SD) training data, Speaker-Independent (SI) training data and test and evaluation data. Resource Management Complete Set 2.0 (LDC93S3A) contains both RM1 and RM2.
Data
The material was recorded at 16KHz, with 16-bit resolution, using a Sennheiser HMD-414 headset microphone.
The Speaker-Dependent (SD) Training Data contains 12 subjects (seven male and five female), each reading a set of 600 "training sentences," two "dialect" sentences and ten "rapid adaptation" sentences, for a total of 7,344 recorded sentence utterances. The 600 sentences designated as training cover 97 of the lexical items in the corpus.
The Speaker-Independent (SI) Training Data contains 80 speakers (55 male and 25 female), each reading two "dialect" sentences plus 40 sentences from the Resource Management text corpus, for a total of 3,360 recorded sentence utterances. Any given sentence from a set of 1,600 Resource Management sentence texts was recorded by two subjects, while no sentence was read twice by the same subject.
RM1 contains all SD and SI system test material used in five DARPA benchmark tests conducted in March and October of 1987, June 1988, and February and October 1989, along with scoring and diagnostic software and documentation for those tests. Documentation is also provided outlining use of the Resource Management training and test material at CMU in development of the SPHINX system. Example output and scored results for state-of-the-art speaker-dependent and speaker-independent systems (i.e. the BBN BYBLOS and CMU SPHINX systems) for the October 1989 benchmark tests are included.
Samples
Updates
None at this time.