ISL Meeting Speech Part 1
Item Name: | ISL Meeting Speech Part 1 |
Author(s): | Susanne Burger, Victoria MacLaren, Alex Waibel |
LDC Catalog No.: | LDC2004S05 |
ISBN: | 1-58563-294-5 |
ISLRN: | 459-840-211-562-6 |
DOI: | https://doi.org/10.35111/64zw-4k57 |
Release Date: | May 21, 2004 |
Member Year(s): | 2004 |
DCMI Type(s): | Sound |
Sample Type: | pcm |
Sample Rate: | 16000 |
Data Source(s): | microphone conversation |
Project(s): | ROAR, JANUS, GENOA |
Application(s): | speech recognition, speaker identification, meeting summarization |
Language(s): | English |
Language ID(s): | eng |
License(s): | LDC User Agreement for Non-Members |
Online Documentation: | LDC2004S05 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Burger, Susanne, Victoria MacLaren, and Alex Waibel. ISL Meeting Speech Part 1 LDC2004S05. Web Download. Philadelphia: Linguistic Data Consortium, 2004. |
Introduction
ISL Meeting Speech Part 1 was produced by the Linguistic Data Consortium (LDC) under catalog number LDC2004S05 and ISBN 1-58563-294-5.
ISL Meeting Speech Part 1 is a first subset of the ISL Meeting Corpus (112 meetings). It contains 18 meetings collected at the Interactive Systems Laboratories at Carnegie Mellon University in Pittsburgh, PA, during 2000-2001. The recorded meetings were either natural meetings, where participants needed to meet in the real world, or artificial meetings, which were designed explicitly for data collection but still had real topics and tasks. The meetings in this corpus range in duration from eight to 64 minutes and average 34 minutes. Word-level orthographic transcriptions are available as ISL Meeting Transcripts Part 1.
Data
The collection includes 105 speech files, for a total of approximately 10 hours of meeting speech. The speech for each meeting consists of one wave file per channel plus a wave file containing a mix of all channels.
The audio was recorded at a 16 kHz sampling rate. Audio files for each meeting are provided as separate time-synchronous recordings for each channel, encoded as 16-bit (little-endian) wave files.
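As a quick sanity check on a downloaded file, the stated format can be compared against the file header. The following is a minimal sketch, not part of the corpus tooling: it assumes the files are standard RIFF/WAVE files readable by Python's built-in `wave` module, and the function names `describe_wav` and `write_demo_wav` are hypothetical. Because no corpus file names are given here, the demo writes a short silent file in the described format and inspects that instead.

```python
import os
import tempfile
import wave

EXPECTED_RATE_HZ = 16000        # 16 kHz, as stated in the description
EXPECTED_SAMPLE_WIDTH = 2       # 16-bit samples are 2 bytes wide


def describe_wav(path):
    """Read the header of one wave file and compare it with the stated format."""
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        width = wf.getsampwidth()
        channels = wf.getnchannels()
        frames = wf.getnframes()
    return {
        "rate_hz": rate,
        "sample_width_bytes": width,
        "channels": channels,
        "duration_s": frames / rate,
        "matches_description": (
            rate == EXPECTED_RATE_HZ and width == EXPECTED_SAMPLE_WIDTH
        ),
    }


def write_demo_wav(path, seconds=1):
    """Write a silent mono file in the described format (stand-in for corpus audio)."""
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)  # assumption: one channel per file
        wf.setsampwidth(EXPECTED_SAMPLE_WIDTH)
        wf.setframerate(EXPECTED_RATE_HZ)
        wf.writeframes(b"\x00\x00" * (EXPECTED_RATE_HZ * seconds))


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "demo.wav")
    write_demo_wav(demo_path, seconds=2)
    print(describe_wav(demo_path))
```

To check real data, call `describe_wav` with the path of a file from the release. A result with `matches_description` set to `False` would suggest the file is in a different container or was resampled.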
During meeting recordings, each speaker wore an individual lapel microphone and was recorded via an Alesis 8-channel mix board and an ECHO Layla 8-channel sound card. This setup was designed to obtain a consumer- or application-style sound quality. All meetings were recorded in the same instrumented meeting area.
There are 31 unique speakers in the corpus. Meetings involved from three to nine participants, averaging five. The corpus contains a significant proportion of non-native English speakers of varying fluency.
Sponsorship
The collection and preparation of this corpus were made possible in large part through funding from DARPA, both through the GENOA project and through ROAR.
Updates
Additional information, updates, and bug fixes may be available on the ISL Meeting Room project page.