NIST Meeting Pilot Corpus Speech


Introduction

NIST (National Institute of Standards and Technology) Meeting Pilot Corpus Speech, Linguistic Data Consortium (LDC) catalog number LDC2004S09 and ISBN 1-58563-302-x, was collected in the NIST Meeting Data Collection Laboratory for the NIST Automatic Meeting Recognition Project. The corresponding transcripts are available as the NIST Meeting Pilot Corpus Transcripts and Metadata, LDC2004T13.

Data

The data consists of 369 SPHERE audio files generated from 19 meetings (comprising about 15 hours of meeting room data and amounting to about 32 GB) recorded between November 2001 and December 2003.

Each meeting was recorded using two wireless "personal" microphones attached to each meeting participant: a close-talking noise-cancelling boom microphone and an omni-directional lapel microphone. Each meeting was also recorded using three omni-directional table microphoness and one 4-channel directional table microphone covering 365 degrees (each channel is recorded in a separate file). Each individual channel was converted from its 48Khz, 24-bits, linear PCM source format to 16 Khz, 16 bits, linear PCM-sampled audio SPHERE-formatted files.

Presented here are excerpts from a group discussion on bioterrorism. These excerpts are each approximately three minutes long. The excerpts are linked below with their metadata.

1. 20011115-1050 Meeting Information


General information

Topic :Bioterrorism
Type :focus group discussion
Date :
Thursday 15th of November 2001 - 10:50
Duration :00:17:52
Participants miked : 4
Location : NIST/225/B243

 

Subjects, Audio and Transcripts

Participant ID Gender Native
1: Audio/Transcripts Male Yes
2: Audio/Transcripts Male Yes
3: Audio/Transcripts Female Yes
4: Audio/Transcripts Male Yes
All: Audio/Transcripts    

Microphones used by subjects

MicrophoneParticipant ID StatusTypeSerialModelNotes
HM-1 1OKhead0308010165U1-B -
LM-1 1OKlapel0328011365U1-B -
HM-2 2OKhead0308010167U1-B -
LM-2 2OKlapel0308010168U1-B -
HM-3 3OKhead0328011370U1-B -
LM-3 3OKlapel0328011366U1-B -
HM-4 4OKhead0328011368U1-B -
LM-4 4OKlapel0328011369U1-A -

Room microphones

MicrophoneStatusTypeModelSerialLocationNotes
ARRAY-1OKarrayNIST_ARRAY_V1-001-A1NIST_ARRAY_V1(2.88,0.00,1.20) -
ARRAY-2OKarrayNIST_ARRAY_V1-001-A2NIST_ARRAY_V1(-0.40,3.26,0.40) -
ARRAY-3OKarrayNIST_ARRAY_V1-001-A3NIST_ARRAY_V1(4.54,6.60,1.20) -
OMNI-1OKtableAT841A-001-OMNI1AT841A(1.51,3.26,0.74) -
OMNI-2OKtableAT841A-002-OMNI2AT841A(4.40,3.26,0.74) -
OMNI-3OKtableAT841A-003-OMNI3AT841A(3.30,3.26,0.74) -
QUAD1-1OKtableAT854R-001-QUAD1AT854R(2.95,3.26,0.74) -
QUAD1-2OKtableAT854R-002-QUAD2AT854R(2.95,3.26,0.74) -
QUAD1-3OKtableAT854R-003-QUAD3AT854R(2.95,3.26,0.74) -
QUAD1-4OKtableAT854R-004-QUAD4AT854R(2.95,3.26,0.74) -

Cameras details

CameraStatusSerialModelViewLocationNotes
CAMERA-1 OK155388EVI-D30 wall_machine_room(2.52,0.00,2.40) -
CAMERA-2 OK157073EVI-D30 wall_entrance_door(6.84,2.36,2.40) -
CAMERA-3 OK129833EVI-D30 wall_projector_screen(-0.40,2.36,2.70) -
CAMERA-4 OK157074EVI-D30 wall_white_board(2.65,6.60,2.40) -
CAMERA-5 OK120631EVI-D30 (2.75,6.60,2.40) -

Artifacts

ObjectsSizeLocationDescription
Projector screen 2.40x1.50 m - -

Room layout

ObjectsSizeLocationDescription
Rect Tab Mod 1.51x0.75x0.74 m(1.37,2.51,0.74)Rectangular Table Module
Rect Tab Mod 1.51x0.75x0.74 m(2.90,2.51,0.74)Rectangular Table Module
Rect Tab Mod 1.51x0.75x0.74 m(3.26,1.37,0.74)Rectangular Table Module
Rect Tab Mod 1.51x0.75x0.74 m(3.26,2.90,0.74)Rectangular Table Module
Half Round Tab Mod R=0.75 L=1.51 H=0.74(4.40,2.51,0.74)Half Round Table module
Rect Tab Mod 1.51x0.75x0.74 m(2.90,2.51,0.74)Rectangular Table Module

*The red bullet (junction of West wall and North wall at the floor) is the origin (0.00; 0.00; 0.00) from which all measurements were made.
For example, Camera 3 is situated on coordinate (-0.40; 2.36; 2.70) from the origin it's 0.40 meters to the north, 2.36 meters to the East and 2.7 meters High.

Room layout

Content Copyright

Portions © 2004 Trustees of the University of Pennsylvania


Back to Table of Contents


Contact: ldc@ldc.upenn.edu
© 2004 Linguistic Data Consortium, Trustees of the University of Pennsylvania. All Rights Reserved.