2010 NIST Speaker Recognition Evaluation Test Set 
Authors: Craig Greenberg, Alvin Martin (NIST) 
Dave Graff, Linda Brandschain, Kevin Walker (LDC)


DESCRIPTION 

The 2010 NIST Speaker Recognition Evaluation Test Set was developed by LDC
and NIST (National Institute of Standards and Technology).  The evaluation
data contains 2255 hours of American English telephone speech and speech
recorded over a microphone channel in an interview scenario.  Some of the
telephone conversations were recorded under conditions that involve high
vocal effort on the part of the speaker.

NIST SRE is part of an ongoing series of evaluations conducted by NIST.
These evaluations are an important contribution to the direction of
research efforts and the calibration of technical capabilities. They are
intended to be of interest to all researchers working on the general
problem of text-independent speaker recognition.

DATA SOURCES

The speech recordings in this release were collected in 2009 and 2010 by
LDC at its Human Subjects Data Collection Laboratories in
Philadelphia. This collection was part of the Mixer 6 project, which was
designed to support the development of robust speaker recognition
technology by providing carefully collected and audited speech from a large
pool of speakers recorded simultaneously across numerous microphones.

The telephone speech segments include two-channel excerpts of approximately
10 seconds and 5 minutes.  There are also summed-channel excerpts in the
range of 5 minutes.  The microphone excerpts are 3-15 minutes in
duration. As in prior evaluations, intervals of silence were not removed.
The data included in this release is 8-bit mu-law (ulaw) with a sample
rate of 8000 Hz.
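
As a rough illustration of the format described above, the following Python
sketch expands 8-bit mu-law bytes to 16-bit linear PCM using the standard
G.711 expansion and estimates an excerpt's duration from its sample count.
The function names are hypothetical. The sketch assumes the sample bytes
have already been separated from any file header, since the container
format is not described in this README.

```python
SAMPLE_RATE_HZ = 8000  # sample rate stated in this README


def ulaw_byte_to_pcm16(u):
    """Expand one 8-bit mu-law code (0-255) to a signed 16-bit linear sample."""
    u = ~u & 0xFF                          # mu-law codes are stored bit-inverted
    magnitude = ((u & 0x0F) << 3) + 0x84   # 4-bit mantissa plus the G.711 bias
    magnitude <<= (u & 0x70) >> 4          # scale by the 3-bit exponent
    # The top bit of the inverted code carries the sign.
    return 0x84 - magnitude if u & 0x80 else magnitude - 0x84


def ulaw_to_pcm16(payload):
    """Decode a bytes object of mu-law samples into a list of integers."""
    return [ulaw_byte_to_pcm16(b) for b in payload]


def duration_seconds(num_sample_bytes, channels=1):
    """Duration of an excerpt, given one byte per sample per channel."""
    return num_sample_bytes / (SAMPLE_RATE_HZ * channels)
```

Under these assumptions, 80,000 sample bytes of single-channel audio
correspond to 10 seconds of speech.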

DIRECTORY STRUCTURE

In addition to evaluation data, this package also contains answer keys,
trial and train files, development data, and evaluation documentation.  The
directory structure is as follows:

/data
	/dev/NIST_SRE10_DEV_DATA
	/eval/data
	/eval/doc
	/eval/keys
	/eval/train
	/eval/trials
/docs
/README.txt (this file).

The size and file count of the test data found in the /data/eval/data
directory are listed by sub-directory below:

    /summed        16 GB     7130 files

    /phonecall     48.9 GB
        /mic       11.2 GB   2478 files
        /tel       37.7 GB   8355 files

    /interview     37.6 GB
        /3min      16.2 GB   6028 files
        /8min      21.3 GB   2987 files

    /10sec         2.28 GB
        /0         1.15 GB   6980 files
        /1         1.13 GB   6827 files
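
To check a local copy against the counts listed above, a tally along the
following lines may help. This is a minimal sketch: the function names are
hypothetical, and it simply counts every regular file in each
sub-directory, so any extra files in a local copy would inflate the totals.

```python
import os


def summarize_tree(root):
    """Map each sub-directory (relative to root) that holds files to a
    (file_count, total_bytes) tuple for the files directly inside it."""
    summary = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        if not filenames:
            continue
        total_bytes = sum(
            os.path.getsize(os.path.join(dirpath, name)) for name in filenames
        )
        summary[os.path.relpath(dirpath, root)] = (len(filenames), total_bytes)
    return summary


def print_summary(root):
    """Print one line per sub-directory, with sizes in decimal gigabytes."""
    for subdir, (count, total_bytes) in sorted(summarize_tree(root).items()):
        print(f"{subdir:<20} {total_bytes / 1e9:8.2f} GB {count:6d} files")
```

Calling print_summary on the path to a local copy of /data/eval/data would
print one line per sub-directory. Whether the sizes listed above are in
decimal or binary gigabytes is not stated, so small differences in the size
column are to be expected.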



The train files in /data/eval/train provide three pieces of information for 
building models:

- model ID
- model gender
- audio ID of one or more data files in the eval data
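
The three fields above could be read into a small record type as sketched
below. The exact layout of the train files is not given in this README, so
the sketch assumes, purely for illustration, one model per line with
whitespace-separated fields and audio IDs separated by whitespace or
commas. The names TrainEntry, parse_train_line and parse_train_lines are
hypothetical; consult the evaluation plan for the actual format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TrainEntry:
    model_id: str
    gender: str
    audio_ids: List[str]


def parse_train_line(line):
    """Parse one line under the ASSUMED layout: model ID, gender, audio IDs."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected at least 3 fields, got: {line!r}")
    audio_ids = []
    for field in fields[2:]:
        # Accept comma-separated IDs within a field as well (assumption).
        audio_ids.extend(part for part in field.split(",") if part)
    return TrainEntry(model_id=fields[0], gender=fields[1], audio_ids=audio_ids)


def parse_train_lines(lines):
    """Parse an iterable of lines, skipping blank ones."""
    return [parse_train_line(line) for line in lines if line.strip()]
```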

The training conditions are described in /data/eval/doc/NIST_SRE10_evalplan.r4.pdf

The development data in /data/dev/NIST_SRE10_DEV_DATA consists of 18
interview recordings with a key for each recording and 36 phone call speech
recordings with a key.  Each speech recording file is either a telephone
recording of the subject and interlocutor, or a microphone recording of the
subject on channel "a" with the telephone channel of the interlocutor on
channel "b".  For the telephone recordings, the key indicates which channel
the subject is on.

More information about the SRE10 evaluation can be found in the NIST SRE10
Evaluation Plan contained in /data/eval/doc/NIST_SRE10_evalplan.r4.pdf

README created on September 14, 2016 by Karen Jones