SRI-FRTIV
Item Name: | SRI-FRTIV |
Author(s): | Elizabeth Shriberg, Andreas Kathol, Martin Graciarena, Harry Bratt, Sachin Kajarekar, Huda Jameel, Colleen Richey, Fred Goodman |
LDC Catalog No.: | LDC2017S18 |
ISBN: | 1-58563-813-7 |
ISLRN: | 856-028-165-105-5 |
DOI: | https://doi.org/10.35111/aw84-h425 |
Release Date: | September 14, 2017 |
Member Year(s): | 2017 |
DCMI Type(s): | Sound |
Sample Type: | pcm |
Sample Rate: | 16000 |
Data Source(s): | microphone conversation, microphone speech |
Application(s): | speech recognition |
Language(s): | English |
Language ID(s): | eng |
License(s): |
LDC User Agreement for Non-Members |
Online Documentation: | LDC2017S18 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Shriberg, Elizabeth, et al. SRI-FRTIV LDC2017S18. Web Download. Philadelphia: Linguistic Data Consortium, 2017. |
Introduction
SRI-FRTIV (Five-way Recorded Toastmaster Intrinsic Variation) was developed by SRI International in 2007-2008 and is comprised of approximately 232 hours of English speech from thirty-four speakers who were members of Toastmaster clubs. Participants were asked to speak at three different levels of effort (low, normal and high) in four different styles (interview, conversation, reading and oration) to study the question of how intrinsic variations -- associated with the speaker rather than the recording environment -- affect text-independent speaker verification.
Data
Participants were native speakers of North American English who were members of local Toastmasters clubs and had experience in public speaking. This release includes demographic information for 30 speakers (15 male, 15 female), including gender, birth year, height, education level, years in Toastmasters, and a self-evaluation of speaking skills.
Not all effort levels were applicable for each speaking style and so were not collected. Interviews and phone conversations were not recorded at high effort and oration was not recorded at low or normal effort levels.
Speech data is presented as 16kHz 16-bit single channel flac compressed pcm wav (.flac).
Samples
Please listen to this low effort interview sample.
Updates
None at this time.