Home › Language Resources › Data

UCLA Speaker Variability Database

Item Name:	UCLA Speaker Variability Database
Author(s):	Patricia Keating, Jody Kreiman, Abeer Alwan, Adam Chong, Yoonjeong Lee
LDC Catalog No.:	LDC2021S09
ISBN:	1-58563-977-X
ISLRN:	890-519-200-259-7
DOI:	https://doi.org/10.35111/c5gk-6j49
Release Date:	October 15, 2021
Member Year(s):	2021
DCMI Type(s):	Sound, Text
Sample Type:	pcm
Sample Rate:	16000
Data Source(s):	microphone conversation, microphone speech
Application(s):	linguistic analysis, phonetics, psycholinguistics, speaker identification, speech recognition
Language(s):	English
Language ID(s):	eng
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC2021S09 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Keating, Patricia, et al. UCLA Speaker Variability Database LDC2021S09. Web Download. Philadelphia: Linguistic Data Consortium, 2021.
Related Works: Hide	View relatesTo LDC2023S07 LDC Spoken Language Sampler - Sixth Release

Introduction

UCLA Speaker Variability Database was developed by UCLA Speech Processing and Auditory Perception Laboratory and is comprised of approximately 34 hours of English speech and orthographic transcripts.

This corpus was designed to sample variability in speaking within individual speakers and across a large number of speakers. Participants took part in six different tasks: vowel sounds, reading sentences, giving instructions, neutral conversation, happy conversation, a phone conversation, annoyed conversation, and responding to a video.

Data

Speakers (101 female, 101 male) were recruited from the UCLA university community and represented a variety of language backgrounds. Speaker metadata is included in the documentation for this release.

Audio files are presented as 16kHz, 16-bit single channel, flac compressed wav files.

Most speech files have two corresponding Praat TextGrid files: an orthographic sentence/utterance transcription and output from forced alignment.

UCLA Speaker Variability Database

Introduction

Data

Samples

Updates

Copyright

Available Media

View Fees