King Saud University Arabic Speech Database
Item Name: | King Saud University Arabic Speech Database |
Author(s): | Mansour Alsulaiman, Ghulam Muhammad, Bencherif Mohamed Abdelkader, Awais Mahmood, Zulfiqar Ali |
LDC Catalog No.: | LDC2014S02 |
ISBN: | 1-58563-669-X |
ISLRN: | 789-673-729-277-5 |
DOI: | https://doi.org/10.35111/vpqe-bz17 |
Release Date: | February 17, 2014 |
Member Year(s): | 2014 |
DCMI Type(s): | Sound |
Sample Type: | pcm |
Sample Rate: | 48000 |
Data Source(s): | microphone speech |
Application(s): | speech recognition, speaker identification |
Language(s): | Arabic |
Language ID(s): | ara |
License(s): |
King Saud University Arabic Speech Database |
Online Documentation: | LDC2014S02 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Alsulaiman, Mansour, et al. King Saud University Arabic Speech Database LDC2014S02. Hard Drive. Philadelphia: Linguistic Data Consortium, 2014. |
Related Works: | View |
Introduction
King Saud University Arabic Speech Database was developed by Speech Group (SG) at King Saud University and contains 590 hours of recorded Arabic speech from 269 male and female speakers. The utterances include read and spontaneous speech. The recordings were conducted in varied environments representing quiet and noisy settings.
Data
The corpus was designed principally for speaker recognition research. However, other possible applications include first language recognition, mobile effect, multichannel effect, and use of different type of microphones. The speech sources are word lists, sentence lists, paragraphs and question and answer sessions. Read speech text includes the following:
- Sets of sentences devised to cover allophones of each phoneme, phonetic balance, and differentiation of accents.
- Word lists developed to minimize missing phonemes and to represent nasals fricatives, commonly used words, and numbers.
- Two paragraphs selected because they included all letters of the alphabet and were easy to read.
Spontaneous speech was captured through question and answer sessions where speakers answer questions displayed on screen. The questions were on general topics such as the weather and food and included the speaker name or number.
The speakers were Saudis and non-Saudis. Among the non-Saudi participants were Arabs and non-Arabs. All female speakers were either Saudis or non-Saudi Arabs. Male speakers included non-Arabs from the Indian subcontinent, Africa, South East Asia and East Europe. Non-Arab participants were required to be able to read Arabic at an acceptable level. Most of the Non-Arab speakers were from the fourth level in the Arabic Linguistics Institute at King Saud University. The non-Saudi participants represented 28 nationalities and were chosen from clusters of areas or countries.
Each speaker was recorded in three different environments: in a soundproof room , in an office and in a cafeteria. The recordings were collected via different microphones and a mobile phone and averaged between 16-19 minutes. The recordings were done in three sessions with a time-gap of an approximately 6 weeks.
The data was verified for missing recordings, problems with the recording system or errors in the recording process. All files are presented as two channel 48 kHz 16-bit FLAC compressed PCM wav files. Note that sizes and file names in the documentation are for the uncompressed wav files.
Samples
Please view this male sample and female sample.
Updates
None at this time.