Arabic Speech Recognition Pronunciation Dictionary

Item Name: Arabic Speech Recognition Pronunciation Dictionary
Author(s): Ahmed Ali
LDC Catalog No.: LDC2017L01
ISBN: 1-58563-783-1
ISLRN: 445-866-322-325-6
DOI: https://doi.org/10.35111/9abp-k222
Release Date: January 19, 2017
Member Year(s): 2017
DCMI Type(s): Text
Data Source(s): newswire
Application(s): language teaching, speech recognition, pronunciation modeling
Language(s): Arabic, Standard Arabic
Language ID(s): ara, arb
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2017L01 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Ali, Ahmed. Arabic Speech Recognition Pronunciation Dictionary LDC2017L01. Web Download. Philadelphia: Linguistic Data Consortium, 2017.

Introduction

Arabic Speech Recognition Pronunciation Dictionary was developed by the Qatar Computing Research Institute. It contains approximately two million pronunciation entries for 526,000 Modern Standard Arabic words, for an average of 3.84 pronunciations for each grapheme word.

Data

The dictionary was developed from news archive resources, including the Arabic news website Aljazeera.net. The selected words were those that occurred more than once in the news collection. The text was processed using MADA.

The dictionary is presented in a single UTF-8 plain text file.

Samples

Please view this sample.

Updates

None at this time.

Available Media

View Fees





Login for the applicable fee