Arabic Speech Recognition Pronunciation Dictionary
Item Name: | Arabic Speech Recognition Pronunciation Dictionary |
Author(s): | Ahmed Ali |
LDC Catalog No.: | LDC2017L01 |
ISBN: | 1-58563-783-1 |
ISLRN: | 445-866-322-325-6 |
DOI: | https://doi.org/10.35111/9abp-k222 |
Release Date: | January 19, 2017 |
Member Year(s): | 2017 |
DCMI Type(s): | Text |
Data Source(s): | newswire |
Application(s): | language teaching, speech recognition, pronunciation modeling |
Language(s): | Arabic, Standard Arabic |
Language ID(s): | ara, arb |
License(s): |
LDC User Agreement for Non-Members |
Online Documentation: | LDC2017L01 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Ali, Ahmed. Arabic Speech Recognition Pronunciation Dictionary LDC2017L01. Web Download. Philadelphia: Linguistic Data Consortium, 2017. |
Introduction
Arabic Speech Recognition Pronunciation Dictionary was developed by the Qatar Computing Research Institute. It contains approximately two million pronunciation entries for 526,000 Modern Standard Arabic words, for an average of 3.84 pronunciations for each grapheme word.
Data
The dictionary was developed from news archive resources, including the Arabic news website Aljazeera.net. The selected words were those that occurred more than once in the news collection. The text was processed using MADA.
The dictionary is presented in a single UTF-8 plain text file.
Samples
Please view this sample.
Updates
None at this time.