CALLHOME Mandarin Chinese Lexicon
|Item Name:||CALLHOME Mandarin Chinese Lexicon|
|Author(s):||Shudong Huang, Xuejun Bian, Grace Wu, Cynthia McLemore|
|LDC Catalog No.:||LDC96L15|
|Member Year(s):||1996, 1997|
|Data Source(s):||telephone conversations|
|Project(s):||EARS, Hub5-LVCSR, GALE|
CALLHOME Lexicon Agreement (Commercial)
CALLHOME Lexicon Agreement (Non-Commercial)
CALLHOME Lexicon Agreement (Non-Member)
|Online Documentation:||LDC96L15 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Huang, Shudong, et al. CALLHOME Mandarin Chinese Lexicon LDC96L15. Web Download. Philadelphia: Linguistic Data Consortium, 1996.|
The CALLHOME Mandarin Chinese collection includes a lexical component. The CALLHOME Mandarin Lexicon consists of 44,405 words and contains separate information fields with phonological, morphological and frequency information for each word.
The token coverage by the LDC Mandarin lexicon of words occurring in the 20 LDC Mandarin CALLHOME devtest transcripts (ten minutes of conversation each) is 98%.
Orthographic Chinese characters are GB-encoded and are simplified in the Mainland style. A representation of the headword in tone pinyin with strictly lexical tone, i.e. not reflecting phonetic/phonological processes is also provided.