Korean Telephone Conversations Lexicon
Item Name: | Korean Telephone Conversations Lexicon |
Author(s): | Na-Rae Han, David Graff, Myeonchul Kim |
LDC Catalog No.: | LDC2003L02 |
ISBN: | 1-58563-265-1 |
ISLRN: | 261-728-030-958-6 |
DOI: | https://doi.org/10.35111/hpva-ym97 |
Release Date: | May 16, 2003 |
Member Year(s): | 2003 |
DCMI Type(s): | Text |
Data Source(s): | telephone conversations |
Application(s): | speech recognition |
Language(s): | Korean |
Language ID(s): | kor |
License(s): | LDC User Agreement for Non-Members |
Online Documentation: | LDC2003L02 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Han, Na-Rae, David Graff, and Myeonchul Kim. Korean Telephone Conversations Lexicon LDC2003L02. Web Download. Philadelphia: Linguistic Data Consortium, 2003. |
Related Works: | View |
Introduction
Korean Telephone Conversations Lexicon was produced by the Linguistic Data Consortium (LDC) under catalog number LDC2003L02 and ISBN 1-58563-265-1.
Korean Telephone Conversations Lexicon consists of 25,251 words and provides separate fields with phonological, morphological, and frequency information for each word.
The lexicon covers the tokens occurring in 100 telephone conversations transcribed and published as Korean Telephone Conversations Transcripts. The token coverage is 100%. The corresponding speech is published as Korean Telephone Conversations Speech.
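As an illustration of what a token coverage figure means, the sketch below computes the share of running transcript tokens whose form appears among the lexicon head-words. It is not the procedure LDC used: the function name `token_coverage` is hypothetical, and how the transcripts are tokenized is left to the caller because the source does not specify it.

```python
from typing import Iterable, Set


def token_coverage(tokens: Iterable[str], headwords: Set[str]) -> float:
    """Return the fraction of running tokens found among the head-words.

    `tokens` is the sequence of transcript tokens (tokenization is the
    caller's responsibility; the source does not describe it).
    `headwords` is the set of orthographic forms listed in the lexicon.
    """
    total = 0
    covered = 0
    for token in tokens:
        total += 1
        if token in headwords:
            covered += 1
    if total == 0:
        raise ValueError("cannot compute coverage of an empty token stream")
    return covered / total
```

A return value of 1.0 corresponds to the 100% coverage stated above: every token in the transcripts has an entry in the lexicon.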
Data
The lexicon contains five tab-separated information fields:
- orthographic form in Hangul (head-word), encoded in the KSC-5601 (Wansung) system
- orthographic form in Yale romanization
- pronunciation
- frequency of the word in Korean Telephone Conversations Transcripts
- morphological analysis of the word
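The five fields listed above could be read with a short Python routine such as the following. This is a minimal sketch under several assumptions that the source does not confirm: that each entry sits on one line, that the fields appear in the order listed, that the frequency is a plain integer, and that the KSC-5601 bytes can be decoded with Python's `euc_kr` codec. The names `LexEntry` and `parse_lexicon` are hypothetical.

```python
from typing import List, NamedTuple


class LexEntry(NamedTuple):
    hangul: str         # head-word in Hangul
    yale: str           # head-word in Yale romanization
    pronunciation: str  # pronunciation field, kept as an opaque string
    frequency: int      # count in the transcripts (assumed to be an integer)
    morphology: str     # morphological analysis, kept as an opaque string


def parse_lexicon(raw: bytes, encoding: str = "euc_kr") -> List[LexEntry]:
    """Decode lexicon bytes and split each non-empty line into five fields.

    The default codec is an assumption: KS C 5601 text is commonly stored
    as EUC-KR, but the actual file should be checked.
    """
    entries = []
    for line_no, line in enumerate(raw.decode(encoding).splitlines(), start=1):
        if not line.strip():
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 5:
            raise ValueError(
                f"line {line_no}: expected 5 tab-separated fields, got {len(fields)}"
            )
        hangul, yale, pronunciation, frequency, morphology = fields
        entries.append(
            LexEntry(hangul, yale, pronunciation, int(frequency), morphology)
        )
    return entries
```

The pronunciation and morphology fields are deliberately left unparsed, since their internal notation is described in the lexicon documentation rather than here.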
Please follow this link for a sample page from the lexicon: txt | gif.
Updates
There are no updates available at this time.