CALLHOME German Lexicon

Item Name: CALLHOME German Lexicon
Authors: Krisjanis Karins, Robert MacIntyre, Monika Brandmair, Susanne Lauscher and Cynthia McLemore
LDC Catalog No.: LDC97L18
ISBN: 1-58563-116-7
Data Type: lexicon
Data Source(s): telephone conversations
Project(s): Hub5-LVCSR
Application(s): speech recognition
Language(s): German
Language ID(s): DEU
Distribution: Web Download
Member Fee: $0 for 1997 members
Non-member Fee: US $2250.00
Reduced-License Fee: US $1125.00
Extra-Copy Fee: N/A
Non-member License: yes
Member License: yes
Online documentation: yes
Citation: Krisjanis Karins, et al.
CALLHOME German Lexicon
Linguistic Data Consortium, Philadelphia


The CALLHOME German corpus collection includes a lexical component. The CALLHOME German lexicon consists of 318,807 words. Of these, 315,503 words are adapted from the CELEX German lexicon produced by the Centre for Lexical Information, Max Planck Institute for Psycholinguistics in Nijmegen, and the remaining 3,304 words come from the 80 training and 20 development test (devtest) transcripts (ten minutes each) of the LDC German CALLHOME telephone speech corpus.


The German lexicon contains tab-separated information fields with orthographic, morphological, phonological, stress, source and frequency information for each word.
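
As a rough illustration of how such a tab-separated entry might be read, here is a minimal Python sketch. It assumes exactly six fields per line, in the order the description above lists them; the actual column order, the encoding of each field, and the names LexiconEntry and parse_line are assumptions made for this example and should be checked against the lexicon documentation. The sample line is invented and is not taken from the lexicon.

```python
from typing import NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon line. Field order is an assumption based on the
    order in which the description lists the information types."""
    orthography: str
    morphology: str
    phonology: str
    stress: str
    source: str
    frequency: str  # kept as text; the real frequency format is not described here


def parse_line(line: str) -> LexiconEntry:
    """Split one tab-separated line into its (assumed) six fields."""
    fields = line.rstrip("\r\n").split("\t")
    expected = len(LexiconEntry._fields)
    if len(fields) != expected:
        raise ValueError(
            f"expected {expected} tab-separated fields, got {len(fields)}"
        )
    return LexiconEntry(*fields)


# Invented placeholder values, for illustration only.
sample = "Haus\tHaus\thaUs\t1\tCELEX\t42\n"
entry = parse_line(sample)
print(entry.orthography, entry.source)
```

Every field is kept as a string so that nothing is assumed about the notation used for pronunciations, stress patterns or frequency counts.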

A sample page from the lexicon is available. The transcripts and documentation (LDC97T15) and the corpus of telephone speech (LDC97S43) are available separately.


