CALLHOME German Lexicon


Item Name: CALLHOME German Lexicon
Authors: Krisjanis Karins, Robert MacIntyre, Monika Brandmair, Susanne Lauscher and Cynthia McLemore
LDC Catalog No.: LDC97L18
ISBN: 1-58563-116-7
Data Type: lexicon
Data Source(s): telephone conversations
Project(s): Hub5-LVCSR
Application(s): speech recognition
Language(s): German
Language ID(s): DEU
Distribution: Web Download
Member fee: $0 for 1997 members
Non-member Fee: US $2250.00
Reduced-License Fee: US $1125.00
Extra-Copy Fee: N/A
Non-member License: yes
Member License: yes
Online documentation: yes
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Krisjanis Karins, et al.
1997
CALLHOME German Lexicon
Linguistic Data Consortium, Philadelphia

Introduction

The CALLHOME German corpus collection includes a lexical component. The CALLHOME German lexicon consists of 318,807 words. Of these, 315,503 words are adapted from the CELEX German lexicon produced by The Centre for Lexical Information, Max Planck Institute for Psycholinguistics in Nijmigen and 3,304 additional words come from the 80 training and 20 development test (devtest) transcripts (ten minutes each) from the LDC German CALLHOME telephone speech corpus.

Data

The German lexicon contains tab-separated information fields with orthographic, morphological, phonological, stress, source and frequency information for each word.

Here is a sample page from the lexicon. The transcripts and documentation (LDC97T15) are available separately, as is a corpus of telephone speech (LDC97S43).

Updates

There are no updates at this time.

Content Copyright