The CALLHOME Spanish collection includes a lexical component. The CALLHOME Spanish Lexicon consists of 45,582 words and contains separate information fields with phonological, morphological and frequency information for each word.
The token coverage by the LDC Spanish lexicon of words occurring in the 20 LDC Spanish CALLHOME devtest transcripts (ten minutes of conversation each) is 98.7%.
For examples of listings from the Lexicon, please look at the following samples pages:sample1 sample2
The transcripts and documentation (LDC96T17) are available separately, as is a corpus of telephone speech (LDC96S35).