|The CALLHOME Mandarin Chinese corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Mandarin Chinese. |
All calls, which lasted up to 30 minutes, originated in North America and were placed to locations overseas. Most participants called family members or close friends.
This corpus contains speech data files ONLY, along with the minimal amount of documentation needed to describe the contents and format of the speech files and the software packages needed to uncompress the speech data. The transcripts and documentation (LDC96T16) are available separately, as is an associated lexicon (LDC96L15).