CALLFRIEND Mandarin Chinese-Mainland Dialect
|Item Name:||CALLFRIEND Mandarin Chinese-Mainland Dialect|
|Author(s):||Alexandra Canavan, George Zipperlen|
|LDC Catalog No.:||LDC96S55|
|Member Year(s):||1996, 1997|
|Sample Type:||2-channel ulaw|
|Data Source(s):||telephone conversations|
|Project(s):||LID, GALE, EARS|
LDC User Agreement for Non-Members
|Online Documentation:||LDC96S55 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Canavan, Alexandra, and George Zipperlen. CALLFRIEND Mandarin Chinese-Mainland Dialect LDC96S55. Web Download. Philadelphia: Linguistic Data Consortium, 1996.|
CALLFRIEND Mandarin Chinese-Mainland Dialect was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of unscripted telephone conversations between native speakers of the Mandarin Chinese dialect spoken in mainland China.
The CALLFRIEND series is a collection of telephone conversations in several languages conducted by LDC in support of language identification technology development. Languages covered in the collection include American English, Canadian French, Egyptian Arabic, Farsi, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil and Vietnamese.
An updated edition of this corpus is available as CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition (LDC2018S09). The second edition updates the audio files to wav format, simplifies the directory structure and adds documentation and metadata.
The corpus consists of 60 unscripted telephone conversations, lasting between 5-30 minutes. The corpus also includes documentation describing speaker information (sex, age, education, callee telephone number) and call information (channel quality, number of speakers).
For each conversation, both the caller and callee are native speakers of Mandarin Chinese from Mainland China. All calls are domestic and were placed inside the continental United States and Canada.
Callers in the "Mainland" and "Taiwan" collections of CALLFRIEND Mandarin were identified primarily on the basis of specific attributes in their speech characteristic of geographic origin.
There are no updates at this time.