
CALLFRIEND Mandarin Chinese-Mainland Dialect

Item Name:	CALLFRIEND Mandarin Chinese-Mainland Dialect
Author(s):	Alexandra Canavan, George Zipperlen
LDC Catalog No.:	LDC96S55
ISBN:	1-58563-070-5
ISLRN:	608-636-717-091-6
DOI:	https://doi.org/10.35111/5ha0-rb62
Member Year(s):	1996, 1997
DCMI Type(s):	Sound
Sample Type:	2-channel ulaw
Sample Rate:	8000
Data Source(s):	telephone conversations
Project(s):	LID, GALE, EARS
Application(s):	language identification
Language(s):	Mandarin Chinese
Language ID(s):	cmn
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC96S55 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Canavan, Alexandra, and George Zipperlen. CALLFRIEND Mandarin Chinese-Mainland Dialect LDC96S55. Web Download. Philadelphia: Linguistic Data Consortium, 1996.
Related Works:
hasVersion
  LDC2018S09 CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition
isPartOf
  LDC2025S04 BOLT CTS CALLFRIEND CALLHOME Mainland Mandarin Chinese Audio
isPartWith
  LDC96S34 CALLHOME Mandarin Chinese Speech
  LDC98S69 HUB5 Mandarin Telephone Speech Corpus
  LDC2002S12 2001 HUB5 Mandarin Evaluation
  LDC2018S09 CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition
  LDC2018S18 HUB5 Mandarin Telephone Speech and Transcripts Second Edition
hasAnnotation
  LDC98T26 HUB5 Mandarin Transcripts
  LDC2007S10 2003 NIST Rich Transcription Evaluation Data
hasOutcome
  LDC98S69 HUB5 Mandarin Telephone Speech Corpus
  LDC2018S18 HUB5 Mandarin Telephone Speech and Transcripts Second Edition
isSimilarWith
  LDC96S34 CALLHOME Mandarin Chinese Speech
  LDC96S56 CALLFRIEND Mandarin Chinese-Taiwan Dialect
  LDC2005S15 HKUST Mandarin Telephone Speech, Part 1
  LDC2020S06 CALLFRIEND Mandarin Chinese-Taiwan Dialect Second Edition

Introduction

CALLFRIEND Mandarin Chinese-Mainland Dialect was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of unscripted telephone conversations between native speakers of the Mandarin Chinese dialect spoken in mainland China.

The CALLFRIEND series is a collection of telephone conversations in several languages conducted by LDC in support of language identification technology development. Languages covered in the collection include American English, Canadian French, Egyptian Arabic, Farsi, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil and Vietnamese.

An updated edition of this corpus is available as CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition (LDC2018S09). The second edition updates the audio files to wav format, simplifies the directory structure and adds documentation and metadata.

Data

The corpus consists of 60 unscripted telephone conversations, each lasting between 5 and 30 minutes. The corpus also includes documentation describing speaker information (sex, age, education, callee telephone number) and call information (channel quality, number of speakers).
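The sample type and rate listed for this corpus (2-channel ulaw, 8000 Hz) imply one byte per sample per channel. As an illustration only, the following Python sketch decodes G.711 mu-law bytes to 16-bit linear PCM and separates the two sides of a call. It assumes the audio payload has already been extracted from its container as raw bytes with the two channels interleaved sample by sample; the container handling and the function names (`ulaw_to_linear`, `decode_stereo`, `duration_seconds`) are hypothetical and not part of the LDC documentation.

```python
SAMPLE_RATE_HZ = 8000  # "Sample Rate" field of the catalog entry
NUM_CHANNELS = 2       # "Sample Type" field: 2-channel ulaw


def ulaw_to_linear(byte: int) -> int:
    """Decode one G.711 mu-law byte to a signed 16-bit linear sample."""
    u = ~byte & 0xFF            # mu-law bytes are stored bit-inverted
    sign = u & 0x80             # top bit: sign
    exponent = (u >> 4) & 0x07  # next three bits: segment number
    mantissa = u & 0x0F         # low four bits: step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def decode_stereo(payload: bytes) -> tuple[list[int], list[int]]:
    """Split raw 2-channel mu-law bytes into two lists of PCM samples.

    Assumption: samples alternate between the two sides of the call
    (first byte is channel A, second byte is channel B, and so on).
    """
    side_a = [ulaw_to_linear(b) for b in payload[0::2]]
    side_b = [ulaw_to_linear(b) for b in payload[1::2]]
    return side_a, side_b


def duration_seconds(payload_bytes: int) -> float:
    """Duration of a raw payload at one byte per sample per channel."""
    return payload_bytes / (SAMPLE_RATE_HZ * NUM_CHANNELS)
```

Under these assumptions, a 30-minute conversation corresponds to 30 × 60 × 8000 × 2 = 28,800,000 bytes of raw audio, so `duration_seconds(28_800_000)` gives 1800.0.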

For each conversation, both the caller and callee are native speakers of Mandarin Chinese from Mainland China. All calls are domestic and were placed inside the continental United States and Canada.

Callers in the "Mainland" and "Taiwan" collections of CALLFRIEND Mandarin were identified primarily on the basis of specific speech attributes characteristic of their geographic origin.

Updates

There are no updates at this time.
