Author(s): Karen Jones, Kevin Walker, David Graff, Jonathan Wright, Stephanie Strassel
LDC Catalog No.: LDC2024S05
ISLRN: 703-177-980-898-8
Release Date: May 15, 2024
Member Year(s): 2024
DCMI Type(s): Sound
Sample Type: pcm
Sample Rate: 8000
Data Source(s): telephone conversations
Project(s): NIST SRE
Application(s): speaker identification
Language(s): Yue Chinese, Mandarin Chinese, Cebuano, Tagalog
Language ID(s): yue, cmn, ceb, tgl
Citation: Jones, Karen, et al. Call My Net 1 LDC2024S05. Web Download. Philadelphia: Linguistic Data Consortium, 2024.
Call My Net 1 was developed by the Linguistic Data Consortium. It contains 364 hours of conversational telephone speech in four languages (Tagalog, Cebuano, Cantonese and Mandarin), collected in 2015 from 221 native speakers located in the Philippines and China, along with metadata and speaker demographic information. Recordings and data from this collection were used to support the NIST 2016 Speaker Recognition Evaluation.


Speakers were recruited to make 10 telephone calls each to people within their existing social networks, using different handsets and under a variety of noise conditions. Speakers were connected through a robot operator to carry on casual conversations on topics of their choice.

All recordings were manually audited to confirm language and speaker requirements. The documentation for this release includes metadata about phone type, noise conditions and call quality. Speaker demographic information on year of birth, sex and native language is also included.

This corpus contains 2,472 telephone recordings. Audio files are presented as 2-channel, 16-bit, 8 kHz PCM audio in FLAC format.
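
For readers who want to work with one channel at a time, the following is a minimal sketch of separating a 2-channel, 16-bit recording into per-channel sample lists. It assumes the FLAC file has already been decoded to raw interleaved little-endian PCM bytes by a decoder of your choice (decoding is not shown), and the function name split_channels is hypothetical, not part of the release.

```python
import struct

def split_channels(pcm_bytes, channels=2):
    """Split interleaved signed 16-bit little-endian PCM into per-channel lists.

    Assumption: pcm_bytes holds already-decoded samples; FLAC decoding
    itself is outside the scope of this sketch.
    """
    frame_size = 2 * channels  # 2 bytes per 16-bit sample
    if len(pcm_bytes) % frame_size != 0:
        raise ValueError("byte count is not a whole number of frames")
    count = len(pcm_bytes) // 2
    samples = struct.unpack("<%dh" % count, pcm_bytes)
    # Every `channels`-th sample, starting at offset c, belongs to channel c.
    return [list(samples[c::channels]) for c in range(channels)]

# Made-up demonstration data: three frames of two channels each.
demo = struct.pack("<6h", 1, -1, 2, -2, 3, -3)
channel_a, channel_b = split_channels(demo)
```

At the stated 8 kHz sample rate, the duration of a channel in seconds is its sample count divided by 8000.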

Metadata and demographic information are presented in tab-delimited files.
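
A tab-delimited file can be read with Python's standard csv module. The sketch below assumes each file begins with a header row; the column names and values in the sample are made up for illustration and are not the actual fields of this release, so consult the corpus documentation for the real layout.

```python
import csv
import io

def read_tab_delimited(handle):
    """Return one dict per data row, keyed by the header line.

    Assumption: the first line of the file is a header row.
    """
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [dict(row) for row in reader]

# Hypothetical sample content; real column names may differ.
sample = (
    "speaker_id\tyear_of_birth\tsex\tnative_language\n"
    "0001\t1985\tF\ttgl\n"
)
rows = read_tab_delimited(io.StringIO(sample))
```

For a file on disk, pass an open handle instead, for example open(path, newline="", encoding="utf-8"), where the UTF-8 encoding is itself an assumption.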

