CALLFRIEND Farsi Second Edition Speech was developed by the Linguistic Data Consortium (LDC) and consists
of approximately 42 hours of telephone conversation (100 recordings) among native
Farsi speakers. The calls were recorded in 1995 and 1996 as part of the CALLFRIEND
collection, a project designed primarily to support research in automatic language
identification. One hundred native Farsi speakers living in the continental
United States each made a single telephone call, lasting up to 30 minutes, to
a family member or friend living in the United States.
This release includes all calls from the collection. LDC released recordings
from 60 calls without transcripts in 1996 as CALLFRIEND Farsi (LDC96S50)
after 20 of those calls were used as evaluation data in the first NIST
Language Recognition Evaluation (LRE).
Corresponding transcripts are available in CALLFRIEND Farsi Second Edition Transcripts.
All recordings involved domestic calls routed through the automated telephone
collection platform at LDC and were stored as 2-channel (4-wire), 8-kHz mu-law samples
taken directly from the public telephone network via a T-1 circuit. Each audio
file in this release is a FLAC-compressed MS-WAV (RIFF) file
containing 2-channel, 8-kHz, 16-bit PCM sample data.
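The relationship between the original 8-bit mu-law samples and the 16-bit PCM data in the released files can be illustrated with the standard G.711 mu-law expansion. The sketch below is illustrative only: the function names are hypothetical, the byte-interleaved channel layout is an assumption, and nothing here is taken from LDC's actual conversion tooling.

```python
# Minimal sketch of G.711 mu-law expansion to 16-bit linear PCM.
# Assumption: raw 2-channel data is byte-interleaved (A, B, A, B, ...).
# Function names are hypothetical and not part of any LDC tool.

BIAS = 0x84  # standard G.711 mu-law bias (132)


def ulaw_to_linear(code: int) -> int:
    """Expand one 8-bit mu-law code word to a signed 16-bit PCM value."""
    code = ~code & 0xFF                 # mu-law bytes are stored bit-inverted
    mantissa = code & 0x0F              # low 4 bits
    exponent = (code & 0x70) >> 4       # 3-bit segment number
    magnitude = ((mantissa << 3) + BIAS) << exponent
    # The top bit of the inverted code marks a negative sample.
    return BIAS - magnitude if code & 0x80 else magnitude - BIAS


def split_channels(raw: bytes) -> tuple[list[int], list[int]]:
    """Decode interleaved 2-channel mu-law bytes into two PCM sample lists."""
    channel_a = [ulaw_to_linear(b) for b in raw[0::2]]
    channel_b = [ulaw_to_linear(b) for b in raw[1::2]]
    return channel_a, channel_b
```

The released files already contain decoded 16-bit PCM inside FLAC compression, so a step like this is only needed when working with raw mu-law telephone data.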
This release also provides metadata for each call: speaker gender, the number of
speakers on each channel, and call duration.
Portions © 1995-1996, 2014 Trustees of the University of Pennsylvania