This is the release of the CallFriend Mandarin Speech Corpus Mainland Dialect, produced by the Linguistic Data Consortium. This release contains speech data files ONLY, along with documentation describing speaker information (sex, age, education, callee telephone number) and call information (channel quality, number of speakers). These files are not compressed. Summary of contents: --------------------------- index.html html page that links to everything in the docs folder. docs/ README.txt This file. cf_man_m.txt Description of the CallFriend telephone speech corpus for Mandarin. callinfo.txt Explanation of the audit information provided in "callinfo.tbl". callinfo.tbl A list of audit information as explained in "callinfo.txt", with information on number and sex of speakers and several sound quality judgements. headerinfo.txt Explanation of the SPH header information provided in "header_info.tbl". headerinfo.tbl A table of the data that was originally in the SPH header for each audio file before they were converted. spkrinfo.txt Explanation of the speaker demographic information provided in "spkrinfo.tbl". spkrinfo.tbl A table of information provided about the speakers involved in each phone call, such as age and hometown. The first 11 of the speakers in this corpus did not provide this information, so their entries in the table are empty. file_partitions.txt Categroizes each of the audio files in the corpus into their original partitions (train, devtest, evltest) data/ The speech data files. These files were originally divided into train, devtest and evltest partitions, which are now described in file_partitions.txt Note that the partitioning of speech data into sets for "training", "development test" and "evaluation test" sets reflected the original usage of the speech files by participants in the U.S. Government- sponsored project on Language Identification (LID). As of this release, there are 20 conversations in the training set, 20 in the development test set, and 20 in the evaluation test set. METADATA: ---------------- Total Duration: 24:30:01 Duration by language: - Mandarin 24:30:01 Calls per caller: 1 Unique Speakers: >120 - ma_0636: 2 speakers (all female) - ma_0664: 2 speakers (all female) - ma_0928: 2 speakers (all male) - ma_0941: 2 speakers (male and female) - ma_0964: 2 speakers (all female) - ma_0966: 2 speakers (all female) - ma_1017: 2 speakers (all female) - ma_1034: 2 speakers (all female) - ma_1191: 2 speakers (all female) - ma_1253: 2 speakers (male and female) - ma_1306: 2 speakers (all male) - ma_4160: 2 speakers (male and female) - ma_4198: 2 speakers (all male) - ma_4227: 2 speakers (all female) - ma_4233: 2 speakers (male and female) - ma_4249: 2 speakers (all female) - ma_4257: 2 speakers (all female) - ma_4270: 2 speakers (all female) - ma_4281: 2 speakers (all female) - ma_4336: 2 speakers (all female) - ma_4389: 2 speakers (male and female) - ma_4426: 2 speakers (all female) - ma_4447: 2 speakers (all female) - ma_4559: 2 speakers (all female) - ma_4689: 2 speakers (all female) - ma_4970: 2 speakers (male and female) - ma_4972: 2 speakers (all female) - ma_5010: 2 speakers (all female) - ma_5195: 2 speakers (male and female) - ma_5260: 2 speakers (all male) - ma_5347: 2 speakers (all male) - ma_5520: 2 speakers (all male) - ma_5530: 2 speakers (all female) - ma_5542: 2 speakers (male and female) - ma_5560: 2 speakers (male and female) - ma_5636: 2 speakers (all male) - ma_5653: 2 speakers (all male) - ma_5673: 2 speakers (all male) - ma_5690: 2 speakers (male and female) - ma_5715: 2 speakers (male and female) - ma_5716: >2 speakers (male and female) - ma_5731: >2 speakers (male and female) - ma_5750: 2 speakers (male and female) - ma_5773: 2 speakers (all male) - ma_5779: 2 speakers (all female) - ma_5784: 2 speakers (all male) - ma_5800: 2 speakers (male and female) - ma_5806: 2 speakers (all female) - ma_5812: 2 speakers (all female) - ma_5821: 2 speakers (male and female) - ma_5824: 2 speakers (all male) - ma_5904: >2 speakers (male and female) - ma_5905: 2 speakers (all male) - ma_5906: 2 speakers (all female) - ma_5916: 2 speakers (all female) - ma_5930: 2 speakers (all female) - ma_5949: 2 speakers (all female) - ma_5973: 2 speakers (all female) - ma_5975: 2 speakers (male and female) - ma_5982: 2 speakers (male and female)