|Author(s):||Alvin Martin, Jack Godfrey, Ed Holliman, Mark Przybocki|
|LDC Catalog No.:||LDC94S15|
|Member Year(s):||1994, 1997|
|Sample Type:||2-channel ulaw|
|Data Source(s):||telephone speech|
LDC User Agreement for Non-Members
|Online Documentation:||LDC94S15 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Martin, Alvin, et al. SPIDRE LDC94S15. Web Download. Philadelphia: Linguistic Data Consortium, 1994.|
This subset of the SWITCHBOARD collection (see above), selected for speaker ID research and with special attention to telephone instrument variation. It contains training and testing data for experiments in closed or open set recognition or verification. Combining the two sides of the conversations also permits speaker change detection, or speaker monitoring, experiments.
There are 45 "target" speakers; four conversations from each target are included, of which two are from the same handset. There are also 100 calls in which no target appears. Since all conversations are two-sided, this results in 180 target sides and 180 + 200 = 380 nontarget sides.
Except for truncations of a few longer calls at five minutes, the call themselves are as described under SWITCHBOARD.