Translanguage English Database (TED) Transcripts
Item Name: | Translanguage English Database (TED) Transcripts |
Author(s): | A Kipp, L Lamel, J Mariani, F Schiel, N Martey, D A. Miller, K T. Jones, S Dunn, R Markoff |
LDC Catalog No.: | LDC2002T03 |
ISBN: | 1-58563-202-3 |
ISLRN: | 502-719-830-448-5 |
DOI: | https://doi.org/10.35111/1hgg-8891 |
Member Year(s): | 2002 |
DCMI Type(s): | Text |
Data Source(s): | microphone speech |
Application(s): | speech recognition |
Language(s): | English |
Language ID(s): | eng |
License(s): | LDC User Agreement for Non-Members |
Online Documentation: | LDC2002T03 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Kipp, A, et al. Translanguage English Database (TED) Transcripts LDC2002T03. Web Download. Philadelphia: Linguistic Data Consortium, 2002. |
Introduction
Translanguage English Database (TED) Transcripts consists of transcripts of presentations given by 39 speakers, both native and non-native speakers of English, at the Third European Conference on Speech Communication and Technology (EUROSPEECH 1993) in Berlin, Germany. This is a joint publication with the European Language Resources Association (ELRA), sponsored in part by National Science Foundation Grant No. IIS-9982201. ELRA releases the data set as Translanguage English Database (TED) Transcripts (ELRA-S0120).
Data
The transcripts in this release were developed by the Linguistic Data Consortium and correspond to a subset of the speech recordings in Translanguage English Database (TED) Speech LDC2002S04 and ELRA publication ELRA-S0031.
The transcripts are in Universal Transcription Format (UTF). All UTF files were validated against a utf.dtd. Tables containing speaker demographic information and cross-references of file names from the TED audio corpus are included in this release. A transcript sample is available here.
Updates
There are no updates at this time.