Translanguage English Database (TED) Transcripts

Item Name: Translanguage English Database (TED) Transcripts
Author(s): A Kipp, L Lamel, J Mariani, F Schiel, N Martey, D A. Miller, K T. Jones, S Dunn, R Markoff
LDC Catalog No.: LDC2002T03
ISBN: 1-58563-202-3
ISLRN: 502-719-830-448-5
DOI: https://doi.org/10.35111/1hgg-8891
Member Year(s): 2002
DCMI Type(s): Text
Data Source(s): microphone speech
Application(s): speech recognition
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2002T03 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Kipp, A, et al. Translanguage English Database (TED) Transcripts LDC2002T03. Web Download. Philadelphia: Linguistic Data Consortium, 2002.
Related Works: View

Introduction

Translanguage English Database (TED) Transcripts consists of transcripts of presentations by 39 native English and non-native English speakers at the Third European Conference on Speech Communication and Technology, EUROSPEECH 1993 in Berlin, Germany. This is a joint publication with the European Language Resources Association (ELRA) sponsored in part by National Science Foundation Grant No. IIS-9982201. The data set is released by ELRA as Translanguage English Database (TED) Transcripts database (ELRA-S0120).  

Data

The transcripts in this release were developed by the Linguistic Data Consortium and are a subset of the speech recordings in Translanguage English Database (TED) Speech LDC2002S04 and ELRA publication ELRA-S0031

The transcripts are in Universal Transcription Format (UTF). All UTF files were validated against a utf.dtd. Tables containing speaker demographic information and cross-references of file names from the TED audio corpus are included this release. A transcript sample is available here.

Updates

There are no updates at this time

Available Media

View Fees





Login for the applicable fee