English Chinese Translation Treebank v 1.0

Item Name: English Chinese Translation Treebank v 1.0
Author(s): Ann Bies, Martha Palmer, Justin Mott, Colin Warner
LDC Catalog No.: LDC2007T02
ISBN: 1-58563-408-5
ISLRN: 877-578-293-641-1
DOI: https://doi.org/10.35111/jn2g-zd52
Release Date: January 22, 2007
Member Year(s): 2007
DCMI Type(s): Text
Data Source(s): newswire
Project(s): TIDES
Application(s): tagging, parsing, natural language processing, machine translation
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2007T02 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Bies, Ann, et al. English Chinese Translation Treebank v 1.0 LDC2007T02. Web Download. Philadelphia: Linguistic Data Consortium, 2007.


This release of English Chinese Translation Treebank v 1.0 consists of 146,300 words in 325 files. Each file is an individual news story from Xinhua News Agency (corresponding to the Xinhua data in Chinese Treebank 5.0, LDC2005T01) that has been translated into English, part-of-speech tagged, and treebanked. The files are compressed with gzip.
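Because the files are gzip-compressed, they can be read directly without unpacking them first. The following is a minimal sketch, not part of the release. It assumes the trees use Penn-Treebank-style bracketing such as `(NNP Xinhua)`, that the text is UTF-8 (or plain ASCII), and that the files end in `.gz`; none of these details are stated in this description. The names `tagged_tokens`, `read_gz`, and `count_words` are hypothetical helpers.

```python
import gzip
import re
from pathlib import Path

# Assumption: leaves look like "(TAG token)", as in Penn-Treebank-style bracketing.
LEAF = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")


def tagged_tokens(text):
    """Return (word, tag) pairs from bracketed text, skipping -NONE- empty elements."""
    return [(word, tag) for tag, word in LEAF.findall(text) if tag != "-NONE-"]


def read_gz(path, encoding="utf-8"):
    """Read a gzip-compressed text file; the encoding is an assumption."""
    with gzip.open(path, "rt", encoding=encoding) as fh:
        return fh.read()


def count_words(root):
    """Count tagged tokens in every *.gz file under root (the extension is assumed)."""
    return sum(len(tagged_tokens(read_gz(p))) for p in Path(root).rglob("*.gz"))
```

A count produced this way includes punctuation tokens, so it need not match the 146,300 words reported for the release.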

The source files for the treebank annotation contain the final updated translation of these files. Translation errors that prevented complete treebank annotation have been corrected. The translation and annotation were completed in October 2004 and supersede any earlier translation.

This publication was compiled under National Science Foundation Grant #IIS-0325646.


For an example of the data in this publication, please view this sample.
