1. Publication title: Chinese Discourse TreeBank 0.5 (CDTB 0.5)

2. Authors: Yuping Zhou, Jill Lu, Jennifer Zhang, Nianwen Xue
   Contact: Nianwen Xue

3. Data type: text

4. Genres: Newswire

5. Project: the Chinese Treebank Project (http://www.cs.brandeis.edu/~clp/cdtb)

6. Applications: natural language processing, linguistic analysis

7. Language: Chinese

8. Special license: None

9. Grant numbers and funding agencies: This research is supported by the IIS Division of the National Science Foundation via Grant No. 0910532, entitled "Richer Representations for Machine Translation", and by the CNS Division via Grant No. 0855184, entitled "Building a community resource for temporal inference in Chinese". Jill Lu and Jennifer Zhang contributed to the annotation of the project.

10. Copyright: Portions Copyright 1994-1998, Xinhua News Agency

11. Description of the corpus structure and data attributes: There are 164 documents in this release, containing 73,314 words and 2,891 sentences. There are 5,534 annotation instances in total. The annotation files contain token offsets that point to text spans in their corresponding raw text files. The content of the discourse annotation is described in the enclosed paper and annotation guidelines.

12. Quality control: The data is double-annotated and then adjudicated.

13. References:
   Yuping Zhou and Nianwen Xue. 2012. PDTB-style discourse annotation of Chinese text. In Proceedings of ACL-2012. Jeju Island, Korea.
   Yuping Zhou and Nianwen Xue. Submitted. The Chinese Discourse TreeBank: A Chinese corpus annotated with discourse relations. Submitted to Journal of Language Resources and Evaluation.
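
Illustration for item 11: the annotation files refer to text spans by token offset. This README does not state the offset convention or the file layout, so the sketch below is only a rough illustration under stated assumptions: the raw text is already split into a list of tokens, and offsets are zero-based with an exclusive end. The function name resolve_span and the sample tokens are hypothetical and are not part of the release. Consult the enclosed annotation guidelines for the actual convention.

```python
def resolve_span(tokens, start, end):
    """Return the text covered by tokens[start:end].

    Assumption (not confirmed by the release): offsets are zero-based
    token indices and `end` is exclusive.
    """
    if not (0 <= start < end <= len(tokens)):
        raise ValueError(f"offsets ({start}, {end}) fall outside the token list")
    # Chinese text is written without spaces, so tokens are joined directly.
    return "".join(tokens[start:end])


# Hypothetical sample sentence, already segmented into tokens.
tokens = ["中国", "经济", "持续", "增长"]

# Hypothetical annotation pointing at the second and third tokens.
span_text = resolve_span(tokens, 1, 3)
print(span_text)
```

If the release turns out to use inclusive end offsets or one-based indices, only the slice bounds inside resolve_span would need to change.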