Mandarin Chinese News Text
Item Name: | Mandarin Chinese News Text |
Author(s): | Zhibiao Wu |
LDC Catalog No.: | LDC95T13 |
ISBN: | 1-58563-052-7 |
ISLRN: | 133-578-348-091-2 |
DOI: | https://doi.org/10.35111/ajd2-0b82 |
Member Year(s): | 1995, 1996, 1997 |
DCMI Type(s): | Text |
Data Source(s): | newswire |
Project(s): | TREC, Tipster, TIDES, GALE, EARS |
Application(s): | language modeling, information retrieval |
Language(s): | Mandarin Chinese |
Language ID(s): | cmn |
License(s): |
Mandarin Chinese News Text Agreement |
Online Documentation: | LDC95T13 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Wu, Zhibiao. Mandarin Chinese News Text LDC95T13. Web Download. Philadelphia: Linguistic Data Consortium, 1995. |
Related Works: | View |
The Linguistic Data Consortium (LDC) announces the availability of a Mandarin Chinese text corpus. This corpus includes about 250 million GB-encoded text characters.
The Mandarin News Corpus includes text from various journalistic sources:
- newspaper text from Renmin Ribao (People's Daily)
- radio scripts from China Radio International
- newswire text from Xinhua newswire service