Portuguese Newswire Text
|Item Name:||Portuguese Newswire Text|
|Author(s):||Jonathan Wright, David Graff|
|LDC Catalog No.:||LDC99T40|
|Application(s):||information retrieval, language modeling|
LDC User Agreement for Non-Members
|Online Documentation:||LDC99T40 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Wright, Jonathan, and David Graff. Portuguese Newswire Text LDC99T40. Web Download. Philadelphia: Linguistic Data Consortium, 1999.|
This corpus builds on the Portuguese data published previously in the European Language Newswire Text Corpus and contains the previously published material, as well as more recent material.
The data in this corpus comes from Agence France Presse from May 13, 1994 through December 31, 1998 (June 27, 1996 - December 31, 1998 was previously unpublished by the LDC). The data has been tagged using SGML to identify article boundaries.