Message Understanding Conference (MUC) 6 Additional News Text was produced by Linguistic Data Consortium (LDC) catalog number LDC96T10 and ISBN 1-58563-105-1.
In the 1990s, the MUC evaluations funded the development of metrics and statistical algorithms to support government evaluations of emerging information extraction technologies. Additional information from NIST can be found at http://www.itl.nist.gov/iaui/894.02/related_projects/muc.
This corpus contains additional training data, which had been tagged, but not annotated. Both the MUC 6 and the MUC 6 Additional News Text are necessary in order to replicate the evaluation. All the materials are published as received from the corpus creators, without any quality control being done at the LDC (the only difference is that the files have been uncompressed).
Updates August 20th, 2003: This corpus was formerly published under the name "MUC VI Text Collection." The more suitable name of "Message Understanding Conference (MUC) 6 Additional News Text" was adopted when MUC 6 (LDC2003T13), the main corpus containing the evaluation materials, was published in 2003.
RESTRICTED RIGHTS LEGEND: INFORMATION FROM THE WALL STREET JOURNAL AND/OR THE DOW JONES NEWS SERVICE CONTAINED HEREIN IS THE PROPERTY OF DOW JONES & COMPANY, INC. AND IS PROTECTED BY COPYRIGHT. USE, DUPLICATION OR DISCLOSURE BY YOU IS SUBJECT TO THE RESTRICTIONS SET FORTH IN THE USER AGREEMENT DELIVERED TO YOU BY THE LINGUISTIC DATA CONSORTIUM OF THE UNIVERSITY OF PENNSYLVANIA. COPYRIGHT 1993-1994 DOW JONES & COMPANY, INC. ALL RIGHTS RESERVED.
Portions © 1993-1994 Dow Jones & Company, Inc., © 1994 Reuters America Inc., © 1996 Trustees of the University of Pennsylvania