Message Understanding Conference (MUC) 6
Item Name: | Message Understanding Conference (MUC) 6 |
Author(s): | Nancy Chinchor, Beth Sundheim |
LDC Catalog No.: | LDC2003T13 |
ISBN: | 1-58563-239-2 |
ISLRN: | 402-267-910-068-8 |
DOI: | https://doi.org/10.35111/wbcc-y063 |
Release Date: | August 22, 2003 |
Member Year(s): | 2003 |
DCMI Type(s): | Text |
Data Source(s): | newswire |
Project(s): | MUC |
Application(s): | message understanding |
Language(s): | English |
Language ID(s): | eng |
License(s): | LDC User Agreement for Non-Members |
Online Documentation: | LDC2003T13 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Chinchor, Nancy, and Beth Sundheim. Message Understanding Conference (MUC) 6 LDC2003T13. Web Download. Philadelphia: Linguistic Data Consortium, 2003. |
Introduction
Message Understanding Conference (MUC) 6 was produced by the Linguistic Data Consortium (LDC) and contains 318 annotated Wall Street Journal (WSJ) articles, together with the scoring software and corresponding documentation used in the MUC 6 evaluation.
In the 1990s, the MUC evaluations funded the development of metrics and statistical algorithms to support government evaluations of emerging information extraction technologies. Additional information from NIST can be found at http://www.itl.nist.gov/iaui/894.02/related_projects/muc.
Data
In addition to the 318 WSJ articles in the main directory, this corpus also contains 460 articles in concatenated files for dry run and formal testing and training, answer keys, and scorer configuration files.
Replicating the evaluation requires both this MUC 6 corpus and Message Understanding Conference (MUC) 6 Additional News Text (LDC96T10). All materials are published as received from the corpus creators, with no quality control performed at LDC; the only change is that the files have been uncompressed.
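For readers who want to work with the concatenated files article by article, the following is a minimal sketch of one way to split them. It assumes each article is wrapped in `<DOC>` ... `</DOC>` tags, which this description does not confirm; the function name `split_articles` is hypothetical, and the delimiter should be checked against the actual files and the accompanying documentation before use.

```python
import re
from typing import List

# ASSUMPTION: each article in a concatenated file is delimited by
# <DOC> ... </DOC>. This is not stated in the corpus description;
# adjust the pattern if the files use a different delimiter.
DOC_PATTERN = re.compile(r"<DOC>.*?</DOC>", re.DOTALL | re.IGNORECASE)


def split_articles(concatenated_text: str) -> List[str]:
    """Return each <DOC>...</DOC> span found in the text, in file order."""
    return [match.group(0) for match in DOC_PATTERN.finditer(concatenated_text)]
```

A caller would read one concatenated file into a string and pass it to `split_articles`; comparing the number of returned articles with the counts given above is a quick check that the assumed delimiter is right.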
Samples
Please view this text sample.
Updates
August 20th, 2003: What was formerly published as MUC VI Text Collection (LDC96T10) was renamed MUC 6 Additional News Text, because LDC96T10 consists only of additional training materials.
Copyright
Portions © 1986-1994 Dow Jones & Company, Inc.
RESTRICTED RIGHTS LEGEND: INFORMATION FROM THE WALL STREET JOURNAL AND/OR THE DOW JONES NEWS SERVICE CONTAINED HEREIN IS THE PROPERTY OF DOW JONES & COMPANY, INC. AND IS PROTECTED BY COPYRIGHT. USE, DUPLICATION OR DISCLOSURE BY YOU IS SUBJECT TO THE RESTRICTIONS SET FORTH IN THE USER AGREEMENT DELIVERED TO YOU BY THE LINGUISTIC DATA CONSORTIUM OF THE UNIVERSITY OF PENNSYLVANIA. COPYRIGHT 1986-1994 DOW JONES & COMPANY, INC. ALL RIGHTS RESERVED.