TREC Spanish

Item Name: TREC Spanish
Author(s): Willie Rogers
LDC Catalog No.: LDC2000T51
ISBN: 1-58563-177-9
ISLRN: 445-901-162-731-2
Member Year(s): 2000
DCMI Type(s): Text
Data Source(s): newswire
Project(s): TREC, TIDES, GALE
Application(s): information retrieval
Language(s): Spanish
Language ID(s): spa
License(s): TREC Spanish Agreement
Online Documentation: LDC2000T51 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Rogers, Willie. TREC Spanish LDC2000T51. Web Download. Philadelphia: Linguistic Data Consortium, 2000.

Introduction

TREC Spanish was developed by the Linguistic Data Consortium (LDC). It consists of Spanish newswire data from Agence France Presse and El Norte that was used in the Spanish tasks of the Text REtrieval Conference (TREC), sponsored by the National Institute of Standards and Technology (NIST), specifically in TREC-3, TREC-4 and TREC-5.

Data

The El Norte material (250 megabytes) was used in TREC-3 and TREC-4; the Agence France Presse documents (300 megabytes) were used in TREC-5. The text has been formatted to include TREC document IDs.
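
As an illustration of how the document IDs might be read, the following is a minimal Python sketch. It assumes the files follow the SGML-style layout common to TREC collections, with each document wrapped in <DOC> ... </DOC> and its ID in a <DOCNO> element; this page does not confirm the exact markup, so check it against the corpus documentation. The function name iter_trec_docs and the sample IDs are hypothetical.

```python
import re
from typing import Iterator, Tuple

# Assumed layout: <DOC> ... <DOCNO> id </DOCNO> ... </DOC> (not confirmed by this page).
DOC_RE = re.compile(r"<DOC>(.*?)</DOC>", re.DOTALL | re.IGNORECASE)
DOCNO_RE = re.compile(r"<DOCNO>\s*(.*?)\s*</DOCNO>", re.DOTALL | re.IGNORECASE)


def iter_trec_docs(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (document ID, raw document body) pairs from TREC-style text."""
    for match in DOC_RE.finditer(text):
        body = match.group(1)
        docno = DOCNO_RE.search(body)
        if docno is None:
            # Skip documents without an ID rather than guessing one.
            continue
        yield docno.group(1), body.strip()


# Invented sample data for illustration only; not taken from the corpus.
sample = """<DOC>
<DOCNO> EXAMPLE-0001 </DOCNO>
<TEXT>
Texto de ejemplo.
</TEXT>
</DOC>
<DOC>
<DOCNO> EXAMPLE-0002 </DOCNO>
<TEXT>
Otro documento.
</TEXT>
</DOC>
"""

doc_ids = [doc_id for doc_id, _ in iter_trec_docs(sample)]
print(doc_ids)
```

When reading the real files, the character encoding would need to be chosen to match the distribution, since it is not stated here.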

Further information about TREC-5 is available from the NIST TREC-5 website.

Updates

There are no updates at this time.
