AIDA Scenario 1 and 2 Reference Knowledge Base

Item Name: AIDA Scenario 1 and 2 Reference Knowledge Base
Author(s): Jennifer Tracey, Stephanie Strassel, Jeremy Getman, Ann Bies, Kira Griffitt, David Graff, Christopher Caruso
LDC Catalog No.: LDC2023T10
ISLRN: 644-411-403-964-6
Release Date: October 16, 2023
Member Year(s): 2023
DCMI Type(s): Text
Data Source(s): government documents, web collection
Project(s): AIDA, LORELEI
Application(s): entity extraction, information extraction, knowledge base population
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2023T10 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Tracey, Jennifer, et al. AIDA Scenario 1 and 2 Reference Knowledge Base LDC2023T10. Web Download. Philadelphia: Linguistic Data Consortium, 2023.
Related Works: View


AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually-created KB entries developed specifically for AIDA data.

The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.

Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The socioeconomic and political crisis in Venezuela since 2010 was the scenario in Phase 2.


This knowledge base supported the AIDIA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot).

There are four inputs to the KB: GPE and LOC entities from GeoNames (GEO), PER entities from the CIA World Leaders List (WLL), ORG entities from Appendix B of the CIA World Factbook (APB), and additional entities manually created by LDC. The GEO, WLL and APB entries are also found in LORELEI Entity Detection and Linking Knowledge Base (LDC2010T10).


This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013.


Please view the following samples:


None at this time.

Available Media

View Fees

Login for the applicable fee