AIDA Scenario 1 and 2 Reference Knowledge Base
|Item Name:||AIDA Scenario 1 and 2 Reference Knowledge Base|
|Author(s):||Jennifer Tracey, Stephanie Strassel, Jeremy Getman, Ann Bies, Kira Griffitt, David Graff, Christopher Caruso|
|LDC Catalog No.:||LDC2023T10|
|Release Date:||October 16, 2023|
|Data Source(s):||government documents, web collection|
|Application(s):||entity extraction, information extraction, knowledge base population|
LDC User Agreement for Non-Members
|Online Documentation:||LDC2023T10 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Tracey, Jennifer, et al. AIDA Scenario 1 and 2 Reference Knowledge Base LDC2023T10. Web Download. Philadelphia: Linguistic Data Consortium, 2023.|
AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually-created KB entries developed specifically for AIDA data.
The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.
Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The socioeconomic and political crisis in Venezuela since 2010 was the scenario in Phase 2.
This knowledge base supported the AIDIA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot).
There are four inputs to the KB: GPE and LOC entities from GeoNames (GEO), PER entities from the CIA World Leaders List (WLL), ORG entities from Appendix B of the CIA World Factbook (APB), and additional entities manually created by LDC. The GEO, WLL and APB entries are also found in LORELEI Entity Detection and Linking Knowledge Base (LDC2010T10).
This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013.
Please view the following samples:
None at this time.