KAIROS Phase 1 Quizlet
Item Name: | KAIROS Phase 1 Quizlet |
Author(s): | Song Chen, Ann Bies, Justin Mott, Christopher Caruso, Jennifer Tracey, Stephanie Strassel |
LDC Catalog No.: | LDC2025T11 |
ISLRN: | 357-044-554-407-1 |
DOI: | https://doi.org/10.35111/rcba-vb61 |
Release Date: | August 15, 2025 |
Member Year(s): | 2025 |
DCMI Type(s): | Image, MovingImage, Software, Sound, StillImage, Text |
Data Source(s): | web collection |
Project(s): | KAIROS |
Application(s): | event detection, information extraction, knowledge representation |
Language(s): | English, Spanish |
Language ID(s): | eng, spa |
License(s): | LDC User Agreement for Non-Members |
Online Documentation: | LDC2025T11 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Chen, Song, et al. KAIROS Phase 1 Quizlet LDC2025T11. Web Download. Philadelphia: Linguistic Data Consortium, 2025. |
Introduction
KAIROS Phase 1 Quizlet was developed by the Linguistic Data Consortium (LDC). It contains English and Spanish text, video and image data and annotations used for pre-evaluation research and system development during Phase 1 of the DARPA KAIROS program.
KAIROS Quizlets were a series of narrowly defined tasks designed to explore specific evaluation objectives, enabling KAIROS system developers to exercise individual system components on a small data set prior to the full program evaluation. This corpus contains the complete set of Quizlet data used in Phase 1, which focused on two real-world complex events (CEs) within the Improvised Explosive Device bombing scenario: CE1001 (2018 Caracas drone attack) and CE1002 (Utah High School backpack bombing).
The DARPA KAIROS (Knowledge-directed Artificial Intelligence Reasoning Over Schemas) program aimed to build technology capable of understanding and reasoning about complex real-world events in order to provide actionable insights to end users. KAIROS systems utilized formal event representations in the form of schema libraries that specified the steps, preconditions and constraints for an open set of complex events; schemas were then used in combination with event extraction to characterize and make predictions about real-world events in a large multilingual, multimedia corpus.
Data
Four quizlets were developed in Phase 1. In addition to the source documents, this release contains the contents of Quizlet 3 (graph G annotation generated with manual annotation) and Quizlet 4 (source documents, manual annotation, updated graph G). Quizlet 1 (evaluation task introduction) did not require data or annotation and is not included in this release. Quizlet 2 (schema generation and instantiation) used source documents but did not include annotation.
Source data was collected from the web; 30 root web pages were collected and processed, yielding 29 text data files, 216 image files and 5 video files. Annotation steps included labeling scenario-relevant events and relations for each document to develop a structured representation of temporally ordered events, relations and arguments and to generate a reference knowledge graph.
Source data is presented in various formats: .gif, .jpg, .ltf, .mp4, .png, .psm, and .svg. Annotations are presented as tab-separated files (.tab) for temporal ordering, relations, events, and arguments.
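Because the annotations are distributed as tab-separated files, they can be read with a standard delimited-text parser. The following is a minimal sketch only: it assumes each .tab file begins with a header row, and the column names (`event_id`, `type`, `doc_id`) and the sample rows are hypothetical placeholders, not the actual schema of this release. Consult the corpus documentation for the real field layout.

```python
import csv
import io


def read_tab(handle):
    """Parse tab-separated text with a header row into a list of dicts.

    Assumption: the first line holds column names. Quoting is disabled so
    that quote characters inside annotation text are kept literally.
    """
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


# Hypothetical in-memory sample standing in for an annotation file;
# the column names and values below are illustrative, not from the corpus.
sample = (
    "event_id\ttype\tdoc_id\n"
    "EV001\tAttack\tDOC01\n"
    "EV002\tInjure\tDOC01\n"
)

rows = read_tab(io.StringIO(sample))
for row in rows:
    print(row["event_id"], row["type"], row["doc_id"])
```

To read a file from disk instead, the same function could be passed a handle opened with `open(path, encoding="utf-8", newline="")`, where `path` points to one of the .tab files.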
Sponsorship
KAIROS was sponsored by the Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. HR0011-19-S-0014.
Updates
No updates at this time.