FORM2 Kinematic Gesture
Item Name: | FORM2 Kinematic Gesture |
Author(s): | Craig Martell, Paul Howard, Chris Osborn, Lisa Britt, Kari Myers |
LDC Catalog No.: | LDC2003V01 |
ISBN: | 1-58563-269-4 |
ISLRN: | 685-159-396-611-7 |
DOI: | https://doi.org/10.35111/ge24-bd53 |
Release Date: | July 28, 2003 |
Member Year(s): | 2003 |
DCMI Type(s): | MovingImage, Text |
Data Source(s): | field recordings |
Project(s): | Talkbank |
Application(s): | gesture recognition, gesture synthesis, information extraction, natural language processing |
Language(s): | English |
Language ID(s): | eng |
License(s): |
LDC User Agreement for Non-Members |
Online Documentation: | LDC2003V01 Documents |
Licensing Instructions: | Subscription & Standard Members, and Non-Members |
Citation: | Martell, Craig, et al. FORM2 Kinematic Gesture LDC2003V01. Web Download. Philadelphia: Linguistic Data Consortium, 2003. |
Related Works: | View |
Introduction
FORM2 Kinematic Gesture was produced by the Linguistic Data Consortium (LDC) and contains eight video files totalling 12 min and associated annotations.
FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of location and movement of the right and left arms. Other kinematic information such as effort and shape are also recorded.
Data
There are a total of 24 data files: eight movie files, eight Anvil files, and eight Form files.
The movie files represent 12 minutes of audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. These video recordings were chosen because they are part of the NSF-funded Talkbank project.
The video format is as follows:
Size | 360 x 240 pixels |
---|---|
Compression | H.261 |
Data rate | 696 K/sec |
Video rate | 29.82 fps |
Audio rate | 48.000 kHz |
Audio format | 8-bit stereo |
The gesture annotations were created using the FORM 2.0 tag set. The Anvil annotation files used in their creation, augmented with FORM 1.0 data, are also included. (FORM1 data will be the subject of a separate publication to be released in the near future). FORM1 values that are not included in the FORM2 spec are not included in the publication. A full description of the FORM tag set with explanations of each value can be found in the documentation.
Samples
For an example of the data in this corpus, please view these video (MOV) and annotation (ANVIL) samples.
Sponsorship
This research was conducted using funding from the following grant sources: ISLE - 9910603 NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056 NSF: Discourse and Gesture - EIA98-09209
Updates
There are no updates available at this time.
Note
The cost of the first 50 copies of this publication (not counting the copies distributed to LDC members) is covered by the sponsoring grants, and therefore free of charge to qualified researchers; a $30 shipping and handling fee applies. After these first 50 copies are distributed, additional copies will be available for the production cost of $500.