FORM2 Kinematic Gesture

Item Name: FORM2 Kinematic Gesture
Author(s): Craig Martell, Paul Howard, Chris Osborn, Lisa Britt, Kari Myers
LDC Catalog No.: LDC2003V01
ISBN: 1-58563-269-4
ISLRN: 685-159-396-611-7
Release Date: July 28, 2003
Member Year(s): 2003
DCMI Type(s): MovingImage
Data Source(s): field recordings
Project(s): Talkbank
Application(s): natural language processing, information extraction, gesture synthesis, gesture recognition
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2003V01 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Martell, Craig, et al. FORM2 Kinematic Gesture LDC2003V01. Web Download. Philadelphia: Linguistic Data Consortium, 2003.
Related Works: View


FORM2 Kinematic Gesture was produced by the Linguistic Data Consortium.

FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of location and movement of the right and left arms. Other kinematic information such as effort and shape are also recorded.


There are a total of 24 data files: eight movie files, eight Anvil files, and eight Form files.

The movie files represent 12 minutes of audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. These video recordings were chosen because they are part of the NSF-funded Talkbank project.

The video format is as follows:

Size 360 x 240 pixels
Compression H.261
Data rate 696 K/sec
Video rate 29.82 fps
Audio rate 48.000 kHz
Audio format 8-bit stereo

The gesture annotations were created using the FORM 2.0 tag set. The Anvil annotation files used in their creation, augmented with FORM 1.0 data, are also included. (FORM1 data will be the subject of a separate publication to be released in the near future). FORM1 values that are not included in the FORM2 spec are not included in the publication. A full description of the FORM tag set with explanations of each value can be found in the documentation.


This research was conducted using funding from the following grant sources: ISLE - 9910603 NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056 NSF: Discourse and Gesture - EIA98-09209


There are no updates available at this time.


The cost of the first 50 copies of this publication (not counting the copies distributed to LDC members) is covered by the sponsoring grants, and therefore free of charge to qualified researchers; a $30 shipping and handling fee applies. After these first 50 copies are distributed, additional copies will be available for the production cost of $500.

Available Media

View Fees

Login for the applicable fee