FORM2 Kinematic Gesture
|Item Name:||FORM2 Kinematic Gesture|
|Author(s):||Craig Martell, Paul Howard, Chris Osborn, Lisa Britt, Kari Myers|
|LDC Catalog No.:||LDC2003V01|
|Release Date:||July 28, 2003|
|Data Source(s):||field recordings|
|Application(s):||natural language processing, information extraction, gesture synthesis, gesture recognition|
LDC User Agreement for Non-Members
|Online Documentation:||LDC2003V01 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Martell, Craig, et al. FORM2 Kinematic Gesture LDC2003V01. CD. Philadelphia: Linguistic Data Consortium, 2003.|
FORM2 Kinematic Gesture was produced by the Linguistic Data Consortium.
FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of location and movement of the right and left arms. Other kinematic information such as effort and shape are also recorded.
There are a total of 24 data files: eight movie files, eight Anvil files, and eight Form files.
The movie files represent 12 minutes of audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. These video recordings were chosen because they are part of the NSF-funded Talkbank project.
The video format is as follows:
|Size||360 x 240 pixels|
|Data rate||696 K/sec|
|Video rate||29.82 fps|
|Audio rate||48.000 kHz|
|Audio format||8-bit stereo|
The gesture annotations were created using the FORM 2.0 tag set. The Anvil annotation files used in their creation, augmented with FORM 1.0 data, are also included. (FORM1 data will be the subject of a separate publication to be released in the near future). FORM1 values that are not included in the FORM2 spec are not included in the publication. A full description of the FORM tag set with explanations of each value can be found in the documentation.
This research was conducted using funding from the following grant sources: ISLE - 9910603 NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056 NSF: Discourse and Gesture - EIA98-09209
There are no updates available at this time.
The cost of the first 50 copies of this publication (not counting the copies distributed to LDC members) is covered by the sponsoring grants, and therefore free of charge to qualified researchers; a $30 shipping and handling fee applies. After these first 50 copies are distributed, additional copies will be available for the production cost of $500 per CD.