FORM2 Kinematic Gesture


Item Name: FORM2 Kinematic Gesture
Authors: Craig Martell, Paul Howard, Chris Osborn, Lisa Britt, and Kari Myers
LDC Catalog No.: LDC2003V01
ISBN: 1-58563-269-4
Release Date: Jul 28, 2003
Data Type: video
Data Source(s): field recordings
Project(s): TalkBank
Application(s): gesture recognition, gesture synthesis, information extraction, natural language processing
Language(s): English
Language ID(s): eng
Distribution: 1 CD
Member fee: $0 for 2003 members
Non-member Fee: US $500.00
Reduced-License Fee: US $250.00
Extra-Copy Fee: US $150.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Craig Martell, et al. 2003. FORM2 Kinematic Gesture. Linguistic Data Consortium, Philadelphia.

Introduction

FORM2 Kinematic Gesture was produced by the Linguistic Data Consortium (LDC) and is identified by catalog number LDC2003V01 and ISBN 1-58563-269-4.

FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of the location and movement of the right and left arms. Other kinematic information, such as effort and shape, is also recorded.

Please go to the FORM website for more information.

Data

There are a total of 24 data files: eight movie files, eight Anvil files, and eight FORM files.

The movie files represent 12 minutes of audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. These video recordings were chosen because they are part of the NSF-funded TalkBank project.

The video format is as follows:

Size: 360 x 240 pixels
Compression: H.261
Data rate: 696 K/sec
Video rate: 29.82 fps
Audio rate: 48.000 kHz
Audio format: 8-bit stereo
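
As a rough sanity check on these figures, the sketch below derives approximate totals for the 12 minutes of video from the stated frame rate and data rate. It assumes that "K" means kilobytes of 1024 bytes and that the data rate applies to the whole recording; the source states neither. The function name and dictionary keys are illustrative only, and the per-file value is just a mean, since the lengths of the individual movie files are not given here.

```python
def estimate_video_totals(minutes=12, fps=29.82, k_per_sec=696, movie_files=8):
    """Derive approximate totals from the stated video format figures."""
    # Assumption: 1 K = 1024 bytes; the source does not define the unit.
    bytes_per_k = 1024

    total_seconds = minutes * 60
    total_frames = round(total_seconds * fps)
    total_bytes = total_seconds * k_per_sec * bytes_per_k

    return {
        "total_seconds": total_seconds,
        "total_frames": total_frames,
        "total_mib": total_bytes / (1024 * 1024),
        # Mean only; actual per-file durations are not stated in the source.
        "mean_seconds_per_file": total_seconds / movie_files,
    }


totals = estimate_video_totals()
for name, value in totals.items():
    print(f"{name}: {value}")
```

Under these assumptions the video amounts to roughly 21,470 frames and about 489 MiB, which is in line with distribution on a single CD.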

The gesture annotations were created using the FORM 2.0 tag set. The Anvil annotation files used in their creation, augmented with FORM 1.0 data, are also included. (FORM 1.0 data will be the subject of a separate publication to be released in the near future.) FORM 1.0 values that are not part of the FORM 2.0 specification are omitted from this publication. A full description of the FORM tag set, with explanations of each value, can be found in the documentation.

Sponsorship

This research was conducted using funding from the following grant sources:

ISLE - 9910603
NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056
NSF: Discourse and Gesture - EIA98-09209

Updates

There are no updates available at this time.

Note

The cost of the first 50 copies of this publication (not counting the copies distributed to LDC members) is covered by the sponsoring grants, so those copies are free of charge to qualified researchers, although a $30 shipping and handling fee applies. After these first 50 copies are distributed, additional copies will be available for the production cost of $500 per CD.

Content Copyright

Portions © 2003 Trustees of the University of Pennsylvania, © 2000 Brian MacWhinney