Home › Language Resources › Data

FORM2 Kinematic Gesture

Item Name:	FORM2 Kinematic Gesture
Author(s):	Craig Martell, Paul Howard, Chris Osborn, Lisa Britt, Kari Myers
LDC Catalog No.:	LDC2003V01
ISBN:	1-58563-269-4
ISLRN:	685-159-396-611-7
DOI:	https://doi.org/10.35111/ge24-bd53
Release Date:	July 28, 2003
Member Year(s):	2003
DCMI Type(s):	MovingImage, Text
Data Source(s):	field recordings
Project(s):	Talkbank
Application(s):	gesture recognition, gesture synthesis, information extraction, natural language processing
Language(s):	English
Language ID(s):	eng
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC2003V01 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Martell, Craig, et al. FORM2 Kinematic Gesture LDC2003V01. Web Download. Philadelphia: Linguistic Data Consortium, 2003.
Related Works: Hide	View isVersionOf LDC2004V01 FORM1 Kinematic Gesture

Introduction

FORM2 Kinematic Gesture was produced by the Linguistic Data Consortium (LDC) and contains eight video files totalling 12 min and associated annotations.

FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of location and movement of the right and left arms. Other kinematic information such as effort and shape are also recorded.

Data

There are a total of 24 data files: eight movie files, eight Anvil files, and eight Form files.

The movie files represent 12 minutes of audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. These video recordings were chosen because they are part of the NSF-funded Talkbank project.

The video format is as follows:

Size	360 x 240 pixels
Compression	H.261
Data rate	696 K/sec
Video rate	29.82 fps
Audio rate	48.000 kHz
Audio format	8-bit stereo

The gesture annotations were created using the FORM 2.0 tag set. The Anvil annotation files used in their creation, augmented with FORM 1.0 data, are also included. (FORM1 data will be the subject of a separate publication to be released in the near future). FORM1 values that are not included in the FORM2 spec are not included in the publication. A full description of the FORM tag set with explanations of each value can be found in the documentation.

Samples

For an example of the data in this corpus, please view these video (MOV) and annotation (ANVIL) samples.

Sponsorship

This research was conducted using funding from the following grant sources: ISLE - 9910603 NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056 NSF: Discourse and Gesture - EIA98-09209

Updates

There are no updates available at this time.

Note

The cost of the first 50 copies of this publication (not counting the copies distributed to LDC members) is covered by the sponsoring grants, and therefore free of charge to qualified researchers; a $30 shipping and handling fee applies. After these first 50 copies are distributed, additional copies will be available for the production cost of $500.