Nationwide Speech Project


Item Name: Nationwide Speech Project
Authors: Cynthia G. Clopper and David B. Pisoni
LDC Catalog No.: LDC2007S15
ISBN: 1-58563-449-2
Release Date: Sep 17, 2007
Data Type: speech
Sample Rate: 44100 Hz
Sampling Format: pcm
Data Source(s): microphone speech
Application(s): language modeling, linguistic analysis, natural language processing, sociolinguistics, speech recognition
Language(s): English
Language ID(s): eng
Distribution: 1 DVD
Member fee: $0 for 2007 members
Non-member Fee: US $1500.00
Reduced-License Fee: US $750.00
Extra-Copy Fee: US $200.00
Non-member License: yes
Member License: yes
Online documentation: yes
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Cynthia G. Clopper and David B. Pisoni
2007
Nationwide Speech Project
Linguistic Data Consortium, Philadelphia

Introduction

This corpus represents part of the work of the Nationwide Speech Project (NSP) conducted by the authors at Indiana University. The purpose of the NSP was to collect a large amount of speech produced by male and female talkers representing the primary regional varieties of American English: New England, Mid-Atlantic, North, Midland, South and West. This release contains approximately 60 hours of speech or nearly one hour of speech from each of 60 white American English speakers --including five male and five female talkers from the six dialect regions -- reading words and sentences. The corpus can be used for perceptual and acoustic experiments designed to explore the role of variation in spoken language processing. Such applications include speech science experiments and sociolinguistic or sociophonetic research.

Data

The speakers were recruited from the Indiana University community; they were all 18-25 years old at the time of recording, had lived exclusively in one region prior to age 18, and both parents of each speaker were also raised in the same region. Further demographic information about the speakers is provided in the file talkers.txt. The materials include 102 high predictability sentences and five repetitions of each of 10 hVd words. The high predictability sentences are 5-8 words in length and the final word in each sentence is highly predictable based on the preceding semantic context. The 10 hVd words are: heed, hid, hayed, head, had, hod, hud, hoes, hood and who'd.

Participants were recorded one at a time by an experimenter in a sound attenuated booth (IAC Audiometric Testing Room, Model 402). Both the experimenter and the participant sat in the sound booth during testing. During the recording session, the participant was seated in front of a ViewSonic LCD flatscreen monitor (ViewPanel VG151) which mirrored the screen of a Macintosh Powerbook G3 laptop. The participant wore a Shure head-mounted microphone (SM10A) that was positioned approximately one inch from the left corner of the talker's mouth. The microphone output was fed to an Applied Research Technology microphone tube pre-amplifier. The output gain on the pre-amplifier was adjusted by the experimenter while the participant read the Grandfather Passage as a warm-up before recording began. The output of the microphone pre-amplifier was connected to a Roland UA-30 USB audio interface which digitized the signal and transmitted it via USB ports to the laptop where each utterance was recorded in an individual AIFF 16-bit digital sound file at a sampling rate of 44.1 kHz (converted to .wav format files for this release) The experimenter held the laptop on her lap and wore headphones connected to the Roland device so that she could hear the same audio signal that inputted into the laptop for recording.

Samples

Content Copyright

Portions 2003 Indiana University Research and Technology Corporation, 2007 Trustees of the University of Pennsylvania