Author(s): Yeshwant Muthusamy
LDC Catalog No.: LDC96S41
ISBN: 1-58563-107-8
ISLRN: 759-953-194-215-4
Member Year(s): 1996
DCMI Type(s): Sound
Sample Type: 1-channel ulaw
Sample Rate: 8000
Data Source(s): telephone speech
Application(s): speech recognition
Language(s): Spanish
Language ID(s): spa
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC96S41 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Muthusamy, Yeshwant. VAHA (POLYPHONE II) LDC96S41. Web Download. Philadelphia: Linguistic Data Consortium, 1996.
Related Works: View


Voice Across Hispanic America (VAHA) is a corpus of Spanish telephone speech, recorded digitally from 915 native speakers of Spanish in various parts of the United States. With nearly 39,000 recorded and transcribed utterances, VAHA will be useful for a variety of research studies, but it is intended primarily for speech technology research and development in telecommunications applications. It is patterned after Macrophone (1), an American English corpus (LDC94S21) which is widely used for this purpose.


This corpus was collected by Texas Instruments in Dallas, TX for the Linguistic Data Consortium.


There are no updates at this time.

Available Media

View Fees

Login for the applicable fee