Voice Across Hispanic America (VAHA) is a corpus of Spanish telephone speech, recorded digitally from 915 native speakers of Spanish in various parts of the United States. With nearly 39,000 recorded and transcribed utterances, VAHA will be useful for a variety of research studies, but it is intended primarily for speech technology research and development in telecommunications applications. It is patterned after Macrophone (1), an American English corpus (LDC94S21) which is widely used for this purpose.
This corpus was collected by Texas Instruments in Dallas, TX for the Linguistic Data Consortium at the University of Pennsylvania.
Updates There are no updates at this time.