Spoken Digits in Hindi and Indian English, Linguistic Data Consortium (LDC) Catalog Number LDC2022S03 and ISBN 1-58563-986-9, was developed by the Birla Institute of Technology and Science Pilani. It contains approximately two hours of speech comprised of spoken digits from one to ten in Hindi and English with regional accents from across India.
The speech data was collected as follows: in person, on a mobile handset recorder app; via one-to-one online communications over social apps; and from social media sites. Each audio file represents a single spoken digit in either Hindi or Indian English. Background noise was mostly retained. Some data was recorded in a noise-free environment or cleaned after recording to avoid abrupt noises such as car horns.
The audio data is organized by number, language and gender. The gender breakdown for speakers is 17% female, 27% male, and 56% unspecified.
A Google Colab Notebook file which can be used for basic functionalities such as removing noise or unwanted spaces is also included in this release.
All audio data is presented as single channel 16-bit 16kHz flac compressed linear PCM.
Please see file.tbl (TXT) for a complete file list as well as checksums for this publication.
Additional information, updates, bug fixes may be available in the LDC catalog entry for this corpus at LDC2022S03.
Portions © 2022 Basabdatta Sen Bhattacharya, © 2022 Trustees of the University of Pennsylvania