Home › Language Resources › Data

AISHELL-1

Item Name:	AISHELL-1
Author(s):	Hui Bu
LDC Catalog No.:	LDC2018S14
ISBN:	1-58563-866-8
ISLRN:	733-251-884-636-1
DOI:	https://doi.org/10.35111/kvd2-8a82
Release Date:	November 15, 2018
Member Year(s):	2018
DCMI Type(s):	Sound, Text
Sample Type:	pcm
Sample Rate:	44100
Data Source(s):	telephone speech, microphone speech
Application(s):	speech recognition
Language(s):	Mandarin Chinese
Language ID(s):	cmn
License(s):	LDC User Agreement for Non-Members
Online Documentation:	LDC2018S14 Documents
Licensing Instructions:	Subscription & Standard Members, and Non-Members
Citation:	Bu, Hui. AISHELL-1 LDC2018S14. Hard Drive. Philadelphia: Linguistic Data Consortium, 2018.
Related Works: Hide	View relatesTo LDC2019S17 LDC Spoken Language Sampler - Fifth Release

Introduction

AISHELL-1 was developed by Beijing Shell Shell Technology Co., Ltd. It contains approximately 520 hours of Chinese Mandarin speech from 400 speakers recorded simultaneously on three different devices with associated transcripts.

The goal of the collection was to support speech recognition system development in 11 domains, five of which are include in this corpus: Finance, Science & Technology, Sports, Entertainment, and News. Participants read 500 sentences covering the domains; sentences were chosen for their speech and phonetic characteristics.

Speakers were recruited from different accent areas across China, including North, South and Yue-Gui-Min regions. There were 214 female speakers and 186 male speakers, constituting 53% and 47% of the database, respectively. Additional demographic information about the participants is included in this release.

Data

Speech was recorded in a quiet indoor environment on a high fidelity microphone and two mobile phones (Android and iOS). All speech is presented as 16-bit flac compressed wav files; the microphone speech sample rate is 44.1kHz and the phone speech sample rate is 16kHz. Each speech file ranges from approximately 1 second to 14 seconds in length.

Transcripts are stored as UTF-8 encoded plain text files and are not time-aligned.

AISHELL-1

Introduction

Data

Samples

Updates

Copyright

Available Media

View Fees