CALLHOME Japanese Transcripts
| Item Name: | CALLHOME Japanese Transcripts |
| Author(s): | Barbara Wheatley, Masayo Kaneko, Megumi Kobayashi |
| LDC Catalog No.: | LDC96T18 |
| ISBN: | 1-58563-078-0 |
| ISLRN: | 476-552-220-214-8 |
| DOI: | https://doi.org/10.35111/3r13-pw63 |
| Member Year(s): | 1996, 1997 |
| DCMI Type(s): | Text |
| Data Source(s): | telephone conversations |
| Project(s): | Hub5-LVCSR |
| Application(s): | speech recognition |
| Language(s): | Japanese |
| Language ID(s): | jpn |
| License(s): |
LDC User Agreement for Non-Members |
| Online Documentation: | LDC96T18 Documents |
| Licensing Instructions: | Subscription & Standard Members, and Non-Members |
| Citation: | Wheatley, Barbara, Masayo Kaneko, and Megumi Kobayashi. CALLHOME Japanese Transcripts LDC96T18. Web Download. Philadelphia: Linguistic Data Consortium, 1996. |
| Related Works: | View |
Introduction
CALLHOME Japanese Transcripts was developed by the Linguistic Data Consortium (LDC) and contains transcripts corresponding to approximately 49 hours of speech from 120 unscripted telephone conversations between native Japanese speakers.
The CALLHOME series consists of telephone conversations, transcripts and lexicons developed by LDC and Rutgers, The State University of New Jersey, in support of research in speaker identification, language identification and related technologies. Languages in the series include American English, Egyptian Arabic, German, Japanese, Mandarin Chinese, and Spanish.
Data
Transcripts cover a contiguous five-minute or ten-minute call segment and are presented in standard orthography, time-stamped by speaker turn for alignment with the speech signal.
Calls were manually audited for language, recording quality, channel characteristics, dialect, and region. Auditing information and other metadata (information on calls, speakers and demographics on call originators) are included in the documentation accompanying this release.
The corresponding conversational telephone speech dataset (LDC96S37) and an associated lexicon (LDC96L17) are available separately.
Samples
Updates
There are no updates at this time.