Top Ten LDC Corpora
| LDC2013T19 | OntoNotes Release 5.0 |
| LDC93S1 | TIMIT Acoustic-Phonetic Continuous Speech Corpus |
| LDC2006T13 | Web 1T 5-gram Version 1 |
| LDC96L14 | CELEX2 |
| LDC99T42 | Treebank-3 |
| LDC93S10 | TIDIGITS |
| LDC97S62 | Switchboard-1 Release 2 |
| LDC93S6A | CSR-I (WSJ0) Complete |
| LDC2018T24 | TAC Relation Extraction Dataset |
| LDC2006T06 | ACE 2005 Multilingual Training Corpus |