|Author(s):||Julia Hockenmaier, Mark Steedman|
|LDC Catalog No.:||LDC2005T13|
|Release Date:||May 15, 2005|
|Application(s):||natural language processing, information detection, cross-lingual information retrieval, automatic content extraction|
LDC User Agreement for Non-Members
|Online Documentation:||LDC2005T13 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Hockenmaier, Julia, and Mark Steedman. CCGbank LDC2005T13. Web Download. Philadelphia: Linguistic Data Consortium, 2005.|
CCGbank is a translation of the Penn Treebank into a corpus of Combinatory Categorial Grammar derivations. It pairs syntactic derivations with sets of word-word dependencies which approximate the underlying predicate-argument structure.
CCGbank contains 99.44% of the sentences in the Penn Treebank, for which it corrects a number of inconsistencies and errors in the original annotation.
For an example of this corpus, please examine this sample.
The current version, 1.1, is a bug fix that supersedes the old package. It is available for download.