|Item Name:||Discourse Graphbank|
|Author(s):||Florian Wolf, Edward Gibson, Amy Fisher, Meredith Knight|
|LDC Catalog No.:||LDC2005T08|
|Release Date:||March 15, 2005|
|Application(s):||information retrieval, discourse analysis, summarization|
|License(s):||LDC User Agreement for Non-Members|
|Online Documentation:||LDC2005T08 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Wolf, Florian, et al. Discourse Graphbank LDC2005T08. Web Download. Philadelphia: Linguistic Data Consortium, 2005.|
The Discourse Graphbank was created as part of Florian Wolf's Ph.D. thesis, which aimed to define a descriptively adequate data structure for representing discourse coherence structures. The project also investigated the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g., anaphor resolution, summarization, information retrieval), and developed and tested discourse parsing algorithms.
The data consists of 135 texts from AP Newswire and the Wall Street Journal, annotated with coherence relations. The source texts come from UPenn TIPSTER.
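To make the idea of coherence relations as a graph concrete, here is a minimal Python sketch. It is not the corpus file format and not the authors' tooling: the `CoherenceGraph` class, its method names, the example sentences, and the relation labels are all hypothetical, chosen only for illustration. It treats discourse segments as nodes and labeled relations as directed edges, and shows one property a graph permits that a tree does not: a segment with more than one incoming relation.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class CoherenceGraph:
    """Directed graph: nodes are discourse segments, edges are labeled relations.

    Hypothetical structure for illustration; not the Graphbank file format.
    """
    segments: dict = field(default_factory=dict)  # segment id -> text
    edges: list = field(default_factory=list)     # (source id, target id, label)

    def add_segment(self, seg_id, text):
        self.segments[seg_id] = text

    def add_relation(self, source, target, label):
        # Both endpoints must already exist as segments.
        if source not in self.segments or target not in self.segments:
            raise KeyError("unknown segment id")
        self.edges.append((source, target, label))

    def incoming(self):
        """Map each segment id to the (source, label) pairs pointing at it."""
        result = defaultdict(list)
        for source, target, label in self.edges:
            result[target].append((source, label))
        return dict(result)

    def multi_parent_segments(self):
        """Segments with more than one incoming relation (not allowed in a tree)."""
        return sorted(t for t, ins in self.incoming().items() if len(ins) > 1)


# Invented example text and labels, not taken from the corpus.
graph = CoherenceGraph()
graph.add_segment(0, "The storm closed the airport.")
graph.add_segment(1, "Flights were cancelled,")
graph.add_segment(2, "officials said.")
graph.add_relation(0, 1, "cause-effect")
graph.add_relation(2, 1, "attribution")

print(graph.multi_parent_segments())
```

Storing the relations as a flat edge list keeps the sketch independent of any particular annotation scheme; the actual annotation conventions are described in the corpus documentation.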
A screenshot of the output of the annotator tool has been provided as an example of this corpus.