Discourse Graphbank

Item Name: Discourse Graphbank
Author(s): Florian Wolf, Edward Gibson, Amy Fisher, Meredith Knight
LDC Catalog No.: LDC2005T08
ISBN: 1-58563-320-8
ISLRN: 983-656-398-539-6
Release Date: March 15, 2005
Member Year(s): 2005
DCMI Type(s): Text
Project(s): GALE, EARS
Application(s): information retrieval, discourse analysis, summarization
Language(s): English
Language ID(s): eng
License(s): LDC User Agreement for Non-Members
Online Documentation: LDC2005T08 Documents
Licensing Instructions: Subscription & Standard Members, and Non-Members
Citation: Wolf, Florian, et al. Discourse Graphbank LDC2005T08. Web Download. Philadelphia: Linguistic Data Consortium, 2005.


As Florian Wolf's Ph.D thesis, the Discourse Treebank aimed to define a descriptively adequate data structure for representing discourse coherence structures. This project also investigated the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g. anaphor resolution,summarization, information retrieval), and developed and tested discourse parsing algorithms.


The data consists of 135 texts from AP Newswire and Wall Street Journal, annotated with coherence relations. The source was UPenn TIPSTER.


A screenshot of the output of the annotator tool has been provided as an example of this corpus.

Available Media

View Fees

Login for the applicable fee