Building a Discourse Parser | In our work, we focused exclusively on the second step of the discourse parsing problem, i.e., constructing the RST tree from a sequence of edus that have been segmented beforehand. |
Building a Discourse Parser | The motivation for leaving aside segmenting were both practical — previous discourse parsing efforts (Soricut and Marcu, 2003; LeThanh et al., 2004) already provide alternatives for standalone segmenting tools — and scientific, namely, the greater need for improvements in labeling. |
Conclusions and Future Work | In this paper, we have shown that it is possible to build an accurate automatic text-level discourse parser based on supervised machine-learning algorithms, using a feature-driven approach and a manually annotated corpus. |
Conclusions and Future Work | A complete online discourse parser , incorporating the parsing tool presented above combined with a new segmenting method has since been made freely available at http: / /nlp . |
Evaluation | To the best of our knowledge, only two fully functional text-level discourse parsing algorithms for general text have published their results: Marcu’s decision-tree-based parser (Marcu, 2000) and the multilevel rule-based system built by LeThanh et al. |
Introduction | The goal of discourse parsing is to extract this high-level, rhetorical structure. |
Introduction | Discourse parsing , on the other hand, focuses on a higher-level view of text, allowing some flexibility in the choice of formal representation while providing a wide range of applications in both analytical and computational linguistics. |
Introduction | Several attempts to automate discourse parsing have been made. |
Abstract | Segmentation is the first step in a discourse parser , a system that constructs discourse trees from elementary discourse units. |
Discussion | Besides its use in automatic discourse parsing , the system could |
Introduction* | Since segmentation is the first stage of discourse parsing , quality discourse segments are critical to building quality discourse representations (Soricut and Marcu, 2003). |
Introduction* | Most parsers can break down a sentence into constituent clauses, approaching the type of output that we need as input to a discourse parser . |
Related Work | Soricut and Marcu (2003) construct a statistical discourse segmenter as part of their sentence-level discourse parser (SPADE), the only implementation available for our comparison. |