Wide-coverage deep statistical parsing using automatic dependency structure annotation

Cahill, Aoife; Burke, Michael; O'Donovan, Ruth; Riezler, Stefan; van Genabith, Josef; Way, Andy

Cahill, Aoife ORCID: 0000-0002-3519-7726, Burke, Michael, O'Donovan, Ruth, Riezler, Stefan, van Genabith, Josef ORCID: 0000-0003-1322-7944 and Way, Andy ORCID: 0000-0001-5736-5930 (2008) Wide-coverage deep statistical parsing using automatic dependency structure annotation. Computational Linguistics, 34 (1). pp. 81-124.

Abstract
Metadata
Downloads
Documents
Metrics

[+][-]

Abstract

A number of researchers (Lin 1995; Carroll, Briscoe, and Sanfilippo 1998; Carroll et al. 2002; Clark and Hockenmaier 2002; King et al. 2003; Preiss 2003; Kaplan et al. 2004;Miyao and Tsujii 2004) have convincingly argued for the use of dependency (rather than CFG-tree) representations for parser evaluation. Preiss (2003) and Kaplan et al. (2004) conducted a number of experiments comparing “deep” hand-crafted wide-coverage with “shallow” treebank- and machine-learning based parsers at the level of dependencies, using simple and automatic methods to convert tree output generated by the shallow parsers into dependencies. In this article, we revisit the experiments in Preiss (2003) and Kaplan et al. (2004), this time using the sophisticated automatic LFG f-structure annotation methodologies of Cahill et al. (2002b, 2004) and Burke (2006), with surprising results. We compare various PCFG and history-based parsers (based on Collins, 1999; Charniak, 2000; Bikel, 2002) to find a baseline parsing system that fits best into our automatic dependency structure annotation technique. This combined system of syntactic parser and dependency structure annotation is compared to two hand-crafted, deep constraint-based parsers (Carroll and Briscoe 2002; Riezler et al. 2002). We evaluate using dependency-based gold standards (DCU 105, PARC 700, CBS 500 and dependencies for WSJ Section 22) and use the Approximate Randomization Test (Noreen 1989) to test the statistical significance of the results. Our experiments show that machine-learning-based shallow grammars augmented with sophisticated automatic dependency annotation technology outperform hand-crafted, deep, widecoverage constraint grammars. Currently our best system achieves an f-score of 82.73% against the PARC 700 Dependency Bank (King et al. 2003), a statistically significant improvement of 2.18%over the most recent results of 80.55%for the hand-crafted LFG grammar and XLE parsing system of Riezler et al. (2002), and an f-score of 80.23% against the CBS 500 Dependency Bank (Carroll, Briscoe, and Sanfilippo 1998), a statistically significant 3.66% improvement over the 76.57% achieved by the hand-crafted RASP grammar and parsing system of Carroll and Briscoe (2002).

Metadata

Item Type:	Article (Published)
Refereed:	Yes
Uncontrolled Keywords:	Treebank; Parsers; Translation
Subjects:	Computer Science > Machine translating
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher:	Massachusetts Institute of Technology Press
Official URL:	http://dx.doi.org/10.1162/coli.2008.34.1.81
Copyright Information:	© 2008 Massachusetts Institute of Technology Press.
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:	16173
Deposited On:	16 May 2011 13:05 by Shane Harper . Last Modified 25 Jan 2019 11:55

Documents

Full text available as:

[thumbnail of Wide-Coverage_Deep_Statistical_Parsing_using_Automatic_Dependency_Structure_Annotation.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
367kB

Metrics

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Wide-coverage deep statistical parsing using automatic dependency structure annotation

Altmetric Badge

Dimensions Badge

Downloads