DORAS | DCU Research Repository

Evaluating automatic F-structure annotation for the Penn-II treebank

Cahill, Aoife (ORCID: 0000-0002-3519-7726), McCarthy, Mairéad, van Genabith, Josef (ORCID: 0000-0003-1322-7944) and Way, Andy (ORCID: 0000-0001-5736-5930) (2002) Evaluating automatic F-structure annotation for the Penn-II treebank. In: TLT 2002 - 1st Workshop on Treebanks and Linguistic Theories, 20-21 September 2002, Sozopol, Bulgaria.

Abstract
Methodologies have been developed (van Genabith et al., 1999a,b; Sadler et al., 2000; Frank, 2000; van Genabith et al., 2001; Frank et al., 2002) for automatically annotating treebank resources with Lexical-Functional Grammar (LFG: Kaplan and Bresnan, 1982) f-structure information. Until recently, however, most of this work on automatic annotation has been applied only to limited data sets, so while it may have shown 'proof of concept', it has not been demonstrated that the techniques developed scale up to much larger data sets (Liakata and Pulman, 2002). More recent work (Cahill et al., 2002a,b) has presented efforts in evolving and scaling techniques established in these previous papers to the full Penn-II Treebank (Marcus et al., 1994). In this paper, we present and assess a number of quantitative and qualitative evaluation methodologies which provide insights into the effectiveness of the techniques developed to derive automatically a set of f-structures for the more than 1,000,000 words and 49,000 sentences of Penn-II.
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Subjects: Computer Science > Machine translating
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL: http://www.bultreebank.org/Proceedings.html
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 15828
Deposited On: 24 Nov 2010 14:08 by Shane Harper. Last Modified 21 Jan 2022 16:37
Documents

Full text available as:

PDF (286kB): Evaluating_Automatic_F-Structure_Annotation_for_the_Penn-II_Treebank.pdf