Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Treebank-based acquisition of wide-coverage, probabilistic LFG resources: project overview, results and evaluation

Burke, Michael, Cahill, Aoife orcid logoORCID: 0000-0002-3519-7726, O'Donovan, Ruth, van Genabith, Josef and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2004) Treebank-based acquisition of wide-coverage, probabilistic LFG resources: project overview, results and evaluation. In: IJCNLP-04 Workshop - The First International Joint Conference on Natural Language Processing, 21 March 2004, Sanya City, Hainan Island, China.

Abstract
This paper presents an overview of a project to acquire wide-coverage, probabilistic Lexical-Functional Grammar (LFG) resources from treebanks. Our approach is based on an automatic annotation algorithm that annotates “raw” treebank trees with LFG f-structure information approximating to basic predicate-argument/dependency structure. From the f-structure-annotated treebank we extract probabilistic unification grammar resources. We present the annotation algorithm, the extraction of lexical information and the acquisition of wide-coverage and robust PCFG-based LFG approximations including long-distance dependency resolution. We show how the methodology can be applied to multilingual, treebank-based unification grammar acquisition. Finally we show how simple (quasi-)logical forms can be derived automatically from the f-structures generated for the treebank trees.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Uncontrolled Keywords:lexical-functional grammar;
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Institutes and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:http://www-tsujii.is.s.u-tokyo.ac.jp/bsa/
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Enterprise Ireland, EI SC/2001/186, Irish Research Council for Science Engineering and Technology
ID Code:15302
Deposited On:15 Mar 2010 11:11 by DORAS Administrator . Last Modified 25 Jan 2019 11:41
Documents

Full text available as:

[thumbnail of burke.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
89kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record